Remi Cadene | d782b029e1 | Add aloha dataset | 2024-03-06 10:26:32 +00:00
Remi | 49c0955f97 | Merge pull request #7 from Cadene/user/rcadene/2024_03_05_abstract_replay_buffer: Add AbstractReplayBuffer | 2024-03-06 11:25:24 +01:00
Remi Cadene | eed24b083a | small fix | 2024-03-06 10:21:22 +00:00
Remi Cadene | f95ecd66fc | Improve visualize_dataset, Improve AbstractReplayBuffer, Small improvements | 2024-03-06 10:15:57 +00:00
Simon Alibert | a6d353c419 | Fix | 2024-03-05 17:00:17 +01:00
Remi Cadene | 2f80d71c3e | Remove noqa-F821 | 2024-03-05 10:22:21 +00:00
Remi Cadene | d4e0849970 | Refactor datasets with abstract class | 2024-03-05 10:20:57 +00:00
Remi Cadene | a027f4edfb | Add cfg.offline_prioritized_sampler | 2024-03-04 23:08:52 +00:00
Remi | e990f3e148 | Merge pull request #6 from Cadene/user/rcadene/2024_03_04_diffusion: Make diffusion work | 2024-03-04 18:30:40 +01:00
Remi Cadene | e29fbb50e8 | Fix grad_clip_norm 0 -> 10, Fix normalization min_max to be per channel | 2024-03-04 17:26:34 +00:00
Remi Cadene | cfc304e870 | Refactor env queue, Training diffusion works (Still not converging) | 2024-03-04 11:00:51 +00:00
Remi Cadene | fddd9f0311 | Add possibility for the policy to provide a sequence of actions to the env | 2024-03-03 14:02:24 +00:00
Remi Cadene | 0f2fa4d9ef | Add obs queue to pusht, Set n_obs_steps=2 for diffusion (Not fully tested) | 2024-03-03 13:21:31 +00:00
Remi Cadene | cbbed590a9 | Add mode to NormalizeTransform with mean_std or min_max (Not fully tested) | 2024-03-03 13:19:02 +00:00
Simon Alibert | b33ec5a630 | Add run on cpu-only compatibility | 2024-03-03 12:47:26 +01:00
Remi Cadene | 48ded3dbc7 | fix | 2024-03-02 18:11:50 +00:00
Remi Cadene | 80785f8d0e | Small fix, Refactor diffusion, Diffusion runs (TODO: remove normalization in diffusion) | 2024-03-02 17:04:39 +00:00
Remi Cadene | 45b4ecb727 | pre-commit run -a | 2024-03-02 15:58:21 +00:00
Remi Cadene | 1ae6205269 | Add Normalize, non_blocking=True in tdmpc, tdmpc run (TODO: diffusion) | 2024-03-02 15:53:29 +00:00
Remi Cadene | b5a2f460ea | fix bus error | 2024-03-01 14:22:05 +00:00
Simon Alibert | c1942d45d3 | Fixes for PR #4 | 2024-03-01 14:59:05 +01:00
Simon Alibert | b862145e22 | Added pusht dataset auto-download | 2024-03-01 14:31:54 +01:00
Cadene | ca948c1e5b | fix zip strict=False | 2024-03-01 00:45:23 +00:00
Cadene | ae050d2e94 | Solve conflicts + pre-commit run -a | 2024-02-29 23:31:32 +00:00
Cadene | 0b9027f05e | Clean logging, Refactor | 2024-02-29 23:21:27 +00:00
Simon Alibert | 2c05b75f45 | Fixes for PR #3 | 2024-02-29 21:46:41 +01:00
Simon Alibert | 7e024fdce6 | Ran pre-commit run --all-files | 2024-02-29 13:37:48 +01:00
Cadene | ac90b9c3ee | Fix diffusion (rm transpose), Add prefetch | 2024-02-28 17:45:01 +00:00
Cadene | cf5063e50e | Add diffusion policy (train and eval works, TODO: reproduce results) | 2024-02-28 15:21:42 +00:00
Simon Alibert | 98f8869743 | WIP | 2024-02-28 10:59:06 +01:00
Cadene | 21670dce90 | Refactor train, eval_policy, logger, Add diffusion.yaml (WIP) | 2024-02-26 01:10:09 +00:00
Cadene | b16c334825 | Refactor configs to have env in seperate yaml + Fix training | 2024-02-25 17:42:47 +00:00
Cadene | ed80db2846 | Sanitize cfg.env | 2024-02-25 12:02:29 +00:00
Cadene | 0eb9b5d1a5 | Sanitize cfg.wandb | 2024-02-25 11:15:09 +00:00
Cadene | e765e26b0b | Sanitize cfg.policy, Fix skip_frame pusht.yaml | 2024-02-25 11:09:02 +00:00
Cadene | 598bb496b0 | Add policies/factory, Add test, Add _self_ in config | 2024-02-25 10:50:23 +00:00
Cadene | 64b5920e94 | format | 2024-02-24 18:19:18 +00:00
Cadene | aed02dc7c6 | Add multithreading for video generation, Speed policy sampling | 2024-02-24 18:18:39 +00:00
Cadene | 591985c67d | Fix done in pusht, Fix --time in sbatch | 2024-02-22 17:58:26 +00:00
Cadene | 63d18475cc | fix simxarm factory | 2024-02-22 13:04:24 +00:00
Cadene | 96c53ad06f | remove comments | 2024-02-22 12:15:14 +00:00
Cadene | e3643d6146 | Wandb works, One output dir | 2024-02-22 12:14:12 +00:00
Cadene | ece89730e6 | Add pusht dataset (TODO verify reward is aligned), Refactor visualize_dataset, Add video_dir, fps, state_dim, action_dim to config (Training works) | 2024-02-21 00:49:40 +00:00
Cadene | 3dc14b5576 | Add Prod transform, Add test_factory | 2024-02-20 14:22:16 +00:00
Cadene | 3da6ffb2cb | Fix unit tests, Refactor, Add pusht env, (TODO pusht replay buffer, image preprocessing) | 2024-02-20 12:26:57 +00:00
Cadene | fdfb2010fd | black | 2024-02-18 01:24:19 +00:00
Cadene | a5c305a7a4 | offline training + online finetuning converge to 33 reward! | 2024-02-18 01:23:44 +00:00
Cadene | c202c2b3c2 | Online finetuning runs (sometimes crash because of nans) | 2024-02-16 15:13:24 +00:00
Cadene | 228c045674 | Eval reproduced! Train running (but not reproduced) | 2024-02-10 15:46:24 +00:00
Cadene | 5a5b190f70 | Add common, refactor eval with eval_policy | 2024-01-31 13:48:12 +00:00