Commit Graph

300 Commits

Author SHA1 Message Date
Remi Cadene d782b029e1 Add aloha dataset 2024-03-06 10:26:32 +00:00
Remi 49c0955f97
Merge pull request #7 from Cadene/user/rcadene/2024_03_05_abstract_replay_buffer
Add AbstractReplayBuffer
2024-03-06 11:25:24 +01:00
Remi Cadene eed24b083a small fix 2024-03-06 10:21:22 +00:00
Remi Cadene f95ecd66fc Improve visualize_dataset, Improve AbstractReplayBuffer, Small improvements 2024-03-06 10:15:57 +00:00
Simon Alibert a6d353c419 Fix 2024-03-05 17:00:17 +01:00
Remi Cadene 2f80d71c3e Remove noqa-F821 2024-03-05 10:22:21 +00:00
Remi Cadene d4e0849970 Refactor datasets with abstract class 2024-03-05 10:20:57 +00:00
Remi Cadene a027f4edfb Add cfg.offline_prioritized_sampler 2024-03-04 23:08:52 +00:00
Remi e990f3e148
Merge pull request #6 from Cadene/user/rcadene/2024_03_04_diffusion
Make diffusion work
2024-03-04 18:30:40 +01:00
Remi Cadene e29fbb50e8 Fix grad_clip_norm 0 -> 10, Fix normalization min_max to be per channel 2024-03-04 17:26:34 +00:00
Remi Cadene cfc304e870 Refactor env queue, Training diffusion works (Still not converging) 2024-03-04 11:00:51 +00:00
Remi Cadene fddd9f0311 Add possibility for the policy to provide a sequence of actions to the env 2024-03-03 14:02:24 +00:00
Remi Cadene 0f2fa4d9ef Add obs queue to pusht, Set n_obs_steps=2 for diffusion (Not fully tested) 2024-03-03 13:21:31 +00:00
Remi Cadene cbbed590a9 Add mode to NormalizeTransform with mean_std or min_max (Not fully tested) 2024-03-03 13:19:02 +00:00
Simon Alibert b33ec5a630 Add run on cpu-only compatibility 2024-03-03 12:47:26 +01:00
Remi Cadene 48ded3dbc7 fix 2024-03-02 18:11:50 +00:00
Remi Cadene 80785f8d0e Small fix, Refactor diffusion, Diffusion runs (TODO: remove normalization in diffusion) 2024-03-02 17:04:39 +00:00
Remi Cadene 45b4ecb727 pre-commit run -a 2024-03-02 15:58:21 +00:00
Remi Cadene 1ae6205269 Add Normalize, non_blocking=True in tdmpc, tdmpc run (TODO: diffusion) 2024-03-02 15:53:29 +00:00
Remi Cadene b5a2f460ea fix bus error 2024-03-01 14:22:05 +00:00
Simon Alibert c1942d45d3 Fixes for PR #4 2024-03-01 14:59:05 +01:00
Simon Alibert b862145e22 Added pusht dataset auto-download 2024-03-01 14:31:54 +01:00
Cadene ca948c1e5b fix zip strict=False 2024-03-01 00:45:23 +00:00
Cadene ae050d2e94 Solve conflicts + pre-commit run -a 2024-02-29 23:31:32 +00:00
Cadene 0b9027f05e Clean logging, Refactor 2024-02-29 23:21:27 +00:00
Simon Alibert 2c05b75f45 Fixes for PR #3 2024-02-29 21:46:41 +01:00
Simon Alibert 7e024fdce6 Ran pre-commit run --all-files 2024-02-29 13:37:48 +01:00
Cadene ac90b9c3ee Fix diffusion (rm transpose), Add prefetch 2024-02-28 17:45:01 +00:00
Cadene cf5063e50e Add diffusion policy (train and eval works, TODO: reproduce results) 2024-02-28 15:21:42 +00:00
Simon Alibert 98f8869743 WIP 2024-02-28 10:59:06 +01:00
Cadene 21670dce90 Refactor train, eval_policy, logger, Add diffusion.yaml (WIP) 2024-02-26 01:10:09 +00:00
Cadene b16c334825 Refactor configs to have env in seperate yaml + Fix training 2024-02-25 17:42:47 +00:00
Cadene ed80db2846 Sanitize cfg.env 2024-02-25 12:02:29 +00:00
Cadene 0eb9b5d1a5 Sanitize cfg.wandb 2024-02-25 11:15:09 +00:00
Cadene e765e26b0b Sanitize cfg.policy, Fix skip_frame pusht.yaml 2024-02-25 11:09:02 +00:00
Cadene 598bb496b0 Add policies/factory, Add test, Add _self_ in config 2024-02-25 10:50:23 +00:00
Cadene 64b5920e94 format 2024-02-24 18:19:18 +00:00
Cadene aed02dc7c6 Add multithreading for video generation, Speed policy sampling 2024-02-24 18:18:39 +00:00
Cadene 591985c67d Fix done in pusht, Fix --time in sbatch 2024-02-22 17:58:26 +00:00
Cadene 63d18475cc fix simxarm factory 2024-02-22 13:04:24 +00:00
Cadene 96c53ad06f remove comments 2024-02-22 12:15:14 +00:00
Cadene e3643d6146 Wandb works, One output dir 2024-02-22 12:14:12 +00:00
Cadene ece89730e6 Add pusht dataset (TODO verify reward is aligned), Refactor visualize_dataset, Add video_dir, fps, state_dim, action_dim to config (Training works) 2024-02-21 00:49:40 +00:00
Cadene 3dc14b5576 Add Prod transform, Add test_factory 2024-02-20 14:22:16 +00:00
Cadene 3da6ffb2cb Fix unit tests, Refactor, Add pusht env, (TODO pusht replay buffer, image preprocessing) 2024-02-20 12:26:57 +00:00
Cadene fdfb2010fd black 2024-02-18 01:24:19 +00:00
Cadene a5c305a7a4 offline training + online finetuning converge to 33 reward! 2024-02-18 01:23:44 +00:00
Cadene c202c2b3c2 Online finetuning runs (sometimes crash because of nans) 2024-02-16 15:13:24 +00:00
Cadene 228c045674 Eval reproduced! Train running (but not reproduced) 2024-02-10 15:46:24 +00:00
Cadene 5a5b190f70 Add common, refactor eval with eval_policy 2024-01-31 13:48:12 +00:00