Commit Graph

288 Commits

Author SHA1 Message Date
Simon Alibert f5e76393eb
Release cleanup (#132)
Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>
Co-authored-by: Cadene <re.cadene@gmail.com>
2024-05-06 03:03:14 +02:00
Alexander Soare 6e6feae09e
Remove additional frame from eval render (#137) 2024-05-05 13:49:53 +02:00
Alexander Soare f6fe162432
Rename Diffusion Policy model repository (#136) 2024-05-05 12:06:58 +01:00
Alexander Soare f3bba0270d
Remove EMA model from Diffusion Policy (#134) 2024-05-05 11:26:12 +01:00
Alexander Soare d747195c57
Fix loops in eval.py (#128) 2024-05-04 15:21:10 +01:00
Simon Alibert c77633c38c
Add regression tests (#119)
- Add `tests/scripts/save_policy_to_safetensor.py` to generate test artifacts
- Add `test_backward_compatibility to test generated outputs from the policies against artifacts
2024-05-04 16:20:30 +02:00
Remi 19812ca470
Add dataset visualization with rerun.io (#131)
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
2024-05-04 16:07:14 +02:00
Alexander Soare bccee745c3
Refactor eval.py (#127) 2024-05-03 17:33:16 +01:00
Remi b2cda12f87
Add video decoding to LeRobotDataset (#92) 2024-05-03 00:50:19 +02:00
Alexander Soare d1855a202a
Refactor TD-MPC (#103)
Co-authored-by: Cadene <re.cadene@gmail.com>
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
2024-05-01 16:40:04 +01:00
Alexander Soare a4891095e4
Use PytorchModelHubMixin to save models as safetensors (#125)
Co-authored-by: Remi <re.cadene@gmail.com>
2024-05-01 16:17:18 +01:00
Alexander Soare 9d60dce6f3
Tidy up yaml configs (#121) 2024-04-30 16:08:59 +01:00
Remi e4e739f4f8
Refactor push_dataset_to_hub (#118) 2024-04-30 14:25:41 +02:00
Simon Alibert 2765877f28
Speed up CI, add more checks (#115)
- Split pytest and end-to-end tests into separate jobs
- Add poetry check to ensure pyproject.toml and poetry.lock are in sync
- Add ruff format --diff to ensure style formatting is applied (fails if ruff would reformat anything)
2024-04-29 23:05:55 +02:00
Quentin Gallouédec 508bd92d03
Remove `update` method from the policy (#99)
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-04-29 12:27:58 +02:00
Simon Alibert 791506dfb8
Remove warnings (#111)
- Replace `use_pretrained_backbone` with `pretrained_backbone_weights`
- Bump diffusers' minimum version `0.26.3` -> `0.27.2`
- Add ignore flags in CI's pytest
- Change Box observation spaces in simulation environments
- Set `version_base="1.2"` in Hydra initializations
- Bump einops' minimum version `0.7.0` -> `0.8.0`
2024-04-29 00:31:33 +02:00
Adil Zouitine 55dc9f7f51
Refactor the download and publication of the datasets and convert it into CLI script (#95)
Co-authored-by: Remi <re.cadene@gmail.com>
2024-04-29 00:08:17 +02:00
Alexander Soare 45f351c618
Make sure targets are normalized too (#106) 2024-04-26 11:18:39 +01:00
Remi e760e4cd63
Move normalization to policy for act and diffusion (#90)
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-04-25 11:47:38 +02:00
Remi c1bcf857c5
Fix online training (#94) 2024-04-23 18:54:55 +02:00
Remi 1030ea0070
Loads episode_data_index and stats during dataset __init__ (#85)
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-04-23 14:13:25 +02:00
Alexander Soare e2168163cd
Quality of life patches for eval.py (#86) 2024-04-19 12:33:47 +01:00
Simon Alibert 7ad1909641
Tests cleaning & simplification (#81) 2024-04-18 14:47:42 +02:00
Remi 0928afd37d
Improve dataset examples (#82)
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-04-18 11:43:16 +02:00
Alexander Soare bff4b673c9 Merge remote-tracking branch 'upstream/main' into unify_policy_api 2024-04-17 08:08:57 +01:00
Cadene 91badebdfc fix tests 2024-04-16 17:29:31 +00:00
Cadene 36d9e885ef Address comments 2024-04-16 17:20:54 +00:00
Cadene b241ea46dd move download_and_upload_dataset.py to root_dir 2024-04-16 17:20:53 +00:00
Cadene e09d25267e fix online training 2024-04-16 17:20:53 +00:00
Cadene 4a3eac4743 fix unit tests, stats was missing, visualize_dataset was broken 2024-04-16 17:20:53 +00:00
Cadene 0980fff6cc HF datasets works 2024-04-16 17:19:40 +00:00
Cadene c7a8218620 typo 2024-04-16 17:19:39 +00:00
Cadene 67d79732f9 Add download_and_upload_dataset.py in script, update all datasets, update online training 2024-04-16 17:19:39 +00:00
Alexander Soare 23be5e1e7b backup wip 2024-04-16 16:31:44 +01:00
Alexander Soare 9241b5e830 pass step as kwarg 2024-04-15 09:52:54 +01:00
Alexander Soare 6d0a45a97d ready for review 2024-04-12 11:36:52 +01:00
Alexander Soare 94cc22da9e Merge remote-tracking branch 'upstream/main' into refactor_dp 2024-04-11 17:52:10 +01:00
Alexander Soare 976a197f98 backup wip 2024-04-11 17:51:35 +01:00
Cadene 8e5b4365ac fix 2024-04-11 13:57:22 +00:00
Cadene 949f4d1a5b remove comment 2024-04-10 17:21:36 +00:00
Remi 4c3d8b061e
Update lerobot/scripts/eval.py
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
2024-04-10 18:07:27 +02:00
Cadene 8866b22db1 remove policy is None eval end-to-end tests 2024-04-10 15:09:04 +00:00
Cadene 693f620df0 drop_last=False 2024-04-10 14:59:54 +00:00
Cadene 48ec479660 fix end-to-end aloha 2024-04-10 14:26:30 +00:00
Cadene a18bcb39a7 cfg.env.fps 2024-04-10 14:02:11 +00:00
Cadene c08003278e test_examples are passing 2024-04-10 13:45:45 +00:00
Cadene 7c8eb7ff19 Merge remote-tracking branch 'origin/user/rcadene/2024_03_31_remove_torchrl' into user/rcadene/2024_03_31_remove_torchrl 2024-04-10 11:34:51 +00:00
Cadene 06573d7f67 online training works (loss goes down), remove repeat_action, eval_policy outputs episodes data, eval_policy uses max_episodes_rendered 2024-04-10 11:34:01 +00:00
Alexander Soare e6c6c2367f Merge remote-tracking branch 'upstream/user/rcadene/2024_03_31_remove_torchrl' into refactor_act 2024-04-09 08:36:28 +01:00
Cadene 253e495df2 remove render(mode=visualization) 2024-04-09 03:46:05 +00:00
Alexander Soare 91e0e4e175 rever change 2024-04-08 15:05:40 +01:00
Alexander Soare 86365adf9f revision 2024-04-08 14:44:46 +01:00
Alexander Soare 863f28ffd8 ready for review 2024-04-08 13:10:19 +01:00
Alexander Soare 1bab4a1dd5 Eval reproduction works with gym_aloha 2024-04-08 10:23:26 +01:00
Alexander Soare e982c732f1 Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl 2024-04-08 09:25:45 +01:00
Alexander Soare 8d2463f45b backup wip 2024-04-05 18:46:30 +01:00
Alexander Soare ab2286025b Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl 2024-04-05 18:06:00 +01:00
Alexander Soare 1e71196fe3 backup wip 2024-04-05 17:38:29 +01:00
Cadene 26602269cd test_envs.py are passing, remove simxarm and pusht directories 2024-04-05 16:21:07 +00:00
Alexander Soare 4863e54ce9 Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl 2024-04-05 12:00:31 +01:00
Cadene ad3379a73a fix memory leak due to itertools.cycle 2024-04-05 10:59:32 +00:00
Alexander Soare 9d77f5773d Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl 2024-04-05 11:41:11 +01:00
Cadene 5af00d0c1e fix train.py, stats, eval.py (training is running) 2024-04-05 09:31:39 +00:00
Cadene 1cdfbc8b52 WIP
WIP

WIP train.py works, loss going down

WIP eval.py

Fix

WIP (eval running, TODO: verify results reproduced)

Eval works! (testing reproducibility)

WIP

pretrained model pusht reproduces same results as torchrl

pretrained model pusht reproduces same results as torchrl

Remove AbstractPolicy, Move all queues in select_action

WIP test_datasets passed (TODO: re-enable NormalizeTransform)
2024-04-04 15:31:03 +00:00
Alexander Soare 2b928eedd4 backup wip 2024-04-02 19:11:53 +01:00
Alexander Soare b7c9c33072 revision 2024-03-27 18:33:48 +00:00
Alexander Soare 6cd671040f fix revision 2024-03-27 13:22:14 +00:00
Alexander Soare 1ed0110900 finish examples 2 and 3 2024-03-26 16:13:40 +00:00
Cadene 4a8c5e238e issue with cat_and_write_video 2024-03-26 10:12:16 +00:00
Alexander Soare 1a1308d62f fix environment seeding
add fixes for reproducibility

only try to start env if it is closed

revision

fix normalization and data type

Improve README

Improve README

Tests are passing, Eval pretrained model works, Add gif

Update gif

Update gif

Update gif

Update gif

Update README

Update README

update minor

Update README.md

Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>

Update README.md

Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>

Address suggestions

Update thumbnail + stats

Update thumbnail + stats

Update README.md

Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>

Add more comments

Add test_examples.py
2024-03-26 10:10:43 +00:00
Simon Alibert c5635b7d94 Minor fixes for #47 2024-03-25 18:50:47 +01:00
Simon Alibert 7cdd6d2450 Renamed set_seed -> set_global_seed 2024-03-25 17:19:28 +01:00
Cadene be6364f109 fix, it's training now! 2024-03-25 12:28:07 +01:00
Alexander Soare 4a48b77540 fix normalization and data type 2024-03-25 09:44:03 +00:00
Alexander Soare bd40ffc53c revision 2024-03-22 15:43:45 +00:00
Alexander Soare d43fa600a0 only try to start env if it is closed 2024-03-22 15:32:55 +00:00
Alexander Soare e698d38a35 Merge remote-tracking branch 'upstream/main' into fix_environment_seeding 2024-03-22 15:11:15 +00:00
Alexander Soare 15ff3b3af8 add fixes for reproducibility 2024-03-22 15:06:57 +00:00
Alexander Soare 115927d0f6 make sure to pass stats.pth arg 2024-03-22 12:58:59 +00:00
Alexander Soare 529f42643d revision 2024-03-22 12:33:25 +00:00
Alexander Soare 8720c568d0 Add ability to eval hub model 2024-03-22 10:26:55 +00:00
Alexander Soare 41912b962b remove TODO 2024-03-21 13:51:26 +00:00
Alexander Soare 72d3c3120b Merge remote-tracking branch 'upstream/main' into fix_pusht_diffusion 2024-03-21 10:20:52 +00:00
Alexander Soare acf1174447 ready for review 2024-03-21 10:18:50 +00:00
Simon Alibert 4631d36c05 Add get_safe_torch_device in policies 2024-03-20 18:38:55 +01:00
Alexander Soare d323993569 backup wip 2024-03-20 15:01:27 +00:00
Alexander Soare 4b7ec81dde remove abstracmethods, fix online training 2024-03-20 14:49:41 +00:00
Alexander Soare 5332766a82 revision 2024-03-20 09:45:45 +00:00
Alexander Soare b1ec3da035 remove internal rendering hooks 2024-03-20 09:23:23 +00:00
Alexander Soare 52e149fbfd Only save video frames in first rollout 2024-03-20 08:32:11 +00:00
Alexander Soare 4f1955edfd Clear action queue when environment is reset 2024-03-20 08:31:06 +00:00
Alexander Soare b54cdc9a0f break_when_any_done==True for batch_size==1 2024-03-19 19:08:25 +00:00
Alexander Soare 896a11f60e backup wip 2024-03-19 18:50:04 +00:00
Alexander Soare ea17f4ce50 backup wip 2024-03-19 16:02:09 +00:00
Alexander Soare 88347965c2 revert dp changes, make act and tdmpc batch friendly 2024-03-18 19:18:21 +00:00
Alexander Soare 09ddd9bf92 Merge branch 'main' into user/alexander-soare/multistep_policy_and_serial_env 2024-03-18 18:27:50 +00:00
Alexander Soare 8e346b379d switch between train and eval 2024-03-18 09:45:17 +00:00
Alexander Soare ba91976944 wip: still needs batch logic for act and tdmp 2024-03-14 15:24:10 +00:00
Remi Cadene 9d002032d1 Add Aloha env and ACT policy
WIP Aloha env tests pass

Rendering works (fps look fast tho? TODO action bounding is too wide [-1,1])

Update README

Copy past from act repo

Remove download.py add a WIP for Simxarm

Remove download.py add a WIP for Simxarm

Add act yaml (TODO: try train.py)

Training can runs (TODO: eval)

Add tasks without end_effector that are compatible with dataset, Eval can run (TODO: training and pretrained model)

Add AbstractEnv, Refactor AlohaEnv, Add rendering_hook in env, Minor modifications, (TODO: Refactor Pusht and Simxarm)

poetry lock

fix bug in compute_stats for action normalization

fix more bugs in normalization

fix training

fix import

PushtEnv inheriates AbstractEnv, Improve factory Normalization

Add _make_env to EnvAbstract

Add call_rendering_hooks to pusht env

SimxarmEnv inherites from AbstractEnv (NOT TESTED)

Add aloha tests artifacts + update pusht stats

fix image normalization: before env was in [0,1] but dataset in [0,255], and now both in [0,255]

Small fix on simxarm

Add next to obs

Add top camera to Aloha env (TODO: make it compatible with set of cameras)

Add top camera to Aloha env (TODO: make it compatible with set of cameras)
2024-03-12 10:27:48 +00:00
Cadene 816b2e9d63 fix more bugs in normalization 2024-03-11 11:03:51 +00:00
Remi Cadene f95ecd66fc Improve visualize_dataset, Improve AbstractReplayBuffer, Small improvements 2024-03-06 10:15:57 +00:00
Remi Cadene 2bcf2631b9 minor comment 2024-03-04 22:34:44 +00:00
Remi e990f3e148
Merge pull request #6 from Cadene/user/rcadene/2024_03_04_diffusion
Make diffusion work
2024-03-04 18:30:40 +01:00
Remi Cadene cfc304e870 Refactor env queue, Training diffusion works (Still not converging) 2024-03-04 11:00:51 +00:00
Remi Cadene 4c400b41a5 Improve log msg in train.py 2024-03-03 13:22:09 +00:00
Remi Cadene cbbed590a9 Add mode to NormalizeTransform with mean_std or min_max (Not fully tested) 2024-03-03 13:19:02 +00:00
Simon Alibert b859e89936 Fix for PR #5 2024-03-03 13:05:21 +01:00
Simon Alibert b33ec5a630 Add run on cpu-only compatibility 2024-03-03 12:47:26 +01:00
Remi Cadene 80785f8d0e Small fix, Refactor diffusion, Diffusion runs (TODO: remove normalization in diffusion) 2024-03-02 17:04:39 +00:00
Remi Cadene 45b4ecb727 pre-commit run -a 2024-03-02 15:58:21 +00:00
Remi Cadene 1ae6205269 Add Normalize, non_blocking=True in tdmpc, tdmpc run (TODO: diffusion) 2024-03-02 15:53:29 +00:00
Cadene ae050d2e94 Solve conflicts + pre-commit run -a 2024-02-29 23:31:32 +00:00
Cadene 0b9027f05e Clean logging, Refactor 2024-02-29 23:21:27 +00:00
Simon Alibert 7e024fdce6 Ran pre-commit run --all-files 2024-02-29 13:37:48 +01:00
Cadene 5798979e9c comment 2024-02-28 18:31:54 +00:00
Cadene cf5063e50e Add diffusion policy (train and eval works, TODO: reproduce results) 2024-02-28 15:21:42 +00:00
Simon Alibert f1708c8a37 install fix 2024-02-28 12:35:49 +01:00
Cadene e543c9a42c small fix % 2024-02-27 11:54:31 +00:00
Cadene 7df542445c Small fix and improve logging message 2024-02-27 11:44:26 +00:00
Cadene 21670dce90 Refactor train, eval_policy, logger, Add diffusion.yaml (WIP) 2024-02-26 01:10:09 +00:00
Cadene b16c334825 Refactor configs to have env in seperate yaml + Fix training 2024-02-25 17:42:47 +00:00
Cadene ed80db2846 Sanitize cfg.env 2024-02-25 12:02:29 +00:00
Cadene 598bb496b0 Add policies/factory, Add test, Add _self_ in config 2024-02-25 10:50:23 +00:00
Cadene 64b5920e94 format 2024-02-24 18:19:18 +00:00
Cadene aed02dc7c6 Add multithreading for video generation, Speed policy sampling 2024-02-24 18:18:39 +00:00
Cadene 63d18475cc fix simxarm factory 2024-02-22 13:04:24 +00:00
Cadene e3643d6146 Wandb works, One output dir 2024-02-22 12:14:12 +00:00
Cadene ece89730e6 Add pusht dataset (TODO verify reward is aligned), Refactor visualize_dataset, Add video_dir, fps, state_dim, action_dim to config (Training works) 2024-02-21 00:49:40 +00:00
Cadene 3dc14b5576 Add Prod transform, Add test_factory 2024-02-20 14:22:16 +00:00
Cadene 3da6ffb2cb Fix unit tests, Refactor, Add pusht env, (TODO pusht replay buffer, image preprocessing) 2024-02-20 12:26:57 +00:00
Cadene a5c305a7a4 offline training + online finetuning converge to 33 reward! 2024-02-18 01:23:44 +00:00
Cadene 0b4084f0f8 Clean + alpha beta corresponds to config (before 0.7 and 0.9) 2024-02-16 16:27:54 +00:00
Cadene c202c2b3c2 Online finetuning runs (sometimes crash because of nans) 2024-02-16 15:13:24 +00:00
Cadene 228c045674 Eval reproduced! Train running (but not reproduced) 2024-02-10 15:46:24 +00:00
Cadene 937b2f8cba Add option for random policy 2024-01-31 13:54:32 +00:00
Cadene 5a5b190f70 Add common, refactor eval with eval_policy 2024-01-31 13:48:12 +00:00
Cadene 1e52499490 eval.mp4 works! 2024-01-30 23:30:14 +00:00
Cadene 1144819c29 First real commit, simxarm env added with torchrl! 2024-01-29 12:49:30 +00:00