Commit Graph

49 Commits

Author SHA1 Message Date
Simon Alibert 3c0a209f9f
Simplify configs (#550)
Co-authored-by: Remi <remi.cadene@huggingface.co>
Co-authored-by: HUANG TZU-CHUN <137322177+tc-huang@users.noreply.github.com>
2025-01-31 13:57:37 +01:00
Simon Alibert 32eb0cec8f
Dataset v2.0 (#461)
Co-authored-by: Remi <remi.cadene@huggingface.co>
2024-11-29 19:04:00 +01:00
Remi 97b1feb0b3
Add policy/act_aloha_real.yaml + env/act_real.yaml (#429)
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
2024-10-10 17:12:45 +02:00
Alexander Soare f8a6574698
Add online training with TD-MPC as proof of concept (#338) 2024-07-25 11:16:38 +01:00
Alexander Soare abbb1d2367
Make sure policies don't mutate the batch (#323) 2024-07-22 20:38:33 +01:00
Alexander Soare c0101f0948
Fix ACT temporal ensembling (#319) 2024-07-16 10:27:21 +01:00
Alexander Soare 342f429f1c
Add test to make sure policy dataclass configs match yaml configs (#292) 2024-06-26 09:09:40 +01:00
Seungjae Lee 7d1542cae1
Add VQ-BeT (#166) 2024-06-26 08:55:02 +01:00
Thomas Wolf 48951662f2
Bug fix: missing attention mask in VAE encoder in ACT policy (#279)
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-06-19 12:07:21 +01:00
Thomas Wolf 11f1cb5dc9
Bug fix: fix setting different learning rates between backbone and main model in ACT policy (#280) 2024-06-18 13:31:35 +01:00
Remi d585c73f9f
Add real-world support for ACT on Aloha/Aloha2 (#228)
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-05-31 15:31:02 +02:00
Remi 01eae09ba6
Fix aloha real-world datasets (#175) 2024-05-20 13:48:09 +02:00
Alexander Soare 68c1b13406
Make policies compatible with other/multiple image keys (#149) 2024-05-16 13:51:53 +01:00
Simon Alibert f52f4f2cd2
Add copyrights (#157) 2024-05-15 12:13:09 +02:00
Alexander Soare e89521dfa0
Enable tests for TD-MPC (#160) 2024-05-09 13:42:12 +01:00
Alexander Soare f3bba0270d
Remove EMA model from Diffusion Policy (#134) 2024-05-05 11:26:12 +01:00
Simon Alibert c77633c38c
Add regression tests (#119)
- Add `tests/scripts/save_policy_to_safetensor.py` to generate test artifacts
- Add `test_backward_compatibility to test generated outputs from the policies against artifacts
2024-05-04 16:20:30 +02:00
Alexander Soare bccee745c3
Refactor eval.py (#127) 2024-05-03 17:33:16 +01:00
Alexander Soare d1855a202a
Refactor TD-MPC (#103)
Co-authored-by: Cadene <re.cadene@gmail.com>
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
2024-05-01 16:40:04 +01:00
Alexander Soare a4891095e4
Use PytorchModelHubMixin to save models as safetensors (#125)
Co-authored-by: Remi <re.cadene@gmail.com>
2024-05-01 16:17:18 +01:00
Alexander Soare 01d5490d44
Follow `transformers` single file naming conventions (#124) 2024-05-01 13:09:42 +01:00
Alexander Soare 9d60dce6f3
Tidy up yaml configs (#121) 2024-04-30 16:08:59 +01:00
Quentin Gallouédec 508bd92d03
Remove `update` method from the policy (#99)
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-04-29 12:27:58 +02:00
Alexander Soare ccffa9e406
Fix policy defaults (#113) 2024-04-29 08:26:59 +01:00
Remi 659c69a1c0
Refactor datasets into LeRobotDataset (#91)
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-04-25 12:23:12 +02:00
Remi e760e4cd63
Move normalization to policy for act and diffusion (#90)
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-04-25 11:47:38 +02:00
Simon Alibert 7ad1909641
Tests cleaning & simplification (#81) 2024-04-18 14:47:42 +02:00
Alexander Soare 8a322da422 backup wip 2024-04-16 16:35:04 +01:00
Alexander Soare 23be5e1e7b backup wip 2024-04-16 16:31:44 +01:00
Cadene 6902e01db0 tests are passing for aloha/act policies, removes abstract policy 2024-04-09 03:28:56 +00:00
Cadene 73dfa3c8e3 tests for tdmpc and diffusion policy are passing 2024-04-09 02:50:32 +00:00
Simon Alibert 3f6dfa4916 Add gym-aloha, rename simxarm -> xarm, refactor 2024-04-08 16:24:11 +02:00
Cadene 70aaf1c4cb test_datasets.py are passing! 2024-04-08 14:16:57 +00:00
Cadene 4371a5570d Remove latency, tdmpc policy passes tests (TODO: make it work with online RL) 2024-04-07 16:01:22 +00:00
Cadene 1cdfbc8b52 WIP
WIP

WIP train.py works, loss going down

WIP eval.py

Fix

WIP (eval running, TODO: verify results reproduced)

Eval works! (testing reproducibility)

WIP

pretrained model pusht reproduces same results as torchrl

pretrained model pusht reproduces same results as torchrl

Remove AbstractPolicy, Move all queues in select_action

WIP test_datasets passed (TODO: re-enable NormalizeTransform)
2024-04-04 15:31:03 +00:00
Alexander Soare b7c9c33072 revision 2024-03-27 18:33:48 +00:00
Alexander Soare 1a1308d62f fix environment seeding
add fixes for reproducibility

only try to start env if it is closed

revision

fix normalization and data type

Improve README

Improve README

Tests are passing, Eval pretrained model works, Add gif

Update gif

Update gif

Update gif

Update gif

Update README

Update README

update minor

Update README.md

Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>

Update README.md

Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>

Address suggestions

Update thumbnail + stats

Update thumbnail + stats

Update README.md

Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>

Add more comments

Add test_examples.py
2024-03-26 10:10:43 +00:00
Simon Alibert 058ac991eb Add simxarm back into tests 2024-03-25 16:35:46 +01:00
Cadene 40f3783fca v1.2 2024-03-23 11:41:56 +00:00
Alexander Soare b54cdc9a0f break_when_any_done==True for batch_size==1 2024-03-19 19:08:25 +00:00
Alexander Soare 46ac87d2a6 ready for review 2024-03-19 18:59:08 +00:00
Alexander Soare 896a11f60e backup wip 2024-03-19 18:50:04 +00:00
Alexander Soare ea17f4ce50 backup wip 2024-03-19 16:02:09 +00:00
Alexander Soare 88347965c2 revert dp changes, make act and tdmpc batch friendly 2024-03-18 19:18:21 +00:00
Alexander Soare ba91976944 wip: still needs batch logic for act and tdmp 2024-03-14 15:24:10 +00:00
Cadene 5881eec376 Add DEVICE constant from LEROBOT_TESTS_DEVICE 2024-03-12 14:14:39 +00:00
Cadene 21670dce90 Refactor train, eval_policy, logger, Add diffusion.yaml (WIP) 2024-02-26 01:10:09 +00:00
Cadene b16c334825 Refactor configs to have env in seperate yaml + Fix training 2024-02-25 17:42:47 +00:00
Cadene 6f5c731936 Rename test -> tests 2024-02-25 10:51:07 +00:00