Remi
97b1feb0b3
Add policy/act_aloha_real.yaml + env/act_real.yaml ( #429 )
...
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
2024-10-10 17:12:45 +02:00
Alexander Soare
f8a6574698
Add online training with TD-MPC as proof of concept ( #338 )
2024-07-25 11:16:38 +01:00
Alexander Soare
abbb1d2367
Make sure policies don't mutate the batch ( #323 )
2024-07-22 20:38:33 +01:00
Alexander Soare
c0101f0948
Fix ACT temporal ensembling ( #319 )
2024-07-16 10:27:21 +01:00
Alexander Soare
342f429f1c
Add test to make sure policy dataclass configs match yaml configs ( #292 )
2024-06-26 09:09:40 +01:00
Seungjae Lee
7d1542cae1
Add VQ-BeT ( #166 )
2024-06-26 08:55:02 +01:00
Thomas Wolf
48951662f2
Bug fix: missing attention mask in VAE encoder in ACT policy ( #279 )
...
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-06-19 12:07:21 +01:00
Thomas Wolf
11f1cb5dc9
Bug fix: fix setting different learning rates between backbone and main model in ACT policy ( #280 )
2024-06-18 13:31:35 +01:00
Remi
d585c73f9f
Add real-world support for ACT on Aloha/Aloha2 ( #228 )
...
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-05-31 15:31:02 +02:00
Remi
01eae09ba6
Fix aloha real-world datasets ( #175 )
2024-05-20 13:48:09 +02:00
Alexander Soare
68c1b13406
Make policies compatible with other/multiple image keys ( #149 )
2024-05-16 13:51:53 +01:00
Simon Alibert
f52f4f2cd2
Add copyrights ( #157 )
2024-05-15 12:13:09 +02:00
Alexander Soare
e89521dfa0
Enable tests for TD-MPC ( #160 )
2024-05-09 13:42:12 +01:00
Alexander Soare
f3bba0270d
Remove EMA model from Diffusion Policy ( #134 )
2024-05-05 11:26:12 +01:00
Simon Alibert
c77633c38c
Add regression tests ( #119 )
...
- Add `tests/scripts/save_policy_to_safetensor.py` to generate test artifacts
- Add `test_backward_compatibility to test generated outputs from the policies against artifacts
2024-05-04 16:20:30 +02:00
Alexander Soare
bccee745c3
Refactor eval.py ( #127 )
2024-05-03 17:33:16 +01:00
Alexander Soare
d1855a202a
Refactor TD-MPC ( #103 )
...
Co-authored-by: Cadene <re.cadene@gmail.com>
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
2024-05-01 16:40:04 +01:00
Alexander Soare
a4891095e4
Use PytorchModelHubMixin to save models as safetensors ( #125 )
...
Co-authored-by: Remi <re.cadene@gmail.com>
2024-05-01 16:17:18 +01:00
Alexander Soare
01d5490d44
Follow `transformers` single file naming conventions ( #124 )
2024-05-01 13:09:42 +01:00
Alexander Soare
9d60dce6f3
Tidy up yaml configs ( #121 )
2024-04-30 16:08:59 +01:00
Quentin Gallouédec
508bd92d03
Remove `update` method from the policy ( #99 )
...
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-04-29 12:27:58 +02:00
Alexander Soare
ccffa9e406
Fix policy defaults ( #113 )
2024-04-29 08:26:59 +01:00
Remi
659c69a1c0
Refactor datasets into LeRobotDataset ( #91 )
...
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-04-25 12:23:12 +02:00
Remi
e760e4cd63
Move normalization to policy for act and diffusion ( #90 )
...
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-04-25 11:47:38 +02:00
Simon Alibert
7ad1909641
Tests cleaning & simplification ( #81 )
2024-04-18 14:47:42 +02:00
Alexander Soare
8a322da422
backup wip
2024-04-16 16:35:04 +01:00
Alexander Soare
23be5e1e7b
backup wip
2024-04-16 16:31:44 +01:00
Cadene
6902e01db0
tests are passing for aloha/act policies, removes abstract policy
2024-04-09 03:28:56 +00:00
Cadene
73dfa3c8e3
tests for tdmpc and diffusion policy are passing
2024-04-09 02:50:32 +00:00
Simon Alibert
3f6dfa4916
Add gym-aloha, rename simxarm -> xarm, refactor
2024-04-08 16:24:11 +02:00
Cadene
70aaf1c4cb
test_datasets.py are passing!
2024-04-08 14:16:57 +00:00
Cadene
4371a5570d
Remove latency, tdmpc policy passes tests (TODO: make it work with online RL)
2024-04-07 16:01:22 +00:00
Cadene
1cdfbc8b52
WIP
...
WIP
WIP train.py works, loss going down
WIP eval.py
Fix
WIP (eval running, TODO: verify results reproduced)
Eval works! (testing reproducibility)
WIP
pretrained model pusht reproduces same results as torchrl
pretrained model pusht reproduces same results as torchrl
Remove AbstractPolicy, Move all queues in select_action
WIP test_datasets passed (TODO: re-enable NormalizeTransform)
2024-04-04 15:31:03 +00:00
Alexander Soare
b7c9c33072
revision
2024-03-27 18:33:48 +00:00
Alexander Soare
1a1308d62f
fix environment seeding
...
add fixes for reproducibility
only try to start env if it is closed
revision
fix normalization and data type
Improve README
Improve README
Tests are passing, Eval pretrained model works, Add gif
Update gif
Update gif
Update gif
Update gif
Update README
Update README
update minor
Update README.md
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
Update README.md
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
Address suggestions
Update thumbnail + stats
Update thumbnail + stats
Update README.md
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
Add more comments
Add test_examples.py
2024-03-26 10:10:43 +00:00
Simon Alibert
058ac991eb
Add simxarm back into tests
2024-03-25 16:35:46 +01:00
Cadene
40f3783fca
v1.2
2024-03-23 11:41:56 +00:00
Alexander Soare
b54cdc9a0f
break_when_any_done==True for batch_size==1
2024-03-19 19:08:25 +00:00
Alexander Soare
46ac87d2a6
ready for review
2024-03-19 18:59:08 +00:00
Alexander Soare
896a11f60e
backup wip
2024-03-19 18:50:04 +00:00
Alexander Soare
ea17f4ce50
backup wip
2024-03-19 16:02:09 +00:00
Alexander Soare
88347965c2
revert dp changes, make act and tdmpc batch friendly
2024-03-18 19:18:21 +00:00
Alexander Soare
ba91976944
wip: still needs batch logic for act and tdmp
2024-03-14 15:24:10 +00:00
Cadene
5881eec376
Add DEVICE constant from LEROBOT_TESTS_DEVICE
2024-03-12 14:14:39 +00:00
Cadene
21670dce90
Refactor train, eval_policy, logger, Add diffusion.yaml (WIP)
2024-02-26 01:10:09 +00:00
Cadene
b16c334825
Refactor configs to have env in seperate yaml + Fix training
2024-02-25 17:42:47 +00:00
Cadene
6f5c731936
Rename test -> tests
2024-02-25 10:51:07 +00:00