Wael Karkoub
54c9776bde
Improves Type Annotations ( #252 )
2024-06-10 19:09:48 +01:00
Alexander Soare
21f222fa1d
Add out_dir option to eval ( #244 )
2024-06-04 21:01:53 +02:00
amandip7
33362dbd17
Adding parameter dataloading_s to console logs and wandb for tracking… ( #243 )
...
Co-authored-by: Remi <re.cadene@gmail.com>
2024-06-04 17:02:05 +01:00
Radek Osmulski
504d2aaf48
add EpisodeAwareSampler ( #217 )
...
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-05-31 13:43:47 +01:00
Alexander Soare
111cd58f8a
Add `MultiLerobotDataset` for training with multiple `LeRobotDataset`s ( #229 )
2024-05-30 16:12:21 +01:00
Remi
265b0ec44d
Refactor env to add key word arguments from config yaml ( #223 )
...
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
2024-05-30 13:45:22 +02:00
Alexander Soare
e3b9f1c19b
Add resume training ( #205 )
...
Co-authored-by: Remi <re.cadene@gmail.com>
2024-05-28 12:04:23 +01:00
Alexander Soare
b6c216b590
Add Automatic Mixed Precision option for training and evaluation. ( #199 )
2024-05-20 18:57:54 +01:00
Alexander Soare
2b270d085b
Disable online training ( #202 )
...
Co-authored-by: Remi <re.cadene@gmail.com>
2024-05-20 18:27:54 +01:00
Alexander Soare
68c1b13406
Make policies compatible with other/multiple image keys ( #149 )
2024-05-16 13:51:53 +01:00
Simon Alibert
f52f4f2cd2
Add copyrights ( #157 )
2024-05-15 12:13:09 +02:00
Alexander Soare
1249aee3ac
Enable logging all the information returned by the `forward` methods of policies ( #151 )
2024-05-10 07:45:32 +01:00
Simon Alibert
f5e76393eb
Release cleanup ( #132 )
...
Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>
Co-authored-by: Cadene <re.cadene@gmail.com>
2024-05-06 03:03:14 +02:00
Alexander Soare
f3bba0270d
Remove EMA model from Diffusion Policy ( #134 )
2024-05-05 11:26:12 +01:00
Simon Alibert
c77633c38c
Add regression tests ( #119 )
...
- Add `tests/scripts/save_policy_to_safetensor.py` to generate test artifacts
- Add `test_backward_compatibility to test generated outputs from the policies against artifacts
2024-05-04 16:20:30 +02:00
Alexander Soare
bccee745c3
Refactor eval.py ( #127 )
2024-05-03 17:33:16 +01:00
Alexander Soare
d1855a202a
Refactor TD-MPC ( #103 )
...
Co-authored-by: Cadene <re.cadene@gmail.com>
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
2024-05-01 16:40:04 +01:00
Alexander Soare
a4891095e4
Use PytorchModelHubMixin to save models as safetensors ( #125 )
...
Co-authored-by: Remi <re.cadene@gmail.com>
2024-05-01 16:17:18 +01:00
Alexander Soare
9d60dce6f3
Tidy up yaml configs ( #121 )
2024-04-30 16:08:59 +01:00
Simon Alibert
2765877f28
Speed up CI, add more checks ( #115 )
...
- Split pytest and end-to-end tests into separate jobs
- Add poetry check to ensure pyproject.toml and poetry.lock are in sync
- Add ruff format --diff to ensure style formatting is applied (fails if ruff would reformat anything)
2024-04-29 23:05:55 +02:00
Quentin Gallouédec
508bd92d03
Remove `update` method from the policy ( #99 )
...
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-04-29 12:27:58 +02:00
Simon Alibert
791506dfb8
Remove warnings ( #111 )
...
- Replace `use_pretrained_backbone` with `pretrained_backbone_weights`
- Bump diffusers' minimum version `0.26.3` -> `0.27.2`
- Add ignore flags in CI's pytest
- Change Box observation spaces in simulation environments
- Set `version_base="1.2"` in Hydra initializations
- Bump einops' minimum version `0.7.0` -> `0.8.0`
2024-04-29 00:31:33 +02:00
Alexander Soare
45f351c618
Make sure targets are normalized too ( #106 )
2024-04-26 11:18:39 +01:00
Remi
e760e4cd63
Move normalization to policy for act and diffusion ( #90 )
...
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-04-25 11:47:38 +02:00
Remi
c1bcf857c5
Fix online training ( #94 )
2024-04-23 18:54:55 +02:00
Remi
1030ea0070
Loads episode_data_index and stats during dataset __init__ ( #85 )
...
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-04-23 14:13:25 +02:00
Alexander Soare
e2168163cd
Quality of life patches for eval.py ( #86 )
2024-04-19 12:33:47 +01:00
Simon Alibert
7ad1909641
Tests cleaning & simplification ( #81 )
2024-04-18 14:47:42 +02:00
Remi
0928afd37d
Improve dataset examples ( #82 )
...
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-04-18 11:43:16 +02:00
Alexander Soare
bff4b673c9
Merge remote-tracking branch 'upstream/main' into unify_policy_api
2024-04-17 08:08:57 +01:00
Cadene
36d9e885ef
Address comments
2024-04-16 17:20:54 +00:00
Cadene
c7a8218620
typo
2024-04-16 17:19:39 +00:00
Cadene
67d79732f9
Add download_and_upload_dataset.py in script, update all datasets, update online training
2024-04-16 17:19:39 +00:00
Alexander Soare
23be5e1e7b
backup wip
2024-04-16 16:31:44 +01:00
Alexander Soare
9241b5e830
pass step as kwarg
2024-04-15 09:52:54 +01:00
Alexander Soare
6d0a45a97d
ready for review
2024-04-12 11:36:52 +01:00
Alexander Soare
94cc22da9e
Merge remote-tracking branch 'upstream/main' into refactor_dp
2024-04-11 17:52:10 +01:00
Alexander Soare
976a197f98
backup wip
2024-04-11 17:51:35 +01:00
Cadene
693f620df0
drop_last=False
2024-04-10 14:59:54 +00:00
Cadene
7c8eb7ff19
Merge remote-tracking branch 'origin/user/rcadene/2024_03_31_remove_torchrl' into user/rcadene/2024_03_31_remove_torchrl
2024-04-10 11:34:51 +00:00
Cadene
06573d7f67
online training works (loss goes down), remove repeat_action, eval_policy outputs episodes data, eval_policy uses max_episodes_rendered
2024-04-10 11:34:01 +00:00
Alexander Soare
86365adf9f
revision
2024-04-08 14:44:46 +01:00
Alexander Soare
e982c732f1
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-08 09:25:45 +01:00
Alexander Soare
8d2463f45b
backup wip
2024-04-05 18:46:30 +01:00
Alexander Soare
1e71196fe3
backup wip
2024-04-05 17:38:29 +01:00
Alexander Soare
4863e54ce9
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-05 12:00:31 +01:00
Cadene
ad3379a73a
fix memory leak due to itertools.cycle
2024-04-05 10:59:32 +00:00
Alexander Soare
9d77f5773d
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-05 11:41:11 +01:00
Cadene
5af00d0c1e
fix train.py, stats, eval.py (training is running)
2024-04-05 09:31:39 +00:00
Cadene
1cdfbc8b52
WIP
...
WIP
WIP train.py works, loss going down
WIP eval.py
Fix
WIP (eval running, TODO: verify results reproduced)
Eval works! (testing reproducibility)
WIP
pretrained model pusht reproduces same results as torchrl
pretrained model pusht reproduces same results as torchrl
Remove AbstractPolicy, Move all queues in select_action
WIP test_datasets passed (TODO: re-enable NormalizeTransform)
2024-04-04 15:31:03 +00:00