amandip7
33362dbd17
Adding parameter dataloading_s to console logs and wandb for tracking… ( #243 )
...
Co-authored-by: Remi <re.cadene@gmail.com>
2024-06-04 17:02:05 +01:00
Ruijie
b0d954c6e1
Fix bug in normalize to avoid divide by zero ( #239 )
...
Co-authored-by: rj <rj@teleopstrio-razer.lan>
Co-authored-by: Remi <re.cadene@gmail.com>
2024-06-04 12:21:28 +02:00
Simon Alibert
bd3111f28b
Fix `visualize_dataset.py --help` ( #241 )
2024-06-03 16:35:16 +02:00
Alexander Soare
cf15cba5fc
Remove redundant slicing operation in Diffusion Policy ( #240 )
2024-06-03 13:04:24 +01:00
Remi
d585c73f9f
Add real-world support for ACT on Aloha/Aloha2 ( #228 )
...
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-05-31 15:31:02 +02:00
Radek Osmulski
504d2aaf48
add EpisodeAwareSampler ( #217 )
...
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-05-31 13:43:47 +01:00
Radek Osmulski
83f4f7f7e8
Add precision param to format_big_number ( #232 )
2024-05-31 10:19:01 +02:00
Alexander Soare
633115d861
Fix chaining in MultiLerobotDataset ( #233 )
2024-05-31 09:03:28 +01:00
Alexander Soare
57fb5fe8a6
Improve documentation on VAE encoder inputs ( #215 )
2024-05-30 19:16:44 +02:00
Alexander Soare
111cd58f8a
Add `MultiLerobotDataset` for training with multiple `LeRobotDataset`s ( #229 )
2024-05-30 16:12:21 +01:00
Remi
265b0ec44d
Refactor env to add key word arguments from config yaml ( #223 )
...
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
2024-05-30 13:45:22 +02:00
Remi
2c2e4e14ed
Add `aloha_dora_format.py` ( #201 )
...
Co-authored-by: Thomas Wolf <thomwolf@users.noreply.github.com>
2024-05-30 11:26:39 +02:00
Alexander Soare
3d625ae6d3
Handle `crop_shape=None` in Diffusion Policy ( #219 )
2024-05-28 18:27:33 +01:00
Alexander Soare
e3b9f1c19b
Add resume training ( #205 )
...
Co-authored-by: Remi <re.cadene@gmail.com>
2024-05-28 12:04:23 +01:00
Radek Osmulski
3b86050ab0
Throw an error if config.do_mask_loss is set and action_is_pad is not provided in batch ( #213 )
...
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-05-27 09:06:26 +01:00
Alexander Soare
b6c216b590
Add Automatic Mixed Precision option for training and evaluation. ( #199 )
2024-05-20 18:57:54 +01:00
Alexander Soare
2b270d085b
Disable online training ( #202 )
...
Co-authored-by: Remi <re.cadene@gmail.com>
2024-05-20 18:27:54 +01:00
Remi
c4da689171
Hot fix to compute validation loss example test ( #200 )
...
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-05-20 18:30:11 +02:00
Radek Osmulski
9b62c25f6c
Adds split_by_episodes to LeRobotDataset ( #158 )
2024-05-20 14:04:04 +02:00
Remi
01eae09ba6
Fix aloha real-world datasets ( #175 )
2024-05-20 13:48:09 +02:00
Alexander Soare
096149b118
Disable wandb by default ( #195 )
2024-05-17 18:01:39 +01:00
Alexander Soare
5ec0af62c6
Explain why n_encoder_layers=1 ( #193 )
2024-05-17 15:05:40 +01:00
Alexander Soare
625f0557ef
Act temporal ensembling ( #186 )
2024-05-17 14:57:49 +01:00
Alexander Soare
4d7d41cdee
Fix act action queue ( #185 )
2024-05-16 15:43:25 +01:00
Akshay Kashyap
c9069df9f1
Port SpatialSoftmax and remove Robomimic dependency ( #182 )
...
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-05-16 15:34:10 +01:00
Alexander Soare
68c1b13406
Make policies compatible with other/multiple image keys ( #149 )
2024-05-16 13:51:53 +01:00
Simon Alibert
f52f4f2cd2
Add copyrights ( #157 )
2024-05-15 12:13:09 +02:00
AshisGhosh
fc5cf3d84a
Fixes issue #152 - error with creating wandb artifact ( #172 )
...
Co-authored-by: Ashis Ghosh <ahsisghosh@live.com>
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
2024-05-12 08:13:12 +02:00
Remi
ced3de4c94
Fix hanging in visualize_dataset.py when num_workers > 0 ( #165 )
2024-05-11 19:28:22 +03:00
Alexander Soare
1249aee3ac
Enable logging all the information returned by the `forward` methods of policies ( #151 )
2024-05-10 07:45:32 +01:00
Alexander Soare
b187942db4
Add context manager for seeding ( #164 )
2024-05-09 17:58:39 +01:00
Alexander Soare
473345fdf6
Fix stats override in ACT config ( #161 )
2024-05-09 15:16:47 +01:00
Alexander Soare
e89521dfa0
Enable tests for TD-MPC ( #160 )
2024-05-09 13:42:12 +01:00
Ikko Eltociear Ashimine
0ea7a8b2a3
refactor: update configuration_tdmpc.py ( #153 )
...
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-05-08 18:13:51 +01:00
Akshay Kashyap
460df2ccea
Support for DDIMScheduler in Diffusion Policy ( #146 )
2024-05-08 18:05:16 +01:00
Alexander Soare
f5de57b385
Fix SpatialSoftmax input shape ( #150 )
2024-05-08 14:57:29 +01:00
Alexander Soare
47de07658c
Override pretrained model config ( #147 )
2024-05-08 12:56:21 +01:00
Alexander Soare
26d9a070d8
Add policy and dataset tags to WandB logging ( #144 )
2024-05-07 09:09:21 +01:00
Alexander Soare
a8e245fb31
Remove loss masking from diffusion policy ( #135 )
2024-05-06 07:27:01 +01:00
Simon Alibert
f5e76393eb
Release cleanup ( #132 )
...
Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>
Co-authored-by: Cadene <re.cadene@gmail.com>
2024-05-06 03:03:14 +02:00
Remi
6eaffbef1d
Add 25 real world datasets (static + mobile aloha) ( #133 )
2024-05-05 19:13:11 +02:00
Alexander Soare
6e6feae09e
Remove additional frame from eval render ( #137 )
2024-05-05 13:49:53 +02:00
Alexander Soare
f6fe162432
Rename Diffusion Policy model repository ( #136 )
2024-05-05 12:06:58 +01:00
Alexander Soare
f3bba0270d
Remove EMA model from Diffusion Policy ( #134 )
2024-05-05 11:26:12 +01:00
Alexander Soare
d747195c57
Fix loops in eval.py ( #128 )
2024-05-04 15:21:10 +01:00
Simon Alibert
c77633c38c
Add regression tests ( #119 )
...
- Add `tests/scripts/save_policy_to_safetensor.py` to generate test artifacts
- Add `test_backward_compatibility` to test generated outputs from the policies against artifacts
2024-05-04 16:20:30 +02:00
Remi
19812ca470
Add dataset visualization with rerun.io ( #131 )
...
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
2024-05-04 16:07:14 +02:00
Simon Alibert
c015252e20
Remove batch length restrictions in select_action ( #123 )
2024-05-04 15:33:55 +02:00
Alexander Soare
bccee745c3
Refactor eval.py ( #127 )
2024-05-03 17:33:16 +01:00
Remi
b2cda12f87
Add video decoding to LeRobotDataset ( #92 )
2024-05-03 00:50:19 +02:00
Alexander Soare
c1668924ab
Fix missing `policy.to(device)` in policy factory ( #126 )
2024-05-01 17:26:58 +01:00
Alexander Soare
d1855a202a
Refactor TD-MPC ( #103 )
...
Co-authored-by: Cadene <re.cadene@gmail.com>
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
2024-05-01 16:40:04 +01:00
Alexander Soare
a4891095e4
Use PytorchModelHubMixin to save models as safetensors ( #125 )
...
Co-authored-by: Remi <re.cadene@gmail.com>
2024-05-01 16:17:18 +01:00
Alexander Soare
01d5490d44
Follow `transformers` single file naming conventions ( #124 )
2024-05-01 13:09:42 +01:00
Alexander Soare
986583dc5c
Nest ACT model in ACT Policy ( #122 )
2024-04-30 17:17:40 +01:00
Alexander Soare
9d60dce6f3
Tidy up yaml configs ( #121 )
2024-04-30 16:08:59 +01:00
Remi
e4e739f4f8
Refactor push_dataset_to_hub ( #118 )
2024-04-30 14:25:41 +02:00
Simon Alibert
2765877f28
Speed up CI, add more checks ( #115 )
...
- Split pytest and end-to-end tests into separate jobs
- Add poetry check to ensure pyproject.toml and poetry.lock are in sync
- Add ruff format --diff to ensure style formatting is applied (fails if ruff would reformat anything)
2024-04-29 23:05:55 +02:00
Adil Zouitine
1ec5f77f7c
Remove mask check key in XarmProcessor ( #117 )
2024-04-29 22:00:37 +02:00
Quentin Gallouédec
508bd92d03
Remove `update` method from the policy ( #99 )
...
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-04-29 12:27:58 +02:00
Adil Zouitine
5b4fd8891d
Update UmiProcessor default fps to 10 ( #116 )
2024-04-29 12:11:34 +02:00
Alexander Soare
ccffa9e406
Fix policy defaults ( #113 )
2024-04-29 08:26:59 +01:00
Simon Alibert
791506dfb8
Remove warnings ( #111 )
...
- Replace `use_pretrained_backbone` with `pretrained_backbone_weights`
- Bump diffusers' minimum version `0.26.3` -> `0.27.2`
- Add ignore flags in CI's pytest
- Change Box observation spaces in simulation environments
- Set `version_base="1.2"` in Hydra initializations
- Bump einops' minimum version `0.7.0` -> `0.8.0`
2024-04-29 00:31:33 +02:00
Adil Zouitine
55dc9f7f51
Refactor the download and publication of the datasets and convert it into CLI script ( #95 )
...
Co-authored-by: Remi <re.cadene@gmail.com>
2024-04-29 00:08:17 +02:00
Adil Zouitine
81e490d46f
Add UMI-gripper dataset ( #83 )
...
Co-authored-by: Remi <re.cadene@gmail.com>
2024-04-28 18:41:07 +02:00
Alexander Soare
45f351c618
Make sure targets are normalized too ( #106 )
2024-04-26 11:18:39 +01:00
Remi
659c69a1c0
Refactor datasets into LeRobotDataset ( #91 )
...
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-04-25 12:23:12 +02:00
Remi
e760e4cd63
Move normalization to policy for act and diffusion ( #90 )
...
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-04-25 11:47:38 +02:00
Remi
c1bcf857c5
Fix online training ( #94 )
2024-04-23 18:54:55 +02:00
Remi
1030ea0070
Loads episode_data_index and stats during dataset __init__ ( #85 )
...
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-04-23 14:13:25 +02:00
Alexander Soare
e2168163cd
Quality of life patches for eval.py ( #86 )
2024-04-19 12:33:47 +01:00
Alexander Soare
8d980940a2
Fix tolerance for delta_timestamps ( #84 )
...
Co-authored-by: Remi <re.cadene@gmail.com>
2024-04-18 18:48:22 +01:00
Simon Alibert
7ad1909641
Tests cleaning & simplification ( #81 )
2024-04-18 14:47:42 +02:00
Remi
0928afd37d
Improve dataset examples ( #82 )
...
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-04-18 11:43:16 +02:00
Alexander Soare
dd9c6eed15
Add temporary patch in TD-MPC
2024-04-17 16:27:57 +01:00
Alexander Soare
2298ddf226
wip
2024-04-17 16:21:37 +01:00
Alexander Soare
63e5ec6483
revert some formatting changes
2024-04-17 11:40:49 +01:00
Alexander Soare
c50a13ab31
draft
2024-04-17 10:50:54 +01:00
Alexander Soare
296bbfe1ad
Merge branch 'fix_stats_saving' into unify_policy_api
2024-04-17 09:08:04 +01:00
Alexander Soare
c9454333d8
revision
2024-04-17 09:02:35 +01:00
Alexander Soare
1331d3b4e4
fix issue with saving freshly computed stats
2024-04-17 08:49:28 +01:00
Alexander Soare
bff4b673c9
Merge remote-tracking branch 'upstream/main' into unify_policy_api
2024-04-17 08:08:57 +01:00
Remi
3f1c322d56
Merge pull request #73 from huggingface/user/rcadene/2024_04_14_hf_datasets
...
Use Hugging Face datasets.Dataset
2024-04-16 21:54:37 +02:00
Simon Alibert
fbc31d906c
Merge pull request #74 from huggingface/user/aliberts/2024_04_15_setup_contributions
...
Setup contributions
2024-04-16 19:51:13 +02:00
Cadene
91badebdfc
fix tests
2024-04-16 17:29:31 +00:00
Cadene
4327e43f19
fix merge thingy
2024-04-16 17:24:25 +00:00
Cadene
36d9e885ef
Address comments
2024-04-16 17:20:54 +00:00
Cadene
b241ea46dd
move download_and_upload_dataset.py to root_dir
2024-04-16 17:20:53 +00:00
Cadene
e09d25267e
fix online training
2024-04-16 17:20:53 +00:00
Cadene
4a3eac4743
fix unit tests, stats was missing, visualize_dataset was broken
2024-04-16 17:20:53 +00:00
Cadene
0980fff6cc
HF datasets works
2024-04-16 17:19:40 +00:00
Cadene
5edd9a89a0
Move stats_dataset init into else statement -> faster init
2024-04-16 17:19:39 +00:00
Cadene
c7a8218620
typo
2024-04-16 17:19:39 +00:00
Cadene
67d79732f9
Add download_and_upload_dataset.py in script, update all datasets, update online training
2024-04-16 17:19:39 +00:00
Cadene
c6aca7fe44
For Pusht: use hf datasets to train, rename load_data_with_delta_timestamps -> load_previous_and_future_frames
2024-04-16 17:19:06 +00:00
Alexander Soare
cb3978b5f3
backup wip
2024-04-16 18:12:39 +01:00
Alexander Soare
0eb899de73
Merge remote-tracking branch 'upstream/main' into unify_policy_api
2024-04-16 17:30:41 +01:00
Alexander Soare
a9496fde39
revision 1
2024-04-16 17:15:51 +01:00
Alexander Soare
23be5e1e7b
backup wip
2024-04-16 16:31:44 +01:00
Alexander Soare
9c2f10bd04
ready for review
2024-04-16 13:43:58 +01:00
Alexander Soare
03b08eb74e
backup wip
2024-04-16 12:51:32 +01:00
Simon Alibert
376d75f8d3
Add env info
2024-04-16 10:35:43 +02:00
Alexander Soare
5608e659e6
backup wip
2024-04-15 19:06:44 +01:00
Alexander Soare
14f3ffb412
Merge remote-tracking branch 'upstream/main' into refactor_dp
2024-04-15 17:08:28 +01:00
Alexander Soare
30023535f9
revision 1
2024-04-15 10:56:43 +01:00
Alexander Soare
40d417ef60
Make sure to remove all traces of omegaconf from policy config
2024-04-15 09:59:18 +01:00
Alexander Soare
9241b5e830
pass step as kwarg
2024-04-15 09:52:54 +01:00
Alexander Soare
ef4bd9e25c
Use dataclass config for ACT
2024-04-15 09:39:23 +01:00
Alexander Soare
34f00753eb
remove policy.py
2024-04-12 17:13:25 +01:00
Alexander Soare
55e484124a
draft pr
2024-04-12 17:03:59 +01:00
Alexander Soare
6d0a45a97d
ready for review
2024-04-12 11:36:52 +01:00
Alexander Soare
5666ec3ec7
backup wip
2024-04-11 18:33:54 +01:00
Alexander Soare
94cc22da9e
Merge remote-tracking branch 'upstream/main' into refactor_dp
2024-04-11 17:52:10 +01:00
Alexander Soare
976a197f98
backup wip
2024-04-11 17:51:35 +01:00
Cadene
c1a618e567
fix pusht images type from float32 to uint8, update gym-pusht dependencies
2024-04-11 14:29:16 +00:00
Cadene
8e5b4365ac
fix
2024-04-11 13:57:22 +00:00
Cadene
657b27cc8f
fix load_data_with_delta_timestamps and add tests
2024-04-11 13:00:09 +00:00
Remi
9229226522
Update lerobot/common/envs/utils.py
2024-04-11 10:35:17 +02:00
Cadene
949f4d1a5b
remove comment
2024-04-10 17:21:36 +00:00
Cadene
3914831585
remove __name__ outside script
2024-04-10 17:16:44 +00:00
Cadene
f8c5a2eb10
remove comment
2024-04-10 17:14:02 +00:00
Cadene
9874652c2f
enable test_compute_stats
2024-04-10 17:12:54 +00:00
Remi
4c3d8b061e
Update lerobot/scripts/eval.py
...
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
2024-04-10 18:07:27 +02:00
Cadene
0f0113a7a6
print_cuda_memory_usage docstring
2024-04-10 16:03:39 +00:00
Cadene
e8622154f8
Replace import gym_pusht in pusht dataset by dynamic import
2024-04-10 15:56:18 +00:00
Cadene
8866b22db1
remove policy is None eval end-to-end tests
2024-04-10 15:09:04 +00:00
Cadene
693f620df0
drop_last=False
2024-04-10 14:59:54 +00:00
Cadene
48ec479660
fix end-to-end aloha
2024-04-10 14:26:30 +00:00
Cadene
a18bcb39a7
cfg.env.fps
2024-04-10 14:02:11 +00:00
Cadene
c08003278e
test_examples are passing
2024-04-10 13:45:45 +00:00
Cadene
6082a7bc73
Enable test_available.py
2024-04-10 13:06:48 +00:00
Cadene
7c8eb7ff19
Merge remote-tracking branch 'origin/user/rcadene/2024_03_31_remove_torchrl' into user/rcadene/2024_03_31_remove_torchrl
2024-04-10 11:34:51 +00:00
Cadene
06573d7f67
online training works (loss goes down), remove repeat_action, eval_policy outputs episodes data, eval_policy uses max_episodes_rendered
2024-04-10 11:34:01 +00:00
Alexander Soare
575891e8ac
Merge remote-tracking branch 'upstream/user/rcadene/2024_03_31_remove_torchrl' into refactor_act
2024-04-09 15:19:29 +01:00
Simon Alibert
d9019d9e7e
disable env_checker in factory
2024-04-09 10:24:28 +02:00
Alexander Soare
e6c6c2367f
Merge remote-tracking branch 'upstream/user/rcadene/2024_03_31_remove_torchrl' into refactor_act
2024-04-09 08:36:28 +01:00
Cadene
253e495df2
remove render(mode=visualization)
2024-04-09 03:46:05 +00:00
Cadene
6902e01db0
tests are passing for aloha/act policies, removes abstract policy
2024-04-09 03:28:56 +00:00
Cadene
73dfa3c8e3
tests for tdmpc and diffusion policy are passing
2024-04-09 02:50:32 +00:00
Alexander Soare
50e4c8050c
Merge remote-tracking branch 'upstream/user/rcadene/2024_03_31_remove_torchrl' into refactor_act
2024-04-08 17:13:11 +01:00
Cadene
1149894e1d
rename handle -> task
2024-04-08 14:54:52 +00:00
Alexander Soare
9c96349926
Merge remote-tracking branch 'upstream/user/rcadene/2024_03_31_remove_torchrl' into refactor_act
2024-04-08 15:44:00 +01:00
Simon Alibert
3f6dfa4916
Add gym-aloha, rename simxarm -> xarm, refactor
2024-04-08 16:24:11 +02:00
Simon Alibert
5dff6d8339
remove aloha
2024-04-08 16:22:13 +02:00
Cadene
70aaf1c4cb
test_datasets.py are passing!
2024-04-08 14:16:57 +00:00
Alexander Soare
91e0e4e175
revert change
2024-04-08 15:05:40 +01:00
Alexander Soare
0b4c42f4ff
typos
2024-04-08 14:59:37 +01:00
Alexander Soare
62b18a7607
Add type hints
2024-04-08 14:51:45 +01:00
Alexander Soare
86365adf9f
revision
2024-04-08 14:44:46 +01:00
Alexander Soare
863f28ffd8
ready for review
2024-04-08 13:10:19 +01:00
Alexander Soare
1bab4a1dd5
Eval reproduction works with gym_aloha
2024-04-08 10:23:26 +01:00
Alexander Soare
e982c732f1
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-08 09:25:45 +01:00
Cadene
e1ac5dc62f
fix aloha pixels env test
2024-04-07 17:20:54 +00:00
Cadene
4371a5570d
Remove latency, tdmpc policy passes tests (TODO: make it work with online RL)
2024-04-07 16:01:22 +00:00
Cadene
44656d2706
test_envs are passing
2024-04-05 23:27:12 +00:00
Alexander Soare
8d2463f45b
backup wip
2024-04-05 18:46:30 +01:00
Alexander Soare
ecc7dd3b17
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-05 18:35:13 +01:00
Cadene
5eff40b3d6
rename task, sim_transfer -> transfer
2024-04-05 17:18:37 +00:00
Cadene
a2d3588fca
wrap dm_control aloha into gymnasium (TODO: properly seeding the env)
2024-04-05 17:17:31 +00:00
Cadene
29032fbcd3
wrap dm_control aloha into gymnasium (TODO: properly seeding the env)
2024-04-05 17:17:14 +00:00
Alexander Soare
ab2286025b
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-05 18:06:00 +01:00
Alexander Soare
1e71196fe3
backup wip
2024-04-05 17:38:29 +01:00
Cadene
26602269cd
test_envs.py are passing, remove simxarm and pusht directories
2024-04-05 16:21:07 +00:00
Cadene
f56b1a0e16
WIP tdmpc
2024-04-05 13:40:31 +00:00
Simon Alibert
ab3cd3a7ba
(WIP) Add gym-xarm
2024-04-05 15:35:20 +02:00
Alexander Soare
0b8d27ff2c
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-05 12:48:11 +01:00
Cadene
c17dffe944
policies/utils.py
2024-04-05 11:47:15 +00:00
Alexander Soare
8ba88ba250
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-05 12:34:14 +01:00
Cadene
a420714ee4
fix: action_is_pad was missing in compute_loss
2024-04-05 11:33:39 +00:00
Alexander Soare
4863e54ce9
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-05 12:00:31 +01:00
Cadene
ad3379a73a
fix memory leak due to itertools.cycle
2024-04-05 10:59:32 +00:00
Alexander Soare
9d77f5773d
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-05 11:41:11 +01:00
Alexander Soare
edb125b351
backup wip
2024-04-05 11:03:28 +01:00
Cadene
5af00d0c1e
fix train.py, stats, eval.py (training is running)
2024-04-05 09:31:39 +00:00
Alexander Soare
3a4dfa82fe
backup wip
2024-04-04 18:34:41 +01:00
Cadene
c93ce35d8c
WIP stats (TODO: run tests on stats + compute them)
2024-04-04 16:36:03 +00:00
Cadene
1cdfbc8b52
WIP
...
WIP
WIP train.py works, loss going down
WIP eval.py
Fix
WIP (eval running, TODO: verify results reproduced)
Eval works! (testing reproducibility)
WIP
pretrained model pusht reproduces same results as torchrl
Remove AbstractPolicy, Move all queues in select_action
WIP test_datasets passed (TODO: re-enable NormalizeTransform)
2024-04-04 15:31:03 +00:00
Alexander Soare
278336a39a
backup wip
2024-04-03 19:23:22 +01:00
Alexander Soare
110ac5ffa1
backup wip
2024-04-03 14:21:07 +01:00
Alexander Soare
c7d70a8db9
Merge remote-tracking branch 'upstream/main' into refactor_act
2024-04-03 10:08:12 +01:00
Alexander Soare
caf4ffcf65
add TODO
2024-04-03 09:56:46 +01:00
Alexander Soare
c50a62dd6d
clarifying math
2024-04-03 09:47:38 +01:00
Alexander Soare
e9eb262293
numerically sound mean computation
2024-04-03 09:44:20 +01:00
Alexander Soare
65ef8c30d0
backup wip
2024-04-02 19:13:49 +01:00
Alexander Soare
2b928eedd4
backup wip
2024-04-02 19:11:53 +01:00
Alexander Soare
a6edb85da4
Remove random sampling
2024-04-02 16:52:38 +01:00
Alexander Soare
95293d459d
fix stats computation
2024-04-02 16:40:33 +01:00
Alexander Soare
f1148b8c2d
Merge remote-tracking branch 'upstream/main' into finish_examples
2024-04-01 11:31:31 +01:00
Simon Alibert
6bddcb647e
Add test_aloha env test
2024-03-28 10:35:11 +01:00
Alexander Soare
b7c9c33072
revision
2024-03-27 18:33:48 +00:00
Alexander Soare
120f0aef5c
Merge remote-tracking branch 'upstream/main' into finish_examples
2024-03-27 17:52:36 +00:00
Alexander Soare
6cd671040f
fix revision
2024-03-27 13:22:14 +00:00
Alexander Soare
b6353964ba
fix bug: use provided revision instead of hardcoded one
2024-03-27 13:08:47 +00:00
Alexander Soare
64c8851c40
Merge branch 'tidy_diffusion_config' into update_diffusion_model
2024-03-27 13:06:08 +00:00
Alexander Soare
dc745e3037
Remove unused part of diffusion policy config
2024-03-27 13:05:13 +00:00
Alexander Soare
68d02c80cf
Remove b/c workaround
2024-03-27 12:03:19 +00:00
Alexander Soare
011f2d27fe
fix tests
2024-03-26 16:40:54 +00:00
Alexander Soare
1ed0110900
finish examples 2 and 3
2024-03-26 16:13:40 +00:00
Cadene
9ced0cf1fb
unskip
2024-03-26 10:45:31 +00:00
Cadene
5a46b8a2a9
fix tests
2024-03-26 10:24:46 +00:00