Commit Graph

547 Commits

Author SHA1 Message Date
Radek Osmulski ee6d4c31d9 Make using sampler in train.py more explicit 2024-05-30 07:43:29 +10:00
Radek Osmulski aca5fd2f37 DRY up docstring 2024-05-30 07:36:58 +10:00
Radek Osmulski 5d7a4e316b Minor change to improve test clarity 2024-05-30 07:36:58 +10:00
Radek Osmulski 621f69d98f Implement 2nd round of review changes 2024-05-30 07:36:58 +10:00
Radek Osmulski 566a8aa98e Implement review feedback 2024-05-30 07:36:58 +10:00
Radek Osmulski e7abcc2ffd add EpisodeAwareSampler
EpisodeAwareSampler allows the dropping of first n or last n frames
from each episode.

Additionally, you can select which episodes to run on by passig in
their ids (complements functionality of the hf dataset slice syntax
to give extra flexibility to the user).

* modify train.py and diffusion.yaml to leverage new functionality
  and train on pusht with last 7 steps per episode dropped
* add unit tests
2024-05-30 07:36:58 +10:00
Simon Alibert 13310681b1
Enable cuda for end-to-end tests (#222) 2024-05-29 23:02:23 +02:00
Alexander Soare 3d625ae6d3
Handle `crop_shape=None` in Diffusion Policy (#219) 2024-05-28 18:27:33 +01:00
Alexander Soare e3b9f1c19b
Add resume training (#205)
Co-authored-by: Remi <re.cadene@gmail.com>
2024-05-28 12:04:23 +01:00
Simon Alibert 7ec76ee235
Fix nightly builds (#216) 2024-05-28 10:43:34 +02:00
Radek Osmulski 3b86050ab0
throw an error if config.do_maks_loss and action_is_pad not provided in batch (#213)
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-05-27 09:06:26 +01:00
Alexander Soare 6d39b73399
Adds a tutorial section on how to use arbitrary configuration files (#206) 2024-05-24 12:39:11 +01:00
Simon Alibert aca424a481
Add dev docker image (#189)
Co-authored-by: Remi <re.cadene@gmail.com>
2024-05-23 14:39:14 +02:00
Simon Alibert 35c1ce7a66
Fix install issues (#191) 2024-05-23 14:25:18 +02:00
Alexander Soare e67da1d7a6
Add tutorials for using the training script and (#196)
Co-authored-by: Remi <re.cadene@gmail.com>
2024-05-21 16:47:49 +01:00
Alexander Soare b6c216b590
Add Automatic Mixed Precision option for training and evaluation. (#199) 2024-05-20 18:57:54 +01:00
Alexander Soare 2b270d085b
Disable online training (#202)
Co-authored-by: Remi <re.cadene@gmail.com>
2024-05-20 18:27:54 +01:00
Remi c4da689171
Hot fix to compute validation loss example test (#200)
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-05-20 18:30:11 +02:00
Radek Osmulski 9b62c25f6c
Adds split_by_episodes to LeRobotDataset (#158) 2024-05-20 14:04:04 +02:00
Remi 01eae09ba6
Fix aloha real-world datasets (#175) 2024-05-20 13:48:09 +02:00
Alexander Soare 19dfb9144a
Update the README to reflect WandB disabled by default (#198) 2024-05-20 09:02:24 +01:00
Alexander Soare 096149b118
Disable wandb by default (#195) 2024-05-17 18:01:39 +01:00
Alexander Soare 5ec0af62c6
Explain why n_encoder_layers=1 (#193) 2024-05-17 15:05:40 +01:00
Alexander Soare 625f0557ef
Act temporal ensembling (#186) 2024-05-17 14:57:49 +01:00
Alexander Soare 4d7d41cdee
Fix act action queue (#185) 2024-05-16 15:43:25 +01:00
Akshay Kashyap c9069df9f1
Port SpatialSoftmax and remove Robomimic dependency (#182)
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-05-16 15:34:10 +01:00
Alexander Soare 68c1b13406
Make policies compatible with other/multiple image keys (#149) 2024-05-16 13:51:53 +01:00
Simon Alibert f52f4f2cd2
Add copyrights (#157) 2024-05-15 12:13:09 +02:00
Simon Alibert 89c6be84ca
Limit datasets major update (#176)
Co-authored-by: Quentin Lhoest <42851186+lhoestq@users.noreply.github.com>
Co-authored-by: Quentin Lhoest <lhoest.q@gmail.com>
2024-05-12 08:15:07 +02:00
AshisGhosh fc5cf3d84a
Fixes issue #152 - error with creating wandb artifact (#172)
Co-authored-by: Ashis Ghosh <ahsisghosh@live.com>
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
2024-05-12 08:13:12 +02:00
Simon Alibert 29a196c5dd
Fix #173 - Require gym-pusht to be installed for test_examples_3_and_2 (#174) 2024-05-12 08:08:59 +02:00
Remi ced3de4c94
Fix hanging in visualize_dataset.py when num_workers > 0 (#165) 2024-05-11 19:28:22 +03:00
Vincent Moens 7b47ab211b
Remove torchrl acknowledgement (#177) 2024-05-11 14:45:51 +03:00
Alexander Soare 1249aee3ac
Enable logging all the information returned by the `forward` methods of policies (#151) 2024-05-10 07:45:32 +01:00
Alexander Soare b187942db4
Add context manager for seeding (#164) 2024-05-09 17:58:39 +01:00
Alexander Soare 473345fdf6
Fix stats override in ACT config (#161) 2024-05-09 15:16:47 +01:00
Alexander Soare e89521dfa0
Enable tests for TD-MPC (#160) 2024-05-09 13:42:12 +01:00
Simon Alibert 7bb5b15f4c
Remove dependencies upper bounds constraints (#145) 2024-05-08 17:23:10 +00:00
Simon Alibert df914aa76c
Update dev docker build (#148) 2024-05-08 17:21:58 +00:00
Ikko Eltociear Ashimine 0ea7a8b2a3
refactor: update configuration_tdmpc.py (#153)
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
2024-05-08 18:13:51 +01:00
Akshay Kashyap 460df2ccea
Support for DDIMScheduler in Diffusion Policy (#146) 2024-05-08 18:05:16 +01:00
Alexander Soare f5de57b385
Fix SpatialSoftmax input shape (#150) 2024-05-08 14:57:29 +01:00
Alexander Soare 47de07658c
Override pretrained model config (#147) 2024-05-08 12:56:21 +01:00
Alexander Soare 26d9a070d8
Add policy and dataset tags to WandB logging (#144) 2024-05-07 09:09:21 +01:00
Simon Alibert eb6bfe01b2
Relax version contraints & poetry lock (#139) 2024-05-06 17:38:29 +02:00
Alexander Soare a8e245fb31
Remove loss masking from diffusion policy (#135) 2024-05-06 07:27:01 +01:00
Simon Alibert f5e76393eb
Release cleanup (#132)
Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>
Co-authored-by: Cadene <re.cadene@gmail.com>
2024-05-06 03:03:14 +02:00
Remi 6eaffbef1d
Add 25 real world datasets (static + mobile aloha) (#133) 2024-05-05 19:13:11 +02:00
Alexander Soare 6e6feae09e
Remove additional frame from eval render (#137) 2024-05-05 13:49:53 +02:00
Alexander Soare f6fe162432
Rename Diffusion Policy model repository (#136) 2024-05-05 12:06:58 +01:00