Remi
9229226522
Update lerobot/common/envs/utils.py
2024-04-11 10:35:17 +02:00
Cadene
949f4d1a5b
remove comment
2024-04-10 17:21:36 +00:00
Cadene
3914831585
remove __name__ outside script
2024-04-10 17:16:44 +00:00
Cadene
f8c5a2eb10
remove comment
2024-04-10 17:14:02 +00:00
Cadene
9874652c2f
enable test_compute_stats
...
enable test_compute_stats
2024-04-10 17:12:54 +00:00
Remi
4c3d8b061e
Update lerobot/scripts/eval.py
...
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
2024-04-10 18:07:27 +02:00
Cadene
0f0113a7a6
print_cuda_memory_usage docstring
2024-04-10 16:03:39 +00:00
Cadene
e8622154f8
Replace import gym_pusht in pusht dataset by dynamic import
2024-04-10 15:56:18 +00:00
Cadene
8866b22db1
remove policy is None eval end-to-end tests
2024-04-10 15:09:04 +00:00
Cadene
693f620df0
drop_last=False
2024-04-10 14:59:54 +00:00
Cadene
48ec479660
fix end-to-end aloha
2024-04-10 14:26:30 +00:00
Cadene
a18bcb39a7
cfg.env.fps
2024-04-10 14:02:11 +00:00
Cadene
c08003278e
test_examples are passing
2024-04-10 13:45:45 +00:00
Cadene
6082a7bc73
Enable test_available.py
2024-04-10 13:06:48 +00:00
Cadene
7c8eb7ff19
Merge remote-tracking branch 'origin/user/rcadene/2024_03_31_remove_torchrl' into user/rcadene/2024_03_31_remove_torchrl
2024-04-10 11:34:51 +00:00
Cadene
06573d7f67
online training works (loss goes down), remove repeat_action, eval_policy outputs episodes data, eval_policy uses max_episodes_rendered
2024-04-10 11:34:01 +00:00
Alexander Soare
575891e8ac
Merge remote-tracking branch 'upstream/user/rcadene/2024_03_31_remove_torchrl' into refactor_act
2024-04-09 15:19:29 +01:00
Simon Alibert
d9019d9e7e
disable env_checker in factory
2024-04-09 10:24:28 +02:00
Alexander Soare
e6c6c2367f
Merge remote-tracking branch 'upstream/user/rcadene/2024_03_31_remove_torchrl' into refactor_act
2024-04-09 08:36:28 +01:00
Cadene
253e495df2
remove render(mode=visualization)
2024-04-09 03:46:05 +00:00
Cadene
6902e01db0
tests are passing for aloha/act policies, removes abstract policy
2024-04-09 03:28:56 +00:00
Cadene
73dfa3c8e3
tests for tdmpc and diffusion policy are passing
2024-04-09 02:50:32 +00:00
Alexander Soare
50e4c8050c
Merge remote-tracking branch 'upstream/user/rcadene/2024_03_31_remove_torchrl' into refactor_act
2024-04-08 17:13:11 +01:00
Cadene
1149894e1d
rename handle -> task
2024-04-08 14:54:52 +00:00
Alexander Soare
9c96349926
Merge remote-tracking branch 'upstream/user/rcadene/2024_03_31_remove_torchrl' into refactor_act
2024-04-08 15:44:00 +01:00
Simon Alibert
3f6dfa4916
Add gym-aloha, rename simxarm -> xarm, refactor
2024-04-08 16:24:11 +02:00
Simon Alibert
5dff6d8339
remove aloha
2024-04-08 16:22:13 +02:00
Cadene
70aaf1c4cb
test_datasets.py are passing!
2024-04-08 14:16:57 +00:00
Alexander Soare
91e0e4e175
rever change
2024-04-08 15:05:40 +01:00
Alexander Soare
0b4c42f4ff
typos
2024-04-08 14:59:37 +01:00
Alexander Soare
62b18a7607
Add type hints
2024-04-08 14:51:45 +01:00
Alexander Soare
86365adf9f
revision
2024-04-08 14:44:46 +01:00
Alexander Soare
863f28ffd8
ready for review
2024-04-08 13:10:19 +01:00
Alexander Soare
1bab4a1dd5
Eval reproduction works with gym_aloha
2024-04-08 10:23:26 +01:00
Alexander Soare
e982c732f1
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-08 09:25:45 +01:00
Cadene
e1ac5dc62f
fix aloha pixels env test
2024-04-07 17:20:54 +00:00
Cadene
4371a5570d
Remove latency, tdmpc policy passes tests (TODO: make it work with online RL)
2024-04-07 16:01:22 +00:00
Cadene
44656d2706
test_envs are passing
2024-04-05 23:27:12 +00:00
Alexander Soare
8d2463f45b
backup wip
2024-04-05 18:46:30 +01:00
Alexander Soare
ecc7dd3b17
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-05 18:35:13 +01:00
Cadene
5eff40b3d6
rename task, sim_transfer -> transfer
2024-04-05 17:18:37 +00:00
Cadene
a2d3588fca
wrap dm_control aloha into gymnasium (TODO: properly seeding the env)
2024-04-05 17:17:31 +00:00
Cadene
29032fbcd3
wrap dm_control aloha into gymnasium (TODO: properly seeding the env)
2024-04-05 17:17:14 +00:00
Alexander Soare
ab2286025b
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-05 18:06:00 +01:00
Alexander Soare
1e71196fe3
backup wip
2024-04-05 17:38:29 +01:00
Cadene
26602269cd
test_envs.py are passing, remove simxarm and pusht directories
2024-04-05 16:21:07 +00:00
Cadene
f56b1a0e16
WIP tdmpc
2024-04-05 13:40:31 +00:00
Simon Alibert
ab3cd3a7ba
(WIP) Add gym-xarm
2024-04-05 15:35:20 +02:00
Alexander Soare
0b8d27ff2c
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-05 12:48:11 +01:00
Cadene
c17dffe944
policies/utils.py
2024-04-05 11:47:15 +00:00
Alexander Soare
8ba88ba250
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-05 12:34:14 +01:00
Cadene
a420714ee4
fix: action_is_pad was missing in compute_loss
2024-04-05 11:33:39 +00:00
Alexander Soare
4863e54ce9
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-05 12:00:31 +01:00
Cadene
ad3379a73a
fix memory leak due to itertools.cycle
2024-04-05 10:59:32 +00:00
Alexander Soare
9d77f5773d
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-05 11:41:11 +01:00
Alexander Soare
edb125b351
backup wip
2024-04-05 11:03:28 +01:00
Cadene
5af00d0c1e
fix train.py, stats, eval.py (training is running)
2024-04-05 09:31:39 +00:00
Alexander Soare
3a4dfa82fe
backup wip
2024-04-04 18:34:41 +01:00
Cadene
c93ce35d8c
WIP stats (TODO: run tests on stats + cmpute them)
2024-04-04 16:36:03 +00:00
Cadene
1cdfbc8b52
WIP
...
WIP
WIP train.py works, loss going down
WIP eval.py
Fix
WIP (eval running, TODO: verify results reproduced)
Eval works! (testing reproducibility)
WIP
pretrained model pusht reproduces same results as torchrl
pretrained model pusht reproduces same results as torchrl
Remove AbstractPolicy, Move all queues in select_action
WIP test_datasets passed (TODO: re-enable NormalizeTransform)
2024-04-04 15:31:03 +00:00
Alexander Soare
278336a39a
backup wip
2024-04-03 19:23:22 +01:00
Alexander Soare
110ac5ffa1
backup wip
2024-04-03 14:21:07 +01:00
Alexander Soare
c7d70a8db9
Merge remote-tracking branch 'upstream/main' into refactor_act
2024-04-03 10:08:12 +01:00
Alexander Soare
caf4ffcf65
add TODO
2024-04-03 09:56:46 +01:00
Alexander Soare
c50a62dd6d
clarifying math
2024-04-03 09:47:38 +01:00
Alexander Soare
e9eb262293
numerically sound mean computation
2024-04-03 09:44:20 +01:00
Alexander Soare
65ef8c30d0
backup wip
2024-04-02 19:13:49 +01:00
Alexander Soare
2b928eedd4
backup wip
2024-04-02 19:11:53 +01:00
Alexander Soare
a6edb85da4
Remove random sampling
2024-04-02 16:52:38 +01:00
Alexander Soare
95293d459d
fix stats computation
2024-04-02 16:40:33 +01:00
Alexander Soare
f1148b8c2d
Merge remote-tracking branch 'upstream/main' into finish_examples
2024-04-01 11:31:31 +01:00
Simon Alibert
6bddcb647e
Add test_aloha env test
2024-03-28 10:35:11 +01:00
Alexander Soare
b7c9c33072
revision
2024-03-27 18:33:48 +00:00
Alexander Soare
120f0aef5c
Merge remote-tracking branch 'upstream/main' into finish_examples
2024-03-27 17:52:36 +00:00
Alexander Soare
6cd671040f
fix revision
2024-03-27 13:22:14 +00:00
Alexander Soare
b6353964ba
fix bug: use provided revision instead of hardcoded one
2024-03-27 13:08:47 +00:00
Alexander Soare
64c8851c40
Merge branch 'tidy_diffusion_config' into update_diffusion_model
2024-03-27 13:06:08 +00:00
Alexander Soare
dc745e3037
Remove unused part of diffusion policy config
2024-03-27 13:05:13 +00:00
Alexander Soare
68d02c80cf
Remove b/c workaround
2024-03-27 12:03:19 +00:00
Alexander Soare
011f2d27fe
fix tests
2024-03-26 16:40:54 +00:00
Alexander Soare
1ed0110900
finish examples 2 and 3
2024-03-26 16:13:40 +00:00
Cadene
9ced0cf1fb
unskip
2024-03-26 10:45:31 +00:00
Cadene
5a46b8a2a9
fix tests
2024-03-26 10:24:46 +00:00
Cadene
4a8c5e238e
issue with cat_and_write_video
2024-03-26 10:12:16 +00:00
Alexander Soare
1a1308d62f
fix environment seeding
...
add fixes for reproducibility
only try to start env if it is closed
revision
fix normalization and data type
Improve README
Improve README
Tests are passing, Eval pretrained model works, Add gif
Update gif
Update gif
Update gif
Update gif
Update README
Update README
update minor
Update README.md
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
Update README.md
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
Address suggestions
Update thumbnail + stats
Update thumbnail + stats
Update README.md
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
Add more comments
Add test_examples.py
2024-03-26 10:10:43 +00:00
Simon Alibert
98b9631aa6
Add n_obs_steps in default.yaml config
2024-03-26 10:08:00 +01:00
Simon Alibert
c5635b7d94
Minor fixes for #47
2024-03-25 18:50:47 +01:00
Simon Alibert
bcfdba109f
Update pre-commit & run on all files
2024-03-25 17:29:35 +01:00
Simon Alibert
7cdd6d2450
Renamed set_seed -> set_global_seed
2024-03-25 17:19:28 +01:00
Simon Alibert
058ac991eb
Add simxarm back into tests
2024-03-25 16:35:46 +01:00
Simon Alibert
d3adaf1379
Add stat.pth for xarm_lift_medium
2024-03-25 15:55:45 +01:00
Simon Alibert
dc89166bee
Upgrade gym to gymnasium
2024-03-25 15:12:21 +01:00
Simon Alibert
5ef813ff1e
Remove deprecated code
2024-03-25 13:22:49 +01:00
Simon Alibert
c0833f1c2d
Remove simxarm download and preproc hack
2024-03-25 12:41:17 +01:00
Simon Alibert
de5c30405e
fix wrong version
2024-03-25 12:35:06 +01:00
Simon Alibert
462e7469e8
Add xarm_lift_medium revision 1.0 to hub
2024-03-25 12:28:07 +01:00
Cadene
be6364f109
fix, it's training now!
2024-03-25 12:28:07 +01:00
Simon Alibert
127de1258d
WIP
2024-03-25 12:28:07 +01:00
Cadene
b905111895
fix render issue
2024-03-25 12:28:07 +01:00
Simon Alibert
0c41675986
fix __init__ import Base
2024-03-25 12:28:07 +01:00
Simon Alibert
1c24bbda3f
WIP Upgrading simxam from mujoco-py to mujoco python bindings
2024-03-25 12:28:07 +01:00
Alexander Soare
4a48b77540
fix normalization and data type
2024-03-25 09:44:03 +00:00
Remi
f3cfc8b3b4
Merge pull request #46 from huggingface/user/rcadene/2024_03_23_update_stats_v1.2
...
Fix bug with stats.pth + Move from cadene to lerobot + Update datasets to v1.2
2024-03-24 17:53:32 +01:00
Cadene
d2ef43436c
move from cadene to lerobot
2024-03-23 13:34:35 +00:00
Cadene
40f3783fca
v1.2
2024-03-23 11:41:56 +00:00
Alexander Soare
bd40ffc53c
revision
2024-03-22 15:43:45 +00:00
Alexander Soare
d43fa600a0
only try to start env if it is closed
2024-03-22 15:32:55 +00:00
Alexander Soare
e698d38a35
Merge remote-tracking branch 'upstream/main' into fix_environment_seeding
2024-03-22 15:11:15 +00:00
Alexander Soare
15ff3b3af8
add fixes for reproducibility
2024-03-22 15:06:57 +00:00
Alexander Soare
b9047fbdd2
fix environment seeding
2024-03-22 13:25:23 +00:00
Alexander Soare
115927d0f6
make sure to pass stats.pth arg
2024-03-22 12:58:59 +00:00
Alexander Soare
529f42643d
revision
2024-03-22 12:33:25 +00:00
Alexander Soare
8720c568d0
Add ability to eval hub model
2024-03-22 10:26:55 +00:00
Alexander Soare
41912b962b
remove TODO
2024-03-21 13:51:26 +00:00
Alexander Soare
4e10cd306b
revert changes to default.yaml
2024-03-21 10:27:07 +00:00
Alexander Soare
72d3c3120b
Merge remote-tracking branch 'upstream/main' into fix_pusht_diffusion
2024-03-21 10:20:52 +00:00
Alexander Soare
acf1174447
ready for review
2024-03-21 10:18:50 +00:00
Simon Alibert
1bd50122be
Merge pull request #40 from huggingface/user/aliberts/2024_03_20_enable_mps_device
...
Enable mps backend for Apple silicon devices
2024-03-20 19:33:12 +01:00
Simon Alibert
4631d36c05
Add get_safe_torch_device in policies
2024-03-20 18:38:55 +01:00
Cadene
82e6e01651
v1.1
2024-03-20 17:34:00 +00:00
Alexander Soare
d323993569
backup wip
2024-03-20 15:01:27 +00:00
Alexander Soare
4b7ec81dde
remove abstracmethods, fix online training
2024-03-20 14:49:41 +00:00
Alexander Soare
32e3f71dd1
backup wip
2024-03-20 09:49:16 +00:00
Alexander Soare
5332766a82
revision
2024-03-20 09:45:45 +00:00
Alexander Soare
b1ec3da035
remove internal rendering hooks
2024-03-20 09:23:23 +00:00
Alexander Soare
d16f6a93b3
Merge remote-tracking branch 'upstream/main' into user/alexander-soare/multistep_policy_and_serial_env
2024-03-20 09:01:45 +00:00
Alexander Soare
52e149fbfd
Only save video frames in first rollout
2024-03-20 08:32:11 +00:00
Alexander Soare
4f1955edfd
Clear action queue when environment is reset
2024-03-20 08:31:06 +00:00
Alexander Soare
c5010fee9a
fix seeding
2024-03-20 08:21:33 +00:00
Alexander Soare
18fa88475b
Move reset_warning_issued flag to class attribute
2024-03-20 08:09:38 +00:00
Alexander Soare
b54cdc9a0f
break_when_any_done==True for batch_size==1
2024-03-19 19:08:25 +00:00
Alexander Soare
896a11f60e
backup wip
2024-03-19 18:50:04 +00:00
Cadene
7d5d99e036
Address more comments
2024-03-19 16:53:07 +00:00
Cadene
10034e85c4
Aloha done
2024-03-19 16:03:42 +00:00
Alexander Soare
ea17f4ce50
backup wip
2024-03-19 16:02:09 +00:00
Cadene
6a1a29386a
Add replay_buffer directory in pusht datasets + aloha (WIP)
2024-03-19 15:49:45 +00:00
Alexander Soare
88347965c2
revert dp changes, make act and tdmpc batch friendly
2024-03-18 19:18:21 +00:00
Alexander Soare
09ddd9bf92
Merge branch 'main' into user/alexander-soare/multistep_policy_and_serial_env
2024-03-18 18:27:50 +00:00
Alexander Soare
8e346b379d
switch between train and eval
2024-03-18 09:45:17 +00:00
Alexander Soare
bae7e7b41c
Merge remote-tracking branch 'origin/main' into user/alexander-soare/multistep_policy_and_serial_env
2024-03-15 14:06:53 +00:00
Alexander Soare
3124f71ebd
Merge remote-tracking branch 'origin/main' into user/alexander-soare/multistep_policy_and_serial_env
2024-03-15 14:04:23 +00:00
Alexander Soare
4ecfd17f9e
fix wandb artifact name and add disable option
2024-03-15 13:56:55 +00:00
Cadene
b752833f3f
fix download
2024-03-15 13:19:18 +00:00
Alexander Soare
a45896dc8d
Merge remote-tracking branch 'origin/main' into user/alexander-soare/multistep_policy_and_serial_env
2024-03-15 13:05:35 +00:00
Cadene
5805a7ffb1
small fix in type + comments
2024-03-15 12:44:52 +00:00
Cadene
41521f7e96
self.root is Path or None + The following packages are already present in the pyproject.toml and will be skipped:
...
- huggingface-hub
If you want to update it to the latest compatible version, you can use `poetry update package`.
If you prefer to upgrade it to the latest available version, you can use `poetry add package@latest`.
Nothing to add.
2024-03-15 10:56:46 +00:00
Cadene
b10c9507d4
Small fix
2024-03-15 00:36:55 +00:00
Cadene
a311d38796
Add aloha + improve readme
2024-03-15 00:30:11 +00:00
Cadene
19730b3412
Add pusht on hf dataset (WIP)
2024-03-14 16:59:37 +00:00
Alexander Soare
a222c88c99
Merge branch 'user/alexander-soare/train_pusht' into user/alexander-soare/multistep_policy_and_serial_env
2024-03-14 16:06:21 +00:00
Alexander Soare
ba91976944
wip: still needs batch logic for act and tdmp
2024-03-14 15:24:10 +00:00
Alexander Soare
98484ac68e
ready for review
2024-03-12 21:59:01 +00:00
Alexander Soare
9512d1d2f3
Merge branch 'main' into user/alexander-soare/train_pusht
2024-03-12 19:41:27 +00:00
Remi Cadene
9d002032d1
Add Aloha env and ACT policy
...
WIP Aloha env tests pass
Rendering works (fps look fast tho? TODO action bounding is too wide [-1,1])
Update README
Copy past from act repo
Remove download.py add a WIP for Simxarm
Remove download.py add a WIP for Simxarm
Add act yaml (TODO: try train.py)
Training can runs (TODO: eval)
Add tasks without end_effector that are compatible with dataset, Eval can run (TODO: training and pretrained model)
Add AbstractEnv, Refactor AlohaEnv, Add rendering_hook in env, Minor modifications, (TODO: Refactor Pusht and Simxarm)
poetry lock
fix bug in compute_stats for action normalization
fix more bugs in normalization
fix training
fix import
PushtEnv inheriates AbstractEnv, Improve factory Normalization
Add _make_env to EnvAbstract
Add call_rendering_hooks to pusht env
SimxarmEnv inherites from AbstractEnv (NOT TESTED)
Add aloha tests artifacts + update pusht stats
fix image normalization: before env was in [0,1] but dataset in [0,255], and now both in [0,255]
Small fix on simxarm
Add next to obs
Add top camera to Aloha env (TODO: make it compatible with set of cameras)
Add top camera to Aloha env (TODO: make it compatible with set of cameras)
2024-03-12 10:27:48 +00:00
Alexander Soare
87fcc536f9
wip - still need to verify full training run
2024-03-11 18:45:21 +00:00
Simon Alibert
48e70e044e
Merge pull request #18 from Cadene/user/aliberts/2024_03_11_dynamic_build_versioning
...
Add dynamic build versioning
2024-03-11 19:06:51 +01:00
Simon Alibert
4449c06823
Add comment
...
Co-authored-by: Remi <re.cadene@gmail.com>
2024-03-11 19:06:22 +01:00
Alexander Soare
304355c917
Merge remote-tracking branch 'origin/main' into train_pusht
2024-03-11 15:37:37 +00:00
Alexander Soare
2a01487494
early training loss as expected
2024-03-11 13:34:04 +00:00
Simon Alibert
78690d197f
Merge pull request #19 from Cadene/user/aliberts/2024_03_11_wandb_config
...
Configure wandb entity outside config
2024-03-11 14:17:44 +01:00
Simon Alibert
6d6c84b4a3
Remove entity from config
...
Co-authored-by: Remi <re.cadene@gmail.com>
2024-03-11 14:14:17 +01:00
Remi
fab2b3240b
Merge pull request #17 from Cadene/user/rcadene/2024_03_11_bugfix_compute_stats
...
Fix bugs with normalization
2024-03-11 13:44:07 +01:00
Cadene
84a1647c01
fix import
2024-03-11 12:41:14 +00:00
Cadene
ccd5dc5a42
fix training
2024-03-11 12:33:15 +00:00
Simon Alibert
00fe4f4f18
Configure wandb entity outside config
2024-03-11 13:09:46 +01:00
Simon Alibert
225eebde40
Add dynamic build versioning
2024-03-11 12:10:46 +01:00
Cadene
816b2e9d63
fix more bugs in normalization
2024-03-11 11:03:51 +00:00
Cadene
a7ef4a6a33
fix bug in compute_stats for action normalization
2024-03-11 09:47:54 +00:00
Simon Alibert
f54ee7cda0
Fix paths
2024-03-10 16:51:50 +01:00
Simon Alibert
134009f337
Remove init files
2024-03-10 16:38:49 +01:00
Simon Alibert
6c867d78ef
Integrate pusht env from diffusion
2024-03-10 16:33:03 +01:00
Simon Alibert
302b78962c
Integrate diffusion policy
2024-03-10 15:31:17 +01:00
Simon Alibert
59397fb44a
Move tdmpc files
2024-03-09 18:44:36 +01:00
Simon Alibert
89eaab140b
Add pusht test artifact
2024-03-09 15:36:20 +01:00
Simon Alibert
f1e2837d63
fix pusht data_dir path
2024-03-08 12:26:15 +01:00
Remi Cadene
524d29aa80
fix tests
2024-03-07 13:23:22 +01:00
Remi Cadene
d782b029e1
Add aloha dataset
2024-03-06 10:26:32 +00:00
Remi
49c0955f97
Merge pull request #7 from Cadene/user/rcadene/2024_03_05_abstract_replay_buffer
...
Add AbstractReplayBuffer
2024-03-06 11:25:24 +01:00
Remi Cadene
eed24b083a
small fix
2024-03-06 10:21:22 +00:00
Remi Cadene
f95ecd66fc
Improve visualize_dataset, Improve AbstractReplayBuffer, Small improvements
2024-03-06 10:15:57 +00:00
Simon Alibert
a6d353c419
Fix
2024-03-05 17:00:17 +01:00
Remi Cadene
2f80d71c3e
Remove noqa-F821
2024-03-05 10:22:21 +00:00
Remi Cadene
d4e0849970
Refactor datasets with abstract class
2024-03-05 10:20:57 +00:00
Remi Cadene
e132a267aa
offline_prioritized_sampler: true
2024-03-04 23:17:59 +00:00
Remi Cadene
a027f4edfb
Add cfg.offline_prioritized_sampler
2024-03-04 23:08:52 +00:00
Remi Cadene
2bcf2631b9
minor comment
2024-03-04 22:34:44 +00:00
Remi
e990f3e148
Merge pull request #6 from Cadene/user/rcadene/2024_03_04_diffusion
...
Make diffusion work
2024-03-04 18:30:40 +01:00
Remi Cadene
e29fbb50e8
Fix grad_clip_norm 0 -> 10, Fix normalization min_max to be per channel
2024-03-04 17:26:34 +00:00
Remi Cadene
cfc304e870
Refactor env queue, Training diffusion works (Still not converging)
2024-03-04 11:00:51 +00:00
Remi Cadene
fddd9f0311
Add possibility for the policy to provide a sequence of actions to the env
2024-03-03 14:02:24 +00:00
Remi Cadene
4c400b41a5
Improve log msg in train.py
2024-03-03 13:22:09 +00:00
Remi Cadene
0f2fa4d9ef
Add obs queue to pusht, Set n_obs_steps=2 for diffusion (Not fully tested)
2024-03-03 13:21:31 +00:00
Remi Cadene
cbbed590a9
Add mode to NormalizeTransform with mean_std or min_max (Not fully tested)
2024-03-03 13:19:02 +00:00
Simon Alibert
b859e89936
Fix for PR #5
2024-03-03 13:05:21 +01:00
Simon Alibert
b33ec5a630
Add run on cpu-only compatibility
2024-03-03 12:47:26 +01:00
Remi Cadene
48ded3dbc7
fix
2024-03-02 18:11:50 +00:00
Remi Cadene
661bda45ea
imagenet_norm: False
2024-03-02 17:18:58 +00:00
Remi Cadene
80785f8d0e
Small fix, Refactor diffusion, Diffusion runs (TODO: remove normalization in diffusion)
2024-03-02 17:04:39 +00:00
Remi Cadene
45b4ecb727
pre-commit run -a
2024-03-02 15:58:21 +00:00
Remi Cadene
1ae6205269
Add Normalize, non_blocking=True in tdmpc, tdmpc run (TODO: diffusion)
2024-03-02 15:53:29 +00:00