Alexander Soare
e982c732f1
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-08 09:25:45 +01:00
Cadene
e1ac5dc62f
fix aloha pixels env test
2024-04-07 17:20:54 +00:00
Cadene
4371a5570d
Remove latency, tdmpc policy passes tests (TODO: make it work with online RL)
2024-04-07 16:01:22 +00:00
Cadene
44656d2706
test_envs are passing
2024-04-05 23:27:12 +00:00
Alexander Soare
8d2463f45b
backup wip
2024-04-05 18:46:30 +01:00
Alexander Soare
ecc7dd3b17
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-05 18:35:13 +01:00
Cadene
a2d3588fca
wrap dm_control aloha into gymnasium (TODO: properly seeding the env)
2024-04-05 17:17:31 +00:00
Cadene
29032fbcd3
wrap dm_control aloha into gymnasium (TODO: properly seeding the env)
2024-04-05 17:17:14 +00:00
Alexander Soare
ab2286025b
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-05 18:06:00 +01:00
Alexander Soare
1e71196fe3
backup wip
2024-04-05 17:38:29 +01:00
Cadene
26602269cd
test_envs.py are passing, remove simxarm and pusht directories
2024-04-05 16:21:07 +00:00
Cadene
f56b1a0e16
WIP tdmpc
2024-04-05 13:40:31 +00:00
Simon Alibert
ab3cd3a7ba
(WIP) Add gym-xarm
2024-04-05 15:35:20 +02:00
Alexander Soare
0b8d27ff2c
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-05 12:48:11 +01:00
Cadene
c17dffe944
policies/utils.py
2024-04-05 11:47:15 +00:00
Alexander Soare
8ba88ba250
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-05 12:34:14 +01:00
Cadene
a420714ee4
fix: action_is_pad was missing in compute_loss
2024-04-05 11:33:39 +00:00
Alexander Soare
4863e54ce9
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-05 12:00:31 +01:00
Cadene
ad3379a73a
fix memory leak due to itertools.cycle
2024-04-05 10:59:32 +00:00
Alexander Soare
9d77f5773d
Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl
2024-04-05 11:41:11 +01:00
Alexander Soare
edb125b351
backup wip
2024-04-05 11:03:28 +01:00
Cadene
5af00d0c1e
fix train.py, stats, eval.py (training is running)
2024-04-05 09:31:39 +00:00
Alexander Soare
3a4dfa82fe
backup wip
2024-04-04 18:34:41 +01:00
Cadene
c93ce35d8c
WIP stats (TODO: run tests on stats + cmpute them)
2024-04-04 16:36:03 +00:00
Cadene
1cdfbc8b52
WIP
...
WIP
WIP train.py works, loss going down
WIP eval.py
Fix
WIP (eval running, TODO: verify results reproduced)
Eval works! (testing reproducibility)
WIP
pretrained model pusht reproduces same results as torchrl
pretrained model pusht reproduces same results as torchrl
Remove AbstractPolicy, Move all queues in select_action
WIP test_datasets passed (TODO: re-enable NormalizeTransform)
2024-04-04 15:31:03 +00:00
Alexander Soare
278336a39a
backup wip
2024-04-03 19:23:22 +01:00
Alexander Soare
110ac5ffa1
backup wip
2024-04-03 14:21:07 +01:00
Alexander Soare
c7d70a8db9
Merge remote-tracking branch 'upstream/main' into refactor_act
2024-04-03 10:08:12 +01:00
Alexander Soare
caf4ffcf65
add TODO
2024-04-03 09:56:46 +01:00
Alexander Soare
c50a62dd6d
clarifying math
2024-04-03 09:47:38 +01:00
Alexander Soare
e9eb262293
numerically sound mean computation
2024-04-03 09:44:20 +01:00
Alexander Soare
65ef8c30d0
backup wip
2024-04-02 19:13:49 +01:00
Alexander Soare
2b928eedd4
backup wip
2024-04-02 19:11:53 +01:00
Alexander Soare
a6edb85da4
Remove random sampling
2024-04-02 16:52:38 +01:00
Alexander Soare
95293d459d
fix stats computation
2024-04-02 16:40:33 +01:00
Alexander Soare
f1148b8c2d
Merge remote-tracking branch 'upstream/main' into finish_examples
2024-04-01 11:31:31 +01:00
Simon Alibert
6bddcb647e
Add test_aloha env test
2024-03-28 10:35:11 +01:00
Alexander Soare
b7c9c33072
revision
2024-03-27 18:33:48 +00:00
Alexander Soare
120f0aef5c
Merge remote-tracking branch 'upstream/main' into finish_examples
2024-03-27 17:52:36 +00:00
Alexander Soare
68d02c80cf
Remove b/c workaround
2024-03-27 12:03:19 +00:00
Alexander Soare
011f2d27fe
fix tests
2024-03-26 16:40:54 +00:00
Alexander Soare
1ed0110900
finish examples 2 and 3
2024-03-26 16:13:40 +00:00
Cadene
9ced0cf1fb
unskip
2024-03-26 10:45:31 +00:00
Cadene
5a46b8a2a9
fix tests
2024-03-26 10:24:46 +00:00
Alexander Soare
1a1308d62f
fix environment seeding
...
add fixes for reproducibility
only try to start env if it is closed
revision
fix normalization and data type
Improve README
Improve README
Tests are passing, Eval pretrained model works, Add gif
Update gif
Update gif
Update gif
Update gif
Update README
Update README
update minor
Update README.md
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
Update README.md
Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>
Address suggestions
Update thumbnail + stats
Update thumbnail + stats
Update README.md
Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>
Add more comments
Add test_examples.py
2024-03-26 10:10:43 +00:00
Simon Alibert
c5635b7d94
Minor fixes for #47
2024-03-25 18:50:47 +01:00
Simon Alibert
bcfdba109f
Update pre-commit & run on all files
2024-03-25 17:29:35 +01:00
Simon Alibert
7cdd6d2450
Renamed set_seed -> set_global_seed
2024-03-25 17:19:28 +01:00
Simon Alibert
058ac991eb
Add simxarm back into tests
2024-03-25 16:35:46 +01:00
Simon Alibert
d3adaf1379
Add stat.pth for xarm_lift_medium
2024-03-25 15:55:45 +01:00