Commit Graph

169 Commits

Author SHA1 Message Date
Alexander Soare e982c732f1 Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl 2024-04-08 09:25:45 +01:00
Cadene e1ac5dc62f fix aloha pixels env test 2024-04-07 17:20:54 +00:00
Cadene 4371a5570d Remove latency, tdmpc policy passes tests (TODO: make it work with online RL) 2024-04-07 16:01:22 +00:00
Cadene 44656d2706 test_envs are passing 2024-04-05 23:27:12 +00:00
Alexander Soare 8d2463f45b backup wip 2024-04-05 18:46:30 +01:00
Alexander Soare ecc7dd3b17 Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl 2024-04-05 18:35:13 +01:00
Cadene a2d3588fca wrap dm_control aloha into gymnasium (TODO: properly seeding the env) 2024-04-05 17:17:31 +00:00
Cadene 29032fbcd3 wrap dm_control aloha into gymnasium (TODO: properly seeding the env) 2024-04-05 17:17:14 +00:00
Alexander Soare ab2286025b Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl 2024-04-05 18:06:00 +01:00
Alexander Soare 1e71196fe3 backup wip 2024-04-05 17:38:29 +01:00
Cadene 26602269cd test_envs.py are passing, remove simxarm and pusht directories 2024-04-05 16:21:07 +00:00
Cadene f56b1a0e16 WIP tdmpc 2024-04-05 13:40:31 +00:00
Simon Alibert ab3cd3a7ba (WIP) Add gym-xarm 2024-04-05 15:35:20 +02:00
Alexander Soare 0b8d27ff2c Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl 2024-04-05 12:48:11 +01:00
Cadene c17dffe944 policies/utils.py 2024-04-05 11:47:15 +00:00
Alexander Soare 8ba88ba250 Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl 2024-04-05 12:34:14 +01:00
Cadene a420714ee4 fix: action_is_pad was missing in compute_loss 2024-04-05 11:33:39 +00:00
Alexander Soare 4863e54ce9 Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl 2024-04-05 12:00:31 +01:00
Cadene ad3379a73a fix memory leak due to itertools.cycle 2024-04-05 10:59:32 +00:00
Alexander Soare 9d77f5773d Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl 2024-04-05 11:41:11 +01:00
Alexander Soare edb125b351 backup wip 2024-04-05 11:03:28 +01:00
Cadene 5af00d0c1e fix train.py, stats, eval.py (training is running) 2024-04-05 09:31:39 +00:00
Alexander Soare 3a4dfa82fe backup wip 2024-04-04 18:34:41 +01:00
Cadene c93ce35d8c WIP stats (TODO: run tests on stats + cmpute them) 2024-04-04 16:36:03 +00:00
Cadene 1cdfbc8b52 WIP
WIP

WIP train.py works, loss going down

WIP eval.py

Fix

WIP (eval running, TODO: verify results reproduced)

Eval works! (testing reproducibility)

WIP

pretrained model pusht reproduces same results as torchrl

pretrained model pusht reproduces same results as torchrl

Remove AbstractPolicy, Move all queues in select_action

WIP test_datasets passed (TODO: re-enable NormalizeTransform)
2024-04-04 15:31:03 +00:00
Alexander Soare 278336a39a backup wip 2024-04-03 19:23:22 +01:00
Alexander Soare 110ac5ffa1 backup wip 2024-04-03 14:21:07 +01:00
Alexander Soare c7d70a8db9 Merge remote-tracking branch 'upstream/main' into refactor_act 2024-04-03 10:08:12 +01:00
Alexander Soare caf4ffcf65 add TODO 2024-04-03 09:56:46 +01:00
Alexander Soare c50a62dd6d clarifying math 2024-04-03 09:47:38 +01:00
Alexander Soare e9eb262293 numerically sound mean computation 2024-04-03 09:44:20 +01:00
Alexander Soare 65ef8c30d0 backup wip 2024-04-02 19:13:49 +01:00
Alexander Soare 2b928eedd4 backup wip 2024-04-02 19:11:53 +01:00
Alexander Soare a6edb85da4 Remove random sampling 2024-04-02 16:52:38 +01:00
Alexander Soare 95293d459d fix stats computation 2024-04-02 16:40:33 +01:00
Alexander Soare f1148b8c2d Merge remote-tracking branch 'upstream/main' into finish_examples 2024-04-01 11:31:31 +01:00
Simon Alibert 6bddcb647e Add test_aloha env test 2024-03-28 10:35:11 +01:00
Alexander Soare b7c9c33072 revision 2024-03-27 18:33:48 +00:00
Alexander Soare 120f0aef5c Merge remote-tracking branch 'upstream/main' into finish_examples 2024-03-27 17:52:36 +00:00
Alexander Soare 68d02c80cf Remove b/c workaround 2024-03-27 12:03:19 +00:00
Alexander Soare 011f2d27fe fix tests 2024-03-26 16:40:54 +00:00
Alexander Soare 1ed0110900 finish examples 2 and 3 2024-03-26 16:13:40 +00:00
Cadene 9ced0cf1fb unskip 2024-03-26 10:45:31 +00:00
Cadene 5a46b8a2a9 fix tests 2024-03-26 10:24:46 +00:00
Alexander Soare 1a1308d62f fix environment seeding
add fixes for reproducibility

only try to start env if it is closed

revision

fix normalization and data type

Improve README

Improve README

Tests are passing, Eval pretrained model works, Add gif

Update gif

Update gif

Update gif

Update gif

Update README

Update README

update minor

Update README.md

Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>

Update README.md

Co-authored-by: Simon Alibert <75076266+aliberts@users.noreply.github.com>

Address suggestions

Update thumbnail + stats

Update thumbnail + stats

Update README.md

Co-authored-by: Alexander Soare <alexander.soare159@gmail.com>

Add more comments

Add test_examples.py
2024-03-26 10:10:43 +00:00
Simon Alibert c5635b7d94 Minor fixes for #47 2024-03-25 18:50:47 +01:00
Simon Alibert bcfdba109f Update pre-commit & run on all files 2024-03-25 17:29:35 +01:00
Simon Alibert 7cdd6d2450 Renamed set_seed -> set_global_seed 2024-03-25 17:19:28 +01:00
Simon Alibert 058ac991eb Add simxarm back into tests 2024-03-25 16:35:46 +01:00
Simon Alibert d3adaf1379 Add stat.pth for xarm_lift_medium 2024-03-25 15:55:45 +01:00