Commit Graph

502 Commits

Author SHA1 Message Date
Remi 1e09507bc1 Merge pull request #69 from huggingface/user/aliberts/2024_04_08_remove_envs 2024-04-08 16:55:20 +02:00
    Remove envs
Cadene 1149894e1d rename handle -> task 2024-04-08 14:54:52 +00:00
Alexander Soare 9c96349926 Merge remote-tracking branch 'upstream/user/rcadene/2024_03_31_remove_torchrl' into refactor_act 2024-04-08 15:44:00 +01:00
Simon Alibert 6c792f0d3d Update README 2024-04-08 16:24:11 +02:00
Simon Alibert 3f6dfa4916 Add gym-aloha, rename simxarm -> xarm, refactor 2024-04-08 16:24:11 +02:00
Simon Alibert 5dff6d8339 remove aloha 2024-04-08 16:22:13 +02:00
Cadene 70aaf1c4cb test_datasets.py are passing! 2024-04-08 14:16:57 +00:00
Alexander Soare 91e0e4e175 revert change 2024-04-08 15:05:40 +01:00
Alexander Soare 0b4c42f4ff typos 2024-04-08 14:59:37 +01:00
Alexander Soare 62b18a7607 Add type hints 2024-04-08 14:51:45 +01:00
Alexander Soare 86365adf9f revision 2024-04-08 14:44:46 +01:00
Alexander Soare 0a721f3d94 empty commit 2024-04-08 13:21:38 +01:00
Alexander Soare 863f28ffd8 ready for review 2024-04-08 13:10:19 +01:00
Alexander Soare 1bab4a1dd5 Eval reproduction works with gym_aloha 2024-04-08 10:23:26 +01:00
Alexander Soare e982c732f1 Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl 2024-04-08 09:25:45 +01:00
Cadene e1ac5dc62f fix aloha pixels env test 2024-04-07 17:20:54 +00:00
Cadene 4371a5570d Remove latency, tdmpc policy passes tests (TODO: make it work with online RL) 2024-04-07 16:01:22 +00:00
Cadene 44656d2706 test_envs are passing 2024-04-05 23:27:12 +00:00
Alexander Soare 8d2463f45b backup wip 2024-04-05 18:46:30 +01:00
Alexander Soare ecc7dd3b17 Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl 2024-04-05 18:35:13 +01:00
Cadene 5eff40b3d6 rename task, sim_transfer -> transfer 2024-04-05 17:18:37 +00:00
Cadene a2d3588fca wrap dm_control aloha into gymnasium (TODO: properly seeding the env) 2024-04-05 17:17:31 +00:00
Cadene 29032fbcd3 wrap dm_control aloha into gymnasium (TODO: properly seeding the env) 2024-04-05 17:17:14 +00:00
Alexander Soare ab2286025b Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl 2024-04-05 18:06:00 +01:00
Alexander Soare 1e71196fe3 backup wip 2024-04-05 17:38:29 +01:00
Cadene 26602269cd test_envs.py are passing, remove simxarm and pusht directories 2024-04-05 16:21:07 +00:00
Alexander Soare 9c28ac8aa4 re-add pre-commit check 2024-04-05 15:25:11 +01:00
Cadene f56b1a0e16 WIP tdmpc 2024-04-05 13:40:31 +00:00
Simon Alibert ab3cd3a7ba (WIP) Add gym-xarm 2024-04-05 15:35:20 +02:00
Alexander Soare 0b8d27ff2c Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl 2024-04-05 12:48:11 +01:00
Cadene c17dffe944 policies/utils.py 2024-04-05 11:47:15 +00:00
Alexander Soare 8ba88ba250 Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl 2024-04-05 12:34:14 +01:00
Cadene a420714ee4 fix: action_is_pad was missing in compute_loss 2024-04-05 11:33:39 +00:00
Alexander Soare 4863e54ce9 Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl 2024-04-05 12:00:31 +01:00
Cadene ad3379a73a fix memory leak due to itertools.cycle 2024-04-05 10:59:32 +00:00
Alexander Soare 9d77f5773d Merge remote-tracking branch 'Cadene/user/rcadene/2024_03_31_remove_torchrl' into refactor_act_remove_torchrl 2024-04-05 11:41:11 +01:00
Alexander Soare edb125b351 backup wip 2024-04-05 11:03:28 +01:00
Cadene 5af00d0c1e fix train.py, stats, eval.py (training is running) 2024-04-05 09:31:39 +00:00
Alexander Soare 3a4dfa82fe backup wip 2024-04-04 18:34:41 +01:00
Cadene c93ce35d8c WIP stats (TODO: run tests on stats + compute them) 2024-04-04 16:36:03 +00:00
Cadene 1cdfbc8b52 WIP 2024-04-04 15:31:03 +00:00
    WIP
    WIP train.py works, loss going down
    WIP eval.py
    Fix
    WIP (eval running, TODO: verify results reproduced)
    Eval works! (testing reproducibility)
    WIP
    pretrained model pusht reproduces same results as torchrl
    pretrained model pusht reproduces same results as torchrl
    Remove AbstractPolicy, Move all queues in select_action
    WIP test_datasets passed (TODO: re-enable NormalizeTransform)
Alexander Soare 278336a39a backup wip 2024-04-03 19:23:22 +01:00
Alexander Soare 110ac5ffa1 backup wip 2024-04-03 14:21:07 +01:00
Alexander Soare c7d70a8db9 Merge remote-tracking branch 'upstream/main' into refactor_act 2024-04-03 10:08:12 +01:00
Alexander Soare 920e0d118b Merge pull request #66 from alexander-soare/fix_stats_computation 2024-04-03 10:03:47 +01:00
    fix stats computation
Alexander Soare caf4ffcf65 add TODO 2024-04-03 09:56:46 +01:00
Alexander Soare a6ec4fbf58 remove try-catch 2024-04-03 09:53:15 +01:00
Alexander Soare c50a62dd6d clarifying math 2024-04-03 09:47:38 +01:00
Alexander Soare e9eb262293 numerically sound mean computation 2024-04-03 09:44:20 +01:00
Alexander Soare 7242953197 revision 2024-04-02 19:19:13 +01:00