Simon Alibert
|
dba0375089
|
Fix CI
|
2024-04-09 10:45:58 +02:00 |
Simon Alibert
|
d21543eb4f
|
Add env.close()
|
2024-04-09 10:41:20 +02:00 |
Simon Alibert
|
dfaacbcf5a
|
Split dev/test dependencies
|
2024-04-09 10:40:11 +02:00 |
Simon Alibert
|
2573e89e1d
|
Remove direct dependencies
|
2024-04-09 10:38:08 +02:00 |
Simon Alibert
|
274f20b49d
|
Update gym-pusht
|
2024-04-09 10:25:41 +02:00 |
Simon Alibert
|
d9019d9e7e
|
disable env_checker in factory
|
2024-04-09 10:24:28 +02:00 |
Cadene
|
19e7661b8d
|
Remove torchrl/tensordict from dependecies + update poetry cpu
|
2024-04-09 03:50:49 +00:00 |
Cadene
|
253e495df2
|
remove render(mode=visualization)
|
2024-04-09 03:46:05 +00:00 |
Cadene
|
6902e01db0
|
tests are passing for aloha/act policies, removes abstract policy
|
2024-04-09 03:28:56 +00:00 |
Cadene
|
73dfa3c8e3
|
tests for tdmpc and diffusion policy are passing
|
2024-04-09 02:50:32 +00:00 |
Remi
|
1e09507bc1
|
Merge pull request #69 from huggingface/user/aliberts/2024_04_08_remove_envs
Remove envs
|
2024-04-08 16:55:20 +02:00 |
Cadene
|
1149894e1d
|
rename handle -> task
|
2024-04-08 14:54:52 +00:00 |
Simon Alibert
|
6c792f0d3d
|
Update README
|
2024-04-08 16:24:11 +02:00 |
Simon Alibert
|
3f6dfa4916
|
Add gym-aloha, rename simxarm -> xarm, refactor
|
2024-04-08 16:24:11 +02:00 |
Simon Alibert
|
5dff6d8339
|
remove aloha
|
2024-04-08 16:22:13 +02:00 |
Cadene
|
70aaf1c4cb
|
test_datasets.py are passing!
|
2024-04-08 14:16:57 +00:00 |
Cadene
|
e1ac5dc62f
|
fix aloha pixels env test
|
2024-04-07 17:20:54 +00:00 |
Cadene
|
4371a5570d
|
Remove latency, tdmpc policy passes tests (TODO: make it work with online RL)
|
2024-04-07 16:01:22 +00:00 |
Cadene
|
44656d2706
|
test_envs are passing
|
2024-04-05 23:27:12 +00:00 |
Cadene
|
5eff40b3d6
|
rename task, sim_transfer -> transfer
|
2024-04-05 17:18:37 +00:00 |
Cadene
|
a2d3588fca
|
wrap dm_control aloha into gymnasium (TODO: properly seeding the env)
|
2024-04-05 17:17:31 +00:00 |
Cadene
|
29032fbcd3
|
wrap dm_control aloha into gymnasium (TODO: properly seeding the env)
|
2024-04-05 17:17:14 +00:00 |
Cadene
|
26602269cd
|
test_envs.py are passing, remove simxarm and pusht directories
|
2024-04-05 16:21:07 +00:00 |
Cadene
|
f56b1a0e16
|
WIP tdmpc
|
2024-04-05 13:40:31 +00:00 |
Simon Alibert
|
ab3cd3a7ba
|
(WIP) Add gym-xarm
|
2024-04-05 15:35:20 +02:00 |
Cadene
|
c17dffe944
|
policies/utils.py
|
2024-04-05 11:47:15 +00:00 |
Cadene
|
a420714ee4
|
fix: action_is_pad was missing in compute_loss
|
2024-04-05 11:33:39 +00:00 |
Cadene
|
ad3379a73a
|
fix memory leak due to itertools.cycle
|
2024-04-05 10:59:32 +00:00 |
Cadene
|
5af00d0c1e
|
fix train.py, stats, eval.py (training is running)
|
2024-04-05 09:31:39 +00:00 |
Cadene
|
c93ce35d8c
|
WIP stats (TODO: run tests on stats + cmpute them)
|
2024-04-04 16:36:03 +00:00 |
Cadene
|
1cdfbc8b52
|
WIP
WIP
WIP train.py works, loss going down
WIP eval.py
Fix
WIP (eval running, TODO: verify results reproduced)
Eval works! (testing reproducibility)
WIP
pretrained model pusht reproduces same results as torchrl
pretrained model pusht reproduces same results as torchrl
Remove AbstractPolicy, Move all queues in select_action
WIP test_datasets passed (TODO: re-enable NormalizeTransform)
|
2024-04-04 15:31:03 +00:00 |
Alexander Soare
|
920e0d118b
|
Merge pull request #66 from alexander-soare/fix_stats_computation
fix stats computation
|
2024-04-03 10:03:47 +01:00 |
Alexander Soare
|
caf4ffcf65
|
add TODO
|
2024-04-03 09:56:46 +01:00 |
Alexander Soare
|
a6ec4fbf58
|
remove try-catch
|
2024-04-03 09:53:15 +01:00 |
Alexander Soare
|
c50a62dd6d
|
clarifying math
|
2024-04-03 09:47:38 +01:00 |
Alexander Soare
|
e9eb262293
|
numerically sound mean computation
|
2024-04-03 09:44:20 +01:00 |
Alexander Soare
|
7242953197
|
revision
|
2024-04-02 19:19:13 +01:00 |
Alexander Soare
|
c3234adc7d
|
fix indentation
|
2024-04-02 16:59:19 +01:00 |
Alexander Soare
|
148df1c1d5
|
add comment on test
|
2024-04-02 16:57:25 +01:00 |
Alexander Soare
|
a6edb85da4
|
Remove random sampling
|
2024-04-02 16:52:38 +01:00 |
Alexander Soare
|
95293d459d
|
fix stats computation
|
2024-04-02 16:40:33 +01:00 |
Alexander Soare
|
11cbf1bea1
|
Merge pull request #53 from alexander-soare/finish_examples
Add examples 2 and 3
|
2024-04-01 11:52:41 +01:00 |
Alexander Soare
|
f1148b8c2d
|
Merge remote-tracking branch 'upstream/main' into finish_examples
|
2024-04-01 11:31:31 +01:00 |
Simon Alibert
|
2a98cc71ed
|
Merge pull request #56 from huggingface/user/aliberts/2024_03_27_improve_ci
Add code coverage, more end-to-end tests
|
2024-03-28 10:57:44 +01:00 |
Simon Alibert
|
a7c9b78e56
|
Deactivate eval ACT on Aloha (policy is None)
|
2024-03-28 10:55:11 +01:00 |
Simon Alibert
|
404b8f8a75
|
Fix end-to-end ACT train on Aloha
|
2024-03-28 10:35:11 +01:00 |
Simon Alibert
|
17c2bbbeb8
|
remove todo
|
2024-03-28 10:35:11 +01:00 |
Simon Alibert
|
006e5feabf
|
WIP add code coverage
|
2024-03-28 10:35:11 +01:00 |
Simon Alibert
|
b99ee8180a
|
Add more end-to-end tests
|
2024-03-28 10:35:11 +01:00 |
Simon Alibert
|
6bddcb647e
|
Add test_aloha env test
|
2024-03-28 10:35:11 +01:00 |