Cadene
|
cf5063e50e
|
Add diffusion policy (train and eval works, TODO: reproduce results)
|
2024-02-28 15:21:42 +00:00 |
Cadene
|
e543c9a42c
|
small fix %
|
2024-02-27 11:54:31 +00:00 |
Cadene
|
7df542445c
|
Small fix and improve logging message
|
2024-02-27 11:44:26 +00:00 |
Cadene
|
21670dce90
|
Refactor train, eval_policy, logger, Add diffusion.yaml (WIP)
|
2024-02-26 01:10:09 +00:00 |
Cadene
|
b16c334825
|
Refactor configs to have env in seperate yaml + Fix training
|
2024-02-25 17:42:47 +00:00 |
Cadene
|
ed80db2846
|
Sanitize cfg.env
|
2024-02-25 12:02:29 +00:00 |
Cadene
|
598bb496b0
|
Add policies/factory, Add test, Add _self_ in config
|
2024-02-25 10:50:23 +00:00 |
Cadene
|
aed02dc7c6
|
Add multithreading for video generation, Speed policy sampling
|
2024-02-24 18:18:39 +00:00 |
Cadene
|
63d18475cc
|
fix simxarm factory
|
2024-02-22 13:04:24 +00:00 |
Cadene
|
e3643d6146
|
Wandb works, One output dir
|
2024-02-22 12:14:12 +00:00 |
Cadene
|
3dc14b5576
|
Add Prod transform, Add test_factory
|
2024-02-20 14:22:16 +00:00 |
Cadene
|
3da6ffb2cb
|
Fix unit tests, Refactor, Add pusht env, (TODO pusht replay buffer, image preprocessing)
|
2024-02-20 12:26:57 +00:00 |
Cadene
|
a5c305a7a4
|
offline training + online finetuning converge to 33 reward!
|
2024-02-18 01:23:44 +00:00 |
Cadene
|
0b4084f0f8
|
Clean + alpha beta corresponds to config (before 0.7 and 0.9)
|
2024-02-16 16:27:54 +00:00 |
Cadene
|
c202c2b3c2
|
Online finetuning runs (sometimes crash because of nans)
|
2024-02-16 15:13:24 +00:00 |
Cadene
|
228c045674
|
Eval reproduced! Train running (but not reproduced)
|
2024-02-10 15:46:24 +00:00 |
Cadene
|
5a5b190f70
|
Add common, refactor eval with eval_policy
|
2024-01-31 13:48:12 +00:00 |
Cadene
|
1144819c29
|
First real commit, simxarm env added with torchrl!
|
2024-01-29 12:49:30 +00:00 |