lerobot/lerobot
KeWang1017 a5228a0dfe Enhance SAC configuration and policy with new parameters and subsampling logic
- Added `num_subsample_critics`, `critic_target_update_weight`, and `utd_ratio` to SACConfig.
- Implemented target entropy calculation in SACPolicy if not provided.
- Introduced subsampling of critics to prevent overfitting during updates.
- Updated temperature loss calculation to use the new target entropy.
- Added comments for future UTD update implementation.

These changes improve the flexibility and performance of the SAC implementation.
2025-03-28 17:18:24 +00:00
..
common Enhance SAC configuration and policy with new parameters and subsampling logic 2025-03-28 17:18:24 +00:00
configs Add human intervention mechanism and eval_robot script to evaluate policy on the robot (#541) 2025-03-28 17:18:24 +00:00
scripts Update lerobot/scripts/train_hilserl_classifier.py 2025-03-28 17:18:24 +00:00
templates [visualization] Ignore 2d or 3d data for now (#809) 2025-03-04 10:53:01 +01:00
__init__.py Simplify configs (#550) 2025-01-31 13:57:37 +01:00
__version__.py Add copyrights (#157) 2024-05-15 12:13:09 +02:00