lerobot/lerobot/common
KeWang1017 5b4adc00bb Refactor SAC configuration and policy for improved action sampling and stability
- Updated SACConfig to replace standard deviation parameterization with log_std_min and log_std_max for better control over action distributions.
- Modified SACPolicy to streamline action selection and log probability calculations, enhancing stochastic behavior.
- Removed deprecated TanhMultivariateNormalDiag class to simplify the codebase and improve maintainability.

These changes aim to enhance the robustness and performance of the SAC implementation during training and inference.
2024-12-29 14:27:19 +00:00
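
For readers unfamiliar with the parameterization named in the commit message, here is a minimal, dependency-free sketch of how a tanh-squashed Gaussian action can be sampled with a bounded log standard deviation. It is not the code from this commit. The function names, the hard clamp, and the default bounds (-5.0 and 2.0) are all assumptions made for illustration. Only the parameter names `log_std_min` and `log_std_max` come from the commit message.

```python
import math
import random


def clamp_log_std(raw_log_std, log_std_min=-5.0, log_std_max=2.0):
    """Clamp an unbounded log-std into [log_std_min, log_std_max].

    The bounds are illustrative defaults, not values taken from SACConfig.
    """
    return max(log_std_min, min(log_std_max, raw_log_std))


def _softplus(x):
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def log_one_minus_tanh_sq(u):
    """Stable evaluation of log(1 - tanh(u)^2)."""
    return 2.0 * (math.log(2.0) - u - _softplus(-2.0 * u))


def sample_action(means, raw_log_stds, rng, log_std_min=-5.0, log_std_max=2.0):
    """Sample a squashed action and its log probability (hypothetical helper).

    Each dimension is drawn from an independent Gaussian, passed through
    tanh, and the log probability is corrected for the change of variables.
    """
    actions = []
    log_prob = 0.0
    for mean, raw in zip(means, raw_log_stds):
        log_std = clamp_log_std(raw, log_std_min, log_std_max)
        eps = rng.gauss(0.0, 1.0)
        u = mean + math.exp(log_std) * eps  # reparameterized pre-squash sample
        # Log density of u under N(mean, std^2), written in terms of eps.
        log_prob += -0.5 * eps * eps - log_std - 0.5 * math.log(2.0 * math.pi)
        # Subtract the log-determinant of the tanh Jacobian.
        log_prob -= log_one_minus_tanh_sq(u)
        actions.append(math.tanh(u))
    return actions, log_prob


if __name__ == "__main__":
    rng = random.Random(0)
    action, logp = sample_action([0.0, 0.5], [0.0, 10.0], rng)
    print(action, logp)
```

Bounding the log-std keeps the standard deviation inside a fixed range, which is one common way to avoid collapsed or exploding action distributions during training.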
datasets Reward classifier and training (#528) 2024-12-17 02:41:29 +07:00
envs small fix: assertion error message in envs/utils.py (#426) 2024-09-12 18:03:34 +02:00
policies Refactor SAC configuration and policy for improved action sampling and stability 2024-12-29 14:27:19 +00:00
robot_devices Fixup 2024-12-17 02:42:53 +07:00
utils Make `say(blocking=True)` work for Linux (#460) 2024-10-17 15:22:21 +01:00
logger.py Added normalization schemes and style checks 2024-12-29 12:51:21 +00:00