lerobot

KeWang1017 5b4adc00bb Refactor SAC configuration and policy for improved action sampling and stability	2024-12-29 14:27:19 +00:00
- Updated SACConfig to replace standard deviation parameterization with log_std_min and log_std_max for better control over action distributions.
- Modified SACPolicy to streamline action selection and log probability calculations, enhancing stochastic behavior.
- Removed deprecated TanhMultivariateNormalDiag class to simplify the codebase and improve maintainability.
These changes aim to enhance the robustness and performance of the SAC implementation during training and inference.
datasets	Reward classifier and training (#528)	2024-12-17 02:41:29 +07:00
envs	small fix: assertion error message in envs/utils.py (#426)	2024-09-12 18:03:34 +02:00
policies	Refactor SAC configuration and policy for improved action sampling and stability	2024-12-29 14:27:19 +00:00
robot_devices	Fixup	2024-12-17 02:42:53 +07:00
utils	Make `say(blocking=True)` work for Linux (#460)	2024-10-17 15:22:21 +01:00
logger.py	Added normalization schemes and style checks	2024-12-29 12:51:21 +00:00