lerobot

KeWang1017 70e3b9248c Refine SAC configuration and policy for enhanced performance	2025-03-28 17:18:24 +00:00

- Updated standard deviation parameterization in SACConfig to 'softplus' with defined min and max values for improved stability.
- Modified action sampling in SACPolicy to use reparameterized sampling, ensuring better gradient flow and log probability calculations.
- Cleaned up log probability calculations in TanhMultivariateNormalDiag for clarity and efficiency.
- Increased evaluation frequency in YAML configuration to 50000 for more efficient training cycles.

These changes aim to enhance the robustness and performance of the SAC implementation during training and inference.

datasets	Reward classifier and training (#528)	2025-03-28 17:18:24 +00:00
envs	chore(doc): add license header to all files (#818)	2025-03-05 17:56:51 +01:00
optim	chore(doc): add license header to all files (#818)	2025-03-05 17:56:51 +01:00
policies	Refine SAC configuration and policy for enhanced performance	2025-03-28 17:18:24 +00:00
robot_devices	Fixup	2025-03-28 17:18:24 +00:00
utils	Add offline mode in the configuration for wandb logging (#897)	2025-03-25 13:44:49 +01:00
constants.py	chore(doc): add license header to all files (#818)	2025-03-05 17:56:51 +01:00
logger.py	Added normalization schemes and style checks	2025-03-28 17:18:24 +00:00