lerobot

AdilZouitine 7960f2c3c1 Enhance SAC configuration and policy with gradient clipping and temperature management	2025-03-28 17:18:48 +00:00
- Introduced `grad_clip_norm` parameter in SAC configuration for gradient clipping
- Updated SACPolicy to store temperature as an instance variable for consistent usage
- Modified loss calculations in SACPolicy to utilize the instance temperature
- Enhanced MLP and CriticHead to support a customizable final activation function
- Implemented gradient clipping in the learner server during training steps for both actor and critic
- Added tracking for gradient norms in training information

common	Enhance SAC configuration and policy with gradient clipping and temperature management	2025-03-28 17:18:48 +00:00
configs	[Port HIL-SERL] Balanced sampler function speed up and refactor to align with train.py (#715)	2025-03-28 17:18:48 +00:00
scripts	Enhance SAC configuration and policy with gradient clipping and temperature management	2025-03-28 17:18:48 +00:00
templates	[visualization] Ignore 2d or 3d data for now (#809)	2025-03-04 10:53:01 +01:00
__init__.py	[pre-commit.ci] auto fixes from pre-commit.com hooks	2025-03-28 17:18:48 +00:00
__version__.py	Add copyrights (#157)	2024-05-15 12:13:09 +02:00