lerobot/lerobot
AdilZouitine 7960f2c3c1 Enhance SAC configuration and policy with gradient clipping and temperature management
- Introduced `grad_clip_norm` parameter in SAC configuration for gradient clipping
- Updated SACPolicy to store temperature as an instance variable for consistent usage
- Modified loss calculations in SACPolicy to utilize the instance temperature
- Enhanced MLP and CriticHead to support a customizable final activation function
- Implemented gradient clipping in the learner server during training steps for both actor and critic
- Added tracking for gradient norms in training information
2025-03-28 17:18:48 +00:00
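
The gradient clipping and norm tracking described in the commit message above can be sketched roughly as follows. This is a minimal, dependency-free illustration of clipping by global L2 norm, not the code from the commit: the helper `clip_grad_norm`, the example gradient values, the value chosen for `grad_clip_norm`, and the `critic_grad_norm` key are all hypothetical. In a PyTorch training loop the same step is commonly done with `torch.nn.utils.clip_grad_norm_`, though the listing here does not show which call the learner server uses.

```python
import math


def clip_grad_norm(grads, max_norm, eps=1e-6):
    """Scale gradients in place so their global L2 norm is at most max_norm.

    `grads` is a list of lists of floats (one inner list per parameter).
    Returns the global norm measured before clipping, so it can be logged.
    """
    total_norm = math.sqrt(sum(g * g for vec in grads for g in vec))
    # Never scale up: the factor is capped at 1.0.
    scale = min(1.0, max_norm / (total_norm + eps))
    for vec in grads:
        for i in range(len(vec)):
            vec[i] *= scale
    return total_norm


# Hypothetical config value and gradients, for illustration only.
grad_clip_norm = 1.0
critic_grads = [[3.0, 4.0], [0.0]]

critic_grad_norm = clip_grad_norm(critic_grads, max_norm=grad_clip_norm)

# Track the pre-clip norm alongside other training metrics (hypothetical key).
training_info = {"critic_grad_norm": critic_grad_norm}
```

Returning the pre-clip norm is a deliberate choice in this sketch: it lets the training loop log how large the gradients were before clipping, which is the quantity that reveals instability.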
common Enhance SAC configuration and policy with gradient clipping and temperature management 2025-03-28 17:18:48 +00:00
configs [Port HIL-SERL] Balanced sampler function speed up and refactor to align with train.py (#715) 2025-03-28 17:18:48 +00:00
scripts Enhance SAC configuration and policy with gradient clipping and temperature management 2025-03-28 17:18:48 +00:00
templates [visualization] Ignore 2d or 3d data for now (#809) 2025-03-04 10:53:01 +01:00
__init__.py [pre-commit.ci] auto fixes from pre-commit.com hooks 2025-03-28 17:18:48 +00:00
__version__.py Add copyrights (#157) 2024-05-15 12:13:09 +02:00