lerobot/lerobot
AdilZouitine e35ee47b07 Refactor SAC policy and training loop to enhance discrete action support
- Updated SACPolicy to conditionally compute losses for grasp critic based on num_discrete_actions.
- Simplified forward method to return loss outputs as a dictionary for better clarity.
- Adjusted learner_server to handle both main and grasp critic losses during training.
- Ensured optimizers are created conditionally for grasp critic based on configuration settings.
2025-04-16 16:46:37 +02:00
..
common Refactor SAC policy and training loop to enhance discrete action support 2025-04-16 16:46:37 +02:00
configs [pre-commit.ci] auto fixes from pre-commit.com hooks 2025-03-28 17:20:39 +00:00
scripts Refactor SAC policy and training loop to enhance discrete action support 2025-04-16 16:46:37 +02:00
templates [visualization] Ignore 2d or 3d data for now (#809) 2025-03-04 10:53:01 +01:00
__init__.py [pre-commit.ci] auto fixes from pre-commit.com hooks 2025-03-28 17:18:48 +00:00
__version__.py Add copyrights (#157) 2024-05-15 12:13:09 +02:00