lerobot/lerobot/common/policies
KeWang1017 ecb91b37eb Refactor SACPolicy for improved action sampling and standard deviation handling
- Updated action selection to use distribution sampling and log probabilities for better stochastic behavior.
- Enhanced standard deviation clamping to prevent extreme values, ensuring stability in policy outputs.
- Cleaned up code by removing unnecessary comments and improving readability.

These changes aim to refine the SAC implementation, enhancing its robustness and performance during training and inference.
2025-03-24 13:24:23 +01:00
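The two behaviours named in the commit message above (sampling an action from a distribution together with its log-probability, and clamping the standard deviation) can be illustrated with a minimal, standard-library sketch of a tanh-squashed Gaussian policy head. This is not the code from the commit: the bounds `LOG_STD_MIN` and `LOG_STD_MAX`, the helper names, and the scalar (one-dimensional) action are all assumptions made for illustration.

```python
import math
import random

# Hypothetical clamp bounds; the values used in the actual commit are not shown in this listing.
LOG_STD_MIN = -5.0
LOG_STD_MAX = 2.0


def clamp(x, lo, hi):
    """Restrict x to the closed interval [lo, hi]."""
    return max(lo, min(hi, x))


def sample_action(mean, log_std, rng):
    """Sample one tanh-squashed Gaussian action and return (action, log_prob)."""
    # Clamp the log standard deviation so std can neither collapse nor explode.
    log_std = clamp(log_std, LOG_STD_MIN, LOG_STD_MAX)
    std = math.exp(log_std)

    # Draw the pre-squash sample u ~ N(mean, std^2).
    u = rng.gauss(mean, std)
    action = math.tanh(u)  # squash into [-1, 1]

    # Gaussian log-density of u.
    log_prob = -0.5 * ((u - mean) / std) ** 2 - log_std - 0.5 * math.log(2.0 * math.pi)
    # Change of variables for the tanh squash: subtract log(1 - tanh(u)^2).
    # The small epsilon keeps the logarithm finite when tanh saturates.
    log_prob -= math.log(1.0 - action * action + 1e-6)
    return action, log_prob


if __name__ == "__main__":
    rng = random.Random(0)
    action, log_prob = sample_action(mean=0.3, log_std=-1.0, rng=rng)
    print(action, log_prob)
```

In a real SAC implementation the same steps would typically operate on batched tensors, and the clamp bounds would come from the policy configuration rather than module-level constants.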
act User/pepijn/2025 03 17 act different image shapes (#870) 2025-03-18 11:09:05 +01:00
diffusion Add typos checks (#770) 2025-02-25 23:51:15 +01:00
hilserl Added normalization schemes and style checks 2025-03-24 13:24:23 +01:00
pi0 refactor(config): Move device & amp args to PreTrainedConfig (#812) 2025-03-06 17:59:28 +01:00
sac Refactor SACPolicy for improved action sampling and standard deviation handling 2025-03-24 13:24:23 +01:00
tdmpc Add typos checks (#770) 2025-02-25 23:51:15 +01:00
vqbet Add typos checks (#770) 2025-02-25 23:51:15 +01:00
__init__.py chore(doc): add license header to all files (#818) 2025-03-05 17:56:51 +01:00
factory.py added optimizer and sac to factory.py 2025-03-24 13:23:53 +01:00
normalize.py LeRobotDataset v2.1 (#711) 2025-02-25 15:27:29 +01:00
pretrained.py refactor(config): Move device & amp args to PreTrainedConfig (#812) 2025-03-06 17:59:28 +01:00
utils.py Simplify configs (#550) 2025-01-31 13:57:37 +01:00