lerobot

History

KeWang1017 ecb91b37eb Refactor SACPolicy for improved action sampling and standard deviation handling - Updated action selection to use distribution sampling and log probabilities for better stochastic behavior. - Enhanced standard deviation clamping to prevent extreme values, ensuring stability in policy outputs. - Cleaned up code by removing unnecessary comments and improving readability. These changes aim to refine the SAC implementation, enhancing its robustness and performance during training and inference.		2025-03-24 13:24:23 +01:00
..
act	User/pepijn/2025 03 17 act different image shapes (#870 )	2025-03-18 11:09:05 +01:00
diffusion	Add typos checks (#770 )	2025-02-25 23:51:15 +01:00
hilserl	Added normalization schemes and style checks	2025-03-24 13:24:23 +01:00
pi0	refactor(config): Move device & amp args to PreTrainedConfig (#812 )	2025-03-06 17:59:28 +01:00
sac	Refactor SACPolicy for improved action sampling and standard deviation handling	2025-03-24 13:24:23 +01:00
tdmpc	Add typos checks (#770 )	2025-02-25 23:51:15 +01:00
vqbet	Add typos checks (#770 )	2025-02-25 23:51:15 +01:00
__init__.py	chore(doc): add license header to all files (#818 )	2025-03-05 17:56:51 +01:00
factory.py	added optimizer and sac to factory.py	2025-03-24 13:23:53 +01:00
normalize.py	LeRobotDataset v2.1 (#711 )	2025-02-25 15:27:29 +01:00
pretrained.py	refactor(config): Move device & amp args to PreTrainedConfig (#812 )	2025-03-06 17:59:28 +01:00
utils.py	Simplify configs (#550 )	2025-01-31 13:57:37 +01:00