lerobot

Commit Graph

Author	SHA1	Message	Date
AdilZouitine	699d374d89	Refactor SACPolicy for improved readability and action dimension handling - Cleaned up code formatting for better readability, including consistent spacing and removal of unnecessary blank lines. - Consolidated continuous action dimension calculation to enhance clarity and maintainability. - Simplified loss return statements in the forward method to improve code structure. - Ensured grasp critic parameters are included conditionally based on configuration settings.	2025-04-01 15:43:29 +00:00
AdilZouitine	451a7b01db	Add mock gripper support and enhance SAC policy action handling - Introduced mock_gripper parameter in ManiskillEnvConfig to enable gripper simulation. - Added ManiskillMockGripperWrapper to adjust action space for environments with discrete actions. - Updated SACPolicy to compute continuous action dimensions correctly, ensuring compatibility with the new gripper setup. - Refactored action handling in the training loop to accommodate the changes in action dimensions.	2025-04-01 14:22:08 +00:00
AdilZouitine	306c735172	Refactor SAC policy and training loop to enhance discrete action support - Updated SACPolicy to conditionally compute losses for grasp critic based on num_discrete_actions. - Simplified forward method to return loss outputs as a dictionary for better clarity. - Adjusted learner_server to handle both main and grasp critic losses during training. - Ensured optimizers are created conditionally for grasp critic based on configuration settings.	2025-04-01 11:42:28 +00:00
AdilZouitine	6a215f47dd	Refactor SAC configuration and policy to support discrete actions - Removed GraspCriticNetworkConfig class and integrated its parameters into SACConfig. - Added num_discrete_actions parameter to SACConfig for better action handling. - Updated SACPolicy to conditionally create grasp critic networks based on num_discrete_actions. - Enhanced grasp critic forward pass to handle discrete actions and compute losses accordingly.	2025-04-01 11:32:24 +02:00
Michel Aractingi	fe2ff516a8	Added Gripper quantization wrapper and grasp penalty; removed complementary info from buffer and learner server; removed get_gripper_action function; added gripper parameters to `common/envs/configs.py`	2025-04-01 11:08:15 +02:00
pre-commit-ci[bot]	7983baf4fc	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-03-31 16:10:01 +00:00
s1lent4gnt	c774bbe522	Add grasp critic to the training loop - Integrated the grasp critic gradient update to the training loop in learner_server - Added Adam optimizer and configured grasp critic learning rate in configuration_sac - Added target critics networks update after the critics gradient step	2025-03-31 18:06:21 +02:00
s1lent4gnt	2c1e5fa28b	Add get_gripper_action method to GamepadController	2025-03-31 17:40:00 +02:00
s1lent4gnt	7452f9baaa	Add gripper penalty wrapper	2025-03-31 17:38:16 +02:00
s1lent4gnt	007fee9230	Add complementary info in the replay buffer - Added complementary info in the add method - Added complementary info in the sample method	2025-03-31 17:36:35 +02:00
s1lent4gnt	4a1c26d9ee	Add grasp critic - Implemented grasp critic to evaluate gripper actions - Added corresponding config parameters for tuning	2025-03-31 17:35:59 +02:00
pre-commit-ci[bot]	0f706ce543	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-03-31 13:59:32 +00:00
AdilZouitine	026ad463a9	Fix convergence of sac, multiple torch compile on the same model caused divergence	2025-03-31 13:54:21 +00:00
AdilZouitine	8494634d48	Fix cuda graph break	2025-03-31 07:59:56 +00:00
s1lent4gnt	66c3672738	Fix: Prevent Invalid next_state References When optimize_memory=True (#918)	2025-03-31 09:43:40 +02:00
pre-commit-ci[bot]	c05e4835d0	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-03-28 17:20:39 +00:00
Michel Aractingi	808cf63221	Added support for controlling the gripper with the pygame interface of gamepad; Minor modifications in gym_manipulator to quantize the gripper actions; clamped the observations after F.resize in ConvertToLeRobotObservation wrapper: due to a bug in F.resize, images were returned exceeding the maximum value of 1.0	2025-03-28 17:18:48 +00:00
AdilZouitine	0150139668	Refactor SACPolicy for improved type annotations and readability - Enhanced type annotations for variables in the `SACPolicy` class to improve code clarity. - Updated method calls to use keyword arguments for better readability. - Streamlined the extraction of batch components, ensuring consistent typing across the class methods.	2025-03-28 17:18:48 +00:00
AdilZouitine	b3ad63cf6e	Refactor SACPolicy and learner_server for improved clarity and functionality - Updated the `forward` method in `SACPolicy` to handle loss computation for actor, critic, and temperature models. - Replaced direct calls to `compute_loss_*` methods with a unified `forward` method in `learner_server`. - Enhanced batch processing by consolidating input parameters into a single dictionary for better readability and maintainability. - Removed redundant code and improved documentation for clarity.	2025-03-28 17:18:48 +00:00
AdilZouitine	8b02e81bb5	Refactor actor_server.py for improved structure and logging - Consolidated logging initialization and enhanced logging for actor processes. - Streamlined the handling of gRPC connections and process management. - Improved readability by organizing core algorithm functions and communication functions. - Added detailed comments and documentation for clarity. - Ensured proper queue management and shutdown handling for actor processes.	2025-03-28 17:18:48 +00:00
AdilZouitine	dcce446a66	Refactor learner_server.py for improved structure and clarity - Removed unused imports and streamlined the code structure. - Consolidated logging initialization and enhanced logging for training processes. - Improved handling of training state loading and resume logic. - Refactored transition and interaction message processing for better readability and maintainability. - Added detailed comments and documentation for clarity.	2025-03-28 17:18:48 +00:00
AdilZouitine	82a6b69e0e	Refactor imports in modeling_sac.py for improved organization - Rearranged import statements for better readability. - Removed unused imports and streamlined the code structure.	2025-03-28 17:18:48 +00:00
AdilZouitine	6f7024242a	Refactor SACConfig properties for improved readability - Simplified the `image_features` property to directly iterate over `input_features`. - Removed unused imports and unnecessary code related to main execution, enhancing clarity and maintainability.	2025-03-28 17:18:48 +00:00
AdilZouitine	3c56ad33c3	fix	2025-03-28 17:18:48 +00:00
AdilZouitine	49baa1ff49	Enhance logging for actor and learner servers - Implemented process-specific logging for actor and learner servers to improve traceability. - Created a dedicated logs directory and ensured it exists before logging. - Initialized logging with explicit log files for each process, including actor transitions, interactions, and policy. - Updated the actor CLI to validate configuration and set up logging accordingly.	2025-03-28 17:18:48 +00:00
Michel Aractingi	02b9ea9446	Added gripper control mechanism to gym_manipulator; Moved HilSerl env config to configs/env/configs.py; fixes in actor_server and modeling_sac and configuration_sac; added the possibility of ignoring missing keys in env_cfg in get_features_from_env_config function	2025-03-28 17:18:48 +00:00
AdilZouitine	79e0f6e06c	Add WrapperConfig for environment wrappers and update SACConfig properties - Introduced `WrapperConfig` dataclass for environment wrapper configurations. - Updated `ManiskillEnvConfig` to include a `wrapper` field for enhanced environment management. - Modified `SACConfig` to return `None` for `observation_delta_indices` and `action_delta_indices` properties. - Refactored `make_robot_env` function to improve readability and maintainability.	2025-03-28 17:18:48 +00:00
Michel Aractingi	d0b7690bc0	Change HILSerlRobotEnvConfig to inherit from EnvConfig; Added support for hil_serl classifier to be trained with train.py; run classifier training by `python lerobot/scripts/train.py --policy.type=hilserl_classifier`; fixes in find_joint_limits, control_robot, end_effector_control_utils	2025-03-28 17:18:48 +00:00
AdilZouitine	052a4acfc2	[WIP] Update SAC configuration and environment settings - Reduced frame rate in `ManiskillEnvConfig` from 400 to 200. - Enhanced `SACConfig` with new dataclasses for actor, learner, and network configurations. - Improved input and output feature management in `SACConfig`. - Refactored `actor_server` and `learner_server` to access configuration properties directly. - Updated training pipeline to validate configurations and handle dataset repo IDs more robustly.	2025-03-28 17:18:48 +00:00
AdilZouitine	626e5dd35c	Add wandb run id in config	2025-03-28 17:18:48 +00:00
AdilZouitine	dd37bd412e	[WIP] Non functional yet Add ManiSkill environment configuration and wrappers - Introduced `VideoRecordConfig` for video recording settings. - Added `ManiskillEnvConfig` to encapsulate environment-specific configurations. - Implemented various wrappers for the ManiSkill environment, including observation and action scaling. - Enhanced the `make_maniskill` function to create a wrapped ManiSkill environment with video recording and observation processing. - Updated the `actor_server` and `learner_server` to utilize the new configuration structure. - Refactored the training pipeline to accommodate the new environment and policy configurations.	2025-03-28 17:18:48 +00:00
Michel Aractingi	b7b6d8102f	Change config logic in: - gym_manipulator - find_joint_limits - end_effector_utils	2025-03-28 17:18:48 +00:00
AdilZouitine	ee25fd8afe	Add .devcontainer to .gitignore for improved development environment management	2025-03-28 17:18:48 +00:00
AdilZouitine	5fbbc65869	Add task field to frame_dict in ReplayBuffer and simplify save_episode calls - Introduced a new "task" field in frame_dict to meet the requirements of LeRobotDataset. - Removed task_name parameter from save_episode calls for consistency.	2025-03-28 17:18:48 +00:00
AdilZouitine	f483931fc0	Handle new config with sac	2025-03-28 17:18:48 +00:00
AdilZouitine	b2025b852c	Handle multi optimizers	2025-03-28 17:18:48 +00:00
pre-commit-ci[bot]	7c05755823	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-03-28 17:18:48 +00:00
Michel Aractingi	2945bbb221	Removed depleted files and scripts	2025-03-28 17:18:48 +00:00
pre-commit-ci[bot]	8e6d5f504c	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-03-28 17:18:48 +00:00
AdilZouitine	761a2dbcb3	Update tensor device assignment in ReplayBuffer class - Changed the device assignment for tensors in the ReplayBuffer class from `device` to `storage_device` for consistency and improved resource management.	2025-03-28 17:18:48 +00:00
pre-commit-ci[bot]	81952b2092	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-03-28 17:18:48 +00:00
AdilZouitine	0eef49a0f6	Initialize log_alpha with the logarithm of temperature_init in SACPolicy - Updated the SACPolicy class to set log_alpha using the logarithm of the initial temperature value from the configuration.	2025-03-28 17:18:48 +00:00
pre-commit-ci[bot]	2d5effeeba	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-03-28 17:18:48 +00:00
AdilZouitine	c5c921cd7c	Remove unused functions and imports from modeling_sac.py - Deleted the `find_and_copy_params` function and the `Ensemble` class, as they were deemed unnecessary. - Cleaned up imports by removing `from_modules` from `tensordict` to enhance code clarity. - Simplified the assertion in the `Policy` class for better readability.	2025-03-28 17:18:48 +00:00
AdilZouitine	80e766c05c	Add intervention rate tracking in act_with_policy function - Introduced counters for tracking intervention steps and total steps during training. - Calculated and logged the intervention rate at the end of each episode. - Reset intervention counters after each episode to ensure accurate tracking.	2025-03-28 17:18:48 +00:00
AdilZouitine	eb6787e159	- Updated the logging condition to use `log_freq` directly instead of accessing it through `cfg.training.log_freq` for improved readability and speed.	2025-03-28 17:18:48 +00:00
Eugene Mironov	659adfc743	[PORT HIL-SERL] Optimize training loop, extract config usage (#855) Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-03-28 17:18:48 +00:00
AdilZouitine	07cc0662da	Enhance training information logging in learner server - Added tracking for replay buffer size and offline replay buffer size during training steps.	2025-03-28 17:18:48 +00:00
AdilZouitine	a02195249f	Update configuration files for improved performance and flexibility - Increased frame rate in `maniskill_example.yaml` from 20 to 400 for enhanced simulation speed. - Updated `sac_maniskill.yaml` to set `dataset_repo_id` to null and adjusted `grad_clip_norm` from 10.0 to 40.0. - Changed `storage_device` from "cpu" to "cuda" for better resource utilization. - Modified `save_freq` from 2000000 to 1000000 to optimize saving intervals. - Enhanced input normalization parameters for `observation.state` and `observation.image` in SAC policy. - Adjusted `num_critics` from 10 to 2 and `policy_parameters_push_frequency` from 1 to 4 for improved training dynamics. - Updated `learner_server.py` to utilize `offline_buffer_capacity` for replay buffer initialization. - Changed action multiplier in `maniskill_manipulator.py` from 1 to 0.03 for finer control over actions.	2025-03-28 17:18:48 +00:00
pre-commit-ci[bot]	cb272294f5	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-03-28 17:18:48 +00:00

905 Commits

All Branches