Commit Graph

904 Commits

Author SHA1 Message Date
pre-commit-ci[bot] eb44a06a9b [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-04-18 15:10:22 +02:00
Michel Aractingi 8eb3c1510c Added support for controlling the gripper with the pygame gamepad interface
Minor modifications in gym_manipulator to quantize the gripper actions
Clamped the observations after F.resize in the ConvertToLeRobotObservation wrapper: due to a bug in F.resize, images were returned with values exceeding the maximum of 1.0 (see the sketch after this entry)
2025-04-18 15:10:22 +02:00
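The clamping fix described in the commit above can be sketched as follows (a minimal sketch; the function name and the [0, 1] image range are assumptions, not the repository's exact wrapper code):

    import torch
    import torchvision.transforms.functional as F

    def resize_and_clamp(image: torch.Tensor, size: list[int]) -> torch.Tensor:
        # F.resize with antialiasing can slightly overshoot, so pixel values may exceed 1.0.
        resized = F.resize(image, size)
        # Clamp back into the expected [0, 1] range before the image reaches the policy.
        return torch.clamp(resized, 0.0, 1.0)

    image = torch.rand(3, 128, 128)
    print(resize_and_clamp(image, [64, 64]).max() <= 1.0)  # tensor(True)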
AdilZouitine 4d5ecb082e Refactor SACPolicy for improved type annotations and readability
- Enhanced type annotations for variables in the `SACPolicy` class to improve code clarity.
- Updated method calls to use keyword arguments for better readability.
- Streamlined the extraction of batch components, ensuring consistent typing across the class methods.
2025-04-18 15:10:22 +02:00
AdilZouitine 6e687e2910 Refactor SACPolicy and learner_server for improved clarity and functionality
- Updated the `forward` method in `SACPolicy` to handle loss computation for actor, critic, and temperature models.
- Replaced direct calls to `compute_loss_*` methods with a unified `forward` method in `learner_server`.
- Enhanced batch processing by consolidating input parameters into a single dictionary for better readability and maintainability.
- Removed redundant code and improved documentation for clarity.
2025-04-18 15:10:22 +02:00
AdilZouitine eb710647bf Refactor actor_server.py for improved structure and logging
- Consolidated logging initialization and enhanced logging for actor processes.
- Streamlined the handling of gRPC connections and process management.
- Improved readability by organizing core algorithm functions and communication functions.
- Added detailed comments and documentation for clarity.
- Ensured proper queue management and shutdown handling for actor processes.
2025-04-18 15:10:22 +02:00
AdilZouitine 176557d770 Refactor learner_server.py for improved structure and clarity
- Removed unused imports and streamlined the code structure.
- Consolidated logging initialization and enhanced logging for training processes.
- Improved handling of training state loading and resume logic.
- Refactored transition and interaction message processing for better readability and maintainability.
- Added detailed comments and documentation for clarity.
2025-04-18 15:10:22 +02:00
AdilZouitine 3beab33fac Refactor imports in modeling_sac.py for improved organization
- Rearranged import statements for better readability.
- Removed unused imports and streamlined the code structure.
2025-04-18 15:10:22 +02:00
AdilZouitine c0ba4b4954 Refactor SACConfig properties for improved readability
- Simplified the `image_features` property to directly iterate over `input_features`.
- Removed unused imports and unnecessary code related to main execution, enhancing clarity and maintainability.
2025-04-18 15:10:22 +02:00
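A minimal sketch of the simplified property described in the commit above, assuming input_features maps feature names to a type tag (the dataclass and field names here are illustrative, not the exact SACConfig API):

    from dataclasses import dataclass, field

    @dataclass
    class SACConfigSketch:
        # Hypothetical feature map: name -> feature type ("visual" or "state").
        input_features: dict[str, str] = field(default_factory=dict)

        @property
        def image_features(self) -> list[str]:
            # Directly iterate over input_features and keep only the visual entries.
            return [name for name, ftype in self.input_features.items() if ftype == "visual"]

    cfg = SACConfigSketch(input_features={"observation.image": "visual", "observation.state": "state"})
    print(cfg.image_features)  # ['observation.image']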
AdilZouitine 8fb373aeb2 fix 2025-04-18 15:10:22 +02:00
AdilZouitine 5a0ee06651 Enhance logging for actor and learner servers
- Implemented process-specific logging for actor and learner servers to improve traceability.
- Created a dedicated logs directory and ensured it exists before logging.
- Initialized logging with explicit log files for each process, including actor transitions, interactions, and policy.
- Updated the actor CLI to validate configuration and set up logging accordingly.
2025-04-18 15:10:22 +02:00
Michel Aractingi 05a237ce10 Added gripper control mechanism to gym_manipulator
Moved HilSerl env config to configs/env/configs.py
Fixes in actor_server, modeling_sac, and configuration_sac
Added the possibility of ignoring missing keys in env_cfg in the get_features_from_env_config function
2025-04-18 15:10:22 +02:00
AdilZouitine 88cc2b8fc8 Add WrapperConfig for environment wrappers and update SACConfig properties
- Introduced `WrapperConfig` dataclass for environment wrapper configurations.
- Updated `ManiskillEnvConfig` to include a `wrapper` field for enhanced environment management.
- Modified `SACConfig` to return `None` for `observation_delta_indices` and `action_delta_indices` properties.
- Refactored `make_robot_env` function to improve readability and maintainability.
2025-04-18 15:10:22 +02:00
Michel Aractingi b69132c79d Change HILSerlRobotEnvConfig to inherit from EnvConfig
Added support for training the hil_serl classifier with train.py
Run classifier training with: python lerobot/scripts/train.py --policy.type=hilserl_classifier
Fixes in find_joint_limits, control_robot, and end_effector_control_utils
2025-04-18 15:10:21 +02:00
AdilZouitine db897a1619 [WIP] Update SAC configuration and environment settings
- Reduced frame rate in `ManiskillEnvConfig` from 400 to 200.
- Enhanced `SACConfig` with new dataclasses for actor, learner, and network configurations.
- Improved input and output feature management in `SACConfig`.
- Refactored `actor_server` and `learner_server` to access configuration properties directly.
- Updated training pipeline to validate configurations and handle dataset repo IDs more robustly.
2025-04-18 15:09:46 +02:00
AdilZouitine 0b5b62c8fb Add wandb run id in config 2025-04-18 15:09:46 +02:00
AdilZouitine 056f79d358 [WIP] Non-functional yet
Add ManiSkill environment configuration and wrappers

- Introduced `VideoRecordConfig` for video recording settings.
- Added `ManiskillEnvConfig` to encapsulate environment-specific configurations.
- Implemented various wrappers for the ManiSkill environment, including observation and action scaling.
- Enhanced the `make_maniskill` function to create a wrapped ManiSkill environment with video recording and observation processing.
- Updated the `actor_server` and `learner_server` to utilize the new configuration structure.
- Refactored the training pipeline to accommodate the new environment and policy configurations.
2025-04-18 15:09:46 +02:00
Michel Aractingi 114ec644d0 Change config logic in:
- gym_manipulator
- find_joint_limits
- end_effector_utils
2025-04-18 15:09:45 +02:00
AdilZouitine 26ee8b6ae5 Add .devcontainer to .gitignore for improved development environment management 2025-04-18 15:09:27 +02:00
AdilZouitine 38e8864284 Add task field to frame_dict in ReplayBuffer and simplify save_episode calls
- Introduced a new "task" field in frame_dict to meet the requirements of LeRobotDataset.
- Removed task_name parameter from save_episode calls for consistency.
2025-04-18 15:09:27 +02:00
AdilZouitine 80d566eb56 Handle new config with SAC 2025-04-18 15:09:27 +02:00
AdilZouitine bb5a95889f Handle multiple optimizers 2025-04-18 15:09:27 +02:00
pre-commit-ci[bot] 0ea27704f6 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-04-18 15:09:25 +02:00
Michel Aractingi 2abbd60a0d Removed deprecated files and scripts 2025-04-18 15:07:48 +02:00
pre-commit-ci[bot] 1c8daf11fd [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-04-18 15:07:46 +02:00
AdilZouitine cdcf346061 Update tensor device assignment in ReplayBuffer class
- Changed the device assignment for tensors in the ReplayBuffer class from `device` to `storage_device` for consistency and improved resource management.
2025-04-18 15:06:52 +02:00
pre-commit-ci[bot] 42f95e827d [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-04-18 15:06:52 +02:00
AdilZouitine 618ed00d45 Initialize log_alpha with the logarithm of temperature_init in SACPolicy
- Updated the SACPolicy class to set log_alpha using the logarithm of the initial temperature value from the configuration.
2025-04-18 15:06:52 +02:00
pre-commit-ci[bot] 50d8db481e [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-04-18 15:06:52 +02:00
AdilZouitine e4a5971ffd Remove unused functions and imports from modeling_sac.py
- Deleted the `find_and_copy_params` function and the `Ensemble` class, as they were deemed unnecessary.
- Cleaned up imports by removing `from_modules` from `tensordict` to enhance code clarity.
- Simplified the assertion in the `Policy` class for better readability.
2025-04-18 15:06:52 +02:00
AdilZouitine 36f9ccd851 Add intervention rate tracking in act_with_policy function
- Introduced counters for tracking intervention steps and total steps during training.
- Calculated and logged the intervention rate at the end of each episode.
- Reset intervention counters after each episode to ensure accurate tracking.
2025-04-18 15:06:52 +02:00
AdilZouitine 787aee0e60 Updated the logging condition to use `log_freq` directly instead of accessing it through `cfg.training.log_freq` for improved readability and speed. 2025-04-18 15:06:52 +02:00
Eugene Mironov 0341a38fdd [PORT HIL-SERL] Optimize training loop, extract config usage (#855)
Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>
2025-04-18 15:06:52 +02:00
AdilZouitine ffbed4a141 Enhance training information logging in learner server
- Added tracking for replay buffer size and offline replay buffer size during training steps.
2025-04-18 15:06:52 +02:00
AdilZouitine 03fe0f054b Update configuration files for improved performance and flexibility
- Increased frame rate in `maniskill_example.yaml` from 20 to 400 for enhanced simulation speed.
- Updated `sac_maniskill.yaml` to set `dataset_repo_id` to null and adjusted `grad_clip_norm` from 10.0 to 40.0.
- Changed `storage_device` from "cpu" to "cuda" for better resource utilization.
- Modified `save_freq` from 2000000 to 1000000 to optimize saving intervals.
- Enhanced input normalization parameters for `observation.state` and `observation.image` in SAC policy.
- Adjusted `num_critics` from 10 to 2 and `policy_parameters_push_frequency` from 1 to 4 for improved training dynamics.
- Updated `learner_server.py` to utilize `offline_buffer_capacity` for replay buffer initialization.
- Changed action multiplier in `maniskill_manipulator.py` from 1 to 0.03 for finer control over actions.
2025-04-18 15:06:52 +02:00
pre-commit-ci[bot] fd74c194b6 [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-04-18 15:06:52 +02:00
AdilZouitine 0959694bab Refactor SACPolicy and learner server for improved replay buffer management
- Updated SACPolicy to create critic heads using a list comprehension for better readability.
- Simplified the saving and loading of models using `save_model` and `load_model` functions from the safetensors library.
- Introduced `initialize_offline_replay_buffer` function in the learner server to streamline offline dataset handling and replay buffer initialization.
- Enhanced logging for dataset loading processes to improve traceability during training.
2025-04-18 15:06:52 +02:00
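The critic-head construction and safetensors-based saving mentioned in the commit above could look roughly like this (a sketch; layer sizes and the file name are placeholders, while save_model/load_model are the real safetensors.torch helpers):

    from safetensors.torch import load_model, save_model
    from torch import nn

    num_critics = 2
    # Critic heads built with a list comprehension (sizes are placeholders).
    critic_heads = nn.ModuleList(
        [nn.Sequential(nn.Linear(16, 64), nn.ReLU(), nn.Linear(64, 1)) for _ in range(num_critics)]
    )

    save_model(critic_heads, "critics.safetensors")  # serialize weights with safetensors
    load_model(critic_heads, "critics.safetensors")  # restore them in place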
Michel Aractingi 7b01e16439 Add end effector action space to hil-serl (#861)
Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-04-18 15:06:52 +02:00
AdilZouitine 66816fd871 Enhance SAC configuration and policy with gradient clipping and temperature management
- Introduced `grad_clip_norm` parameter in SAC configuration for gradient clipping
- Updated SACPolicy to store temperature as an instance variable for consistent usage
- Modified loss calculations in SACPolicy to utilize the instance temperature
- Enhanced MLP and CriticHead to support a customizable final activation function
- Implemented gradient clipping in the learner server during training steps for both actor and critic
- Added tracking for gradient norms in training information
2025-04-18 15:06:52 +02:00
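A minimal sketch of the gradient clipping step described in the commit above (the model, optimizer, and grad_clip_norm value are placeholders):

    import torch
    from torch import nn

    grad_clip_norm = 10.0  # placeholder for the configured value
    critic = nn.Linear(8, 1)
    optimizer = torch.optim.Adam(critic.parameters(), lr=3e-4)

    loss = critic(torch.randn(4, 8)).pow(2).mean()
    optimizer.zero_grad()
    loss.backward()
    # Clip the global gradient norm before the optimizer step and keep the returned
    # norm so it can be logged with the other training information.
    grad_norm = nn.utils.clip_grad_norm_(critic.parameters(), grad_clip_norm)
    optimizer.step()
    print(f"critic grad norm: {grad_norm.item():.3f}")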
pre-commit-ci[bot] 599326508f [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-04-18 15:06:52 +02:00
AdilZouitine 2f04d0d2b9 Add custom save and load methods for SAC policy
- Implement `_save_pretrained` method to handle TensorDict state saving
- Add `_from_pretrained` class method for loading SAC policy from files
- Create utility function `find_and_copy_params` to handle parameter copying
2025-04-18 15:06:52 +02:00
AdilZouitine e002c5ec56 Remove torch.no_grad decorator and optimize next action prediction in SAC policy
- Removed `@torch.no_grad` decorator from Unnormalize forward method
- Added TODO comment for optimizing next action prediction in SAC policy
- Minor formatting adjustment in NaN assertion for log standard deviation
Co-authored-by: Yoel Chornton <yoel.chornton@gmail.com>
2025-04-18 15:06:52 +02:00
s1lent4gnt 3dfb37e976 [Port HIL-SERL] Balanced sampler function speed up and refactor to align with train.py (#715)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-04-18 15:06:52 +02:00
Eugene Mironov b6a2200983 [HIL-SERL] Migrate threading to multiprocessing (#759)
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-04-18 15:06:52 +02:00
pre-commit-ci[bot] 85fe8a3f4e [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
2025-04-18 15:06:51 +02:00
AdilZouitine bb69cb3c8c Add storage device configuration for SAC policy and replay buffer
- Introduce `storage_device` parameter in SAC configuration and training settings
- Update learner server to use configurable storage device for replay buffer
- Reduce online buffer capacity in ManiSkill configuration
- Modify replay buffer initialization to support custom storage device
2025-04-18 15:04:58 +02:00
AdilZouitine ae51c19b3c Add memory optimization option to ReplayBuffer
- Introduce `optimize_memory` parameter to reduce memory usage in replay buffer
- Implement simplified memory optimization by not storing duplicate next_states
- Update learner server and buffer initialization to use memory optimization by default
2025-04-18 15:04:58 +02:00
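A minimal sketch of the memory optimization described in the commit above: only one state array is kept, and next_state is read from the following slot, so consecutive transitions are not stored twice (assumes in-order writes and ignores episode boundaries):

    import torch

    capacity, state_dim, action_dim = 1_000, 4, 2
    states = torch.zeros(capacity + 1, state_dim)  # one extra slot for the final next_state
    actions = torch.zeros(capacity, action_dim)
    rewards = torch.zeros(capacity)

    def add(i: int, state, action, reward, next_state) -> None:
        states[i] = state
        states[i + 1] = next_state  # overwritten by the next transition's state
        actions[i] = action
        rewards[i] = reward

    def sample(indices: torch.Tensor):
        # next_states are recovered by shifting the index instead of storing duplicates.
        return states[indices], actions[indices], rewards[indices], states[indices + 1]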
AdilZouitine 9ea79f8a76 Add storage device parameter to replay buffer initialization
- Specify storage device for replay buffer to optimize memory management
2025-04-18 15:04:58 +02:00
AdilZouitine 1d4ec50a58 Refactor ReplayBuffer with tensor-based storage and improved sampling efficiency
- Replaced list-based memory storage with pre-allocated tensor storage
- Optimized sampling process with direct tensor indexing
- Added support for DrQ image augmentation during sampling for offline dataset
- Improved dataset conversion with more robust episode handling
- Enhanced buffer initialization and state tracking
- Added comprehensive testing for buffer conversion and sampling
2025-04-18 15:04:58 +02:00
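The DrQ-style image augmentation mentioned in the commit above is typically a random shift, roughly as sketched below (a simple per-sample version; pad size and batch shapes are placeholders, not the repository's implementation):

    import torch
    import torch.nn.functional as F

    def random_shift(images: torch.Tensor, pad: int = 4) -> torch.Tensor:
        # Replicate-pad the batch, then crop each image back to its original
        # size at a random offset (DrQ-style shift augmentation).
        n, c, h, w = images.shape
        padded = F.pad(images, (pad, pad, pad, pad), mode="replicate")
        out = torch.empty_like(images)
        for i in range(n):
            top = int(torch.randint(0, 2 * pad + 1, (1,)))
            left = int(torch.randint(0, 2 * pad + 1, (1,)))
            out[i] = padded[i, :, top : top + h, left : left + w]
        return out

    batch = torch.rand(8, 3, 64, 64)
    augmented = random_shift(batch)  # applied to sampled image batches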
AdilZouitine 4c73891575 Update ManiSkill configuration and replay buffer to support truncation and dataset handling
- Reduced image size in ManiSkill environment configuration from 128 to 64
- Added support for truncation in replay buffer and actor server
- Updated SAC policy configuration to use a specific dataset and modify vision encoder settings
- Improved dataset conversion process with progress tracking and task naming
- Added flexibility for joint action space masking in learner server
2025-04-18 15:04:58 +02:00
Michel Aractingi d3b84ecd6f Added a caching function in the learner_server and modeling_sac to limit the number of forward passes through the pretrained encoder when it is frozen (sketched below).
Added tensordict dependencies
Updated the version of torch and torchvision

Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>
2025-04-18 15:04:58 +02:00
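The encoder-output caching mentioned in the commit above could be sketched like this, keyed by tensor identity so the same observation batch only goes through the frozen encoder once (a sketch with hypothetical names, not the repository's implementation; a real cache would also bound its size):

    import torch
    from torch import nn

    encoder = nn.Sequential(nn.Linear(8, 16), nn.ReLU())  # stand-in for the pretrained encoder
    for p in encoder.parameters():
        p.requires_grad_(False)  # the encoder is frozen

    _cache: dict[int, torch.Tensor] = {}

    def encode_cached(obs: torch.Tensor) -> torch.Tensor:
        # Reuse features for a batch that was already encoded during this step.
        key = id(obs)
        if key not in _cache:
            with torch.no_grad():
                _cache[key] = encoder(obs)
        return _cache[key]

    obs = torch.randn(4, 8)
    assert encode_cached(obs) is encode_cached(obs)  # second call hits the cache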