lerobot

Commit Graph

Author	SHA1	Message	Date
Michel Aractingi	9eec7b8bb0	General fixes in code, removed delta action, fixed grasp penalty, added logic to put gripper reward in info	2025-04-16 16:46:37 +02:00
pre-commit-ci[bot]	7225bc74a3	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-04-16 16:46:37 +02:00
AdilZouitine	9b6e5a383f	Refactor complementary_info handling in ReplayBuffer	2025-04-16 16:46:37 +02:00
AdilZouitine	86466b025f	Handle gripper penalty	2025-04-16 16:46:37 +02:00
pre-commit-ci[bot]	82584cca78	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-04-16 16:46:37 +02:00
AdilZouitine	74c11c4a75	Enhance SAC configuration and replay buffer with asynchronous prefetching support - Added async_prefetch parameter to SACConfig for improved buffer management. - Implemented get_iterator method in ReplayBuffer to support asynchronous prefetching of batches. - Updated learner_server to utilize the new iterator for online and offline sampling, enhancing training efficiency.	2025-04-16 16:46:37 +02:00
Michel Aractingi	c621077b62	Added Gripper quantization wrapper and grasp penalty removed complementary info from buffer and learner server removed get_gripper_action function added gripper parameters to `common/envs/configs.py`	2025-04-16 16:46:37 +02:00
pre-commit-ci[bot]	f5cfd9fd48	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-04-16 16:46:37 +02:00
s1lent4gnt	cef944e1b1	Add complementary info in the replay buffer - Added complementary info in the add method - Added complementary info in the sample method	2025-04-16 16:46:37 +02:00
s1lent4gnt	66c3672738	Fix: Prevent Invalid next_state References When optimize_memory=True (#918 )	2025-03-31 09:43:40 +02:00
pre-commit-ci[bot]	c05e4835d0	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-03-28 17:20:39 +00:00
AdilZouitine	5fbbc65869	Add task field to frame_dict in ReplayBuffer and simplify save_episode calls - Introduced a new "task" field in frame_dict to meet the requirements of LeRobotDataset. - Removed task_name parameter from save_episode calls for consistency.	2025-03-28 17:18:48 +00:00
pre-commit-ci[bot]	7c05755823	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-03-28 17:18:48 +00:00
AdilZouitine	761a2dbcb3	Update tensor device assignment in ReplayBuffer class - Changed the device assignment for tensors in the ReplayBuffer class from `device` to `storage_device` for consistency and improved resource management.	2025-03-28 17:18:48 +00:00
Eugene Mironov	db78fee9de	[HIL-SERL] Migrate threading to multiprocessing (#759 ) Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>	2025-03-28 17:18:48 +00:00
pre-commit-ci[bot]	38f5fa4523	[pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci	2025-03-28 17:18:48 +00:00
AdilZouitine	24f93c755a	Add memory optimization option to ReplayBuffer - Introduce `optimize_memory` parameter to reduce memory usage in replay buffer - Implement simplified memory optimization by not storing duplicate next_states - Update learner server and buffer initialization to use memory optimization by default	2025-03-28 17:18:48 +00:00
AdilZouitine	7c366e3223	Refactor ReplayBuffer with tensor-based storage and improved sampling efficiency - Replaced list-based memory storage with pre-allocated tensor storage - Optimized sampling process with direct tensor indexing - Added support for DrQ image augmentation during sampling for offline dataset - Improved dataset conversion with more robust episode handling - Enhanced buffer initialization and state tracking - Added comprehensive testing for buffer conversion and sampling	2025-03-28 17:18:48 +00:00
AdilZouitine	2c799508d7	Update ManiSkill configuration and replay buffer to support truncation and dataset handling - Reduced image size in ManiSkill environment configuration from 128 to 64 - Added support for truncation in replay buffer and actor server - Updated SAC policy configuration to use a specific dataset and modify vision encoder settings - Improved dataset conversion process with progress tracking and task naming - Added flexibility for joint action space masking in learner server	2025-03-28 17:18:48 +00:00
Eugene Mironov	d48161da1b	[Port HIL-SERL] Adjust Actor-Learner architecture & clean up dependency management for HIL-SERL (#722 )	2025-03-28 17:18:48 +00:00
Michel Aractingi	291358d6a2	Fixed bug in the action scale of the intervention actions and offline dataset actions. (scale by inverse delta) Co-authored-by: Adil Zouitine <adizouitinegm@gmail.com>	2025-03-28 17:18:24 +00:00
Michel Aractingi	2aca830a09	Modified crop_dataset_roi interface to automatically write the cropped parameters to a json file in the meta of the dataset Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-03-28 17:18:24 +00:00
Michel Aractingi	2f34d84298	Optimized the replay buffer from the memory side to store data on cpu instead of a gpu device and send the batches to the gpu. Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-03-28 17:18:24 +00:00
Michel Aractingi	eb7e28d9d9	Hardcoded some normalization parameters. TODO refactor Added masking actions on the level of the intervention actions and offline dataset Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-03-28 17:18:24 +00:00
Michel Aractingi	c623824139	- Added JointMaskingActionSpace wrapper in `gym_manipulator` in order to select which joints will be controlled. For example, we can disable the gripper actions for some tasks. - Added Nan detection mechanisms in the actor, learner and gym_manipulator for the case where we encounter nans in the loop. - changed the non-blocking in the `.to(device)` functions to only work for the case of cuda because they were causing nans when running the policy on mps - Added some joint clipping and limits in the env, robot and policy configs. TODO clean this part and make the limits in one config file only. Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-03-28 17:18:24 +00:00
Michel Aractingi	f4f5b26a21	Several fixes to move the actor_server and learner_server code from the maniskill environment to the real robot environment. Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-03-28 17:18:24 +00:00
Michel Aractingi	163bcbcad4	fixed bug in crop_dataset_roi.py added missing buffer.pt in server dir Co-authored-by: Adil Zouitine <adilzouitinegm@gmail.com>	2025-03-28 17:18:24 +00:00

27 Commits