lerobot/tests/test_datasets.py

import pytest
import torch

from lerobot.common.utils import init_hydra_config
import logging
from lerobot.common.datasets.factory import make_dataset

from .utils import DEVICE, DEFAULT_CONFIG_PATH


@pytest.mark.parametrize(
    "env_name,dataset_id,policy_name",
    [
        ("xarm", "xarm_lift_medium", "tdmpc"),
        ("pusht", "pusht", "diffusion"),
        ("aloha", "aloha_sim_insertion_human", "act"),
        ("aloha", "aloha_sim_insertion_scripted", "act"),
        ("aloha", "aloha_sim_transfer_cube_human", "act"),
        ("aloha", "aloha_sim_transfer_cube_scripted", "act"),
    ],
)
def test_factory(env_name, dataset_id, policy_name):
    cfg = init_hydra_config(
        DEFAULT_CONFIG_PATH,
        overrides=[f"env={env_name}", f"dataset_id={dataset_id}", f"policy={policy_name}", f"device={DEVICE}"]
    )
    dataset = make_dataset(cfg)
    delta_timestamps = dataset.delta_timestamps
    image_keys = dataset.image_keys

    item = dataset[0]

    keys_ndim_required = [
        ("action", 1, True),
        ("episode", 0, True),
        ("frame_id", 0, True),
        ("timestamp", 0, True),
        # TODO(rcadene): should we rename it agent_pos?
        ("observation.state", 1, True),
        ("next.reward", 0, False),
        ("next.done", 0, False),
    ]

    for key in image_keys:
        keys_ndim_required.append(
            (key, 3, True),
        )
    
    # test number of dimensions
    for key, ndim, required in keys_ndim_required:
        if key not in item:
            if required:
                assert key in item, f"{key}"
            else:
                logging.warning(f'Missing key in dataset: "{key}" not in {dataset}.')
                continue
        
        if delta_timestamps is not None and key in delta_timestamps:
            assert item[key].ndim == ndim + 1, f"{key}"
            assert item[key].shape[0] == len(delta_timestamps[key]), f"{key}"
        else:
            assert item[key].ndim == ndim, f"{key}"
        
        if key in image_keys:
            assert item[key].dtype == torch.float32, f"{key}"
            # TODO(rcadene): we assume for now that image normalization takes place in the model
            assert item[key].max() <= 1.0, f"{key}"
            assert item[key].min() >= 0.0, f"{key}"

            if delta_timestamps is not None and key in delta_timestamps:
                # test t,c,h,w
                assert item[key].shape[1] == 3, f"{key}"
            else:
                # test c,h,w 
                assert item[key].shape[0] == 3, f"{key}"


    if delta_timestamps is not None:
        # test missing keys in delta_timestamps
        for key in delta_timestamps:
            assert key in item, f"{key}"


# def test_compute_stats():
#     """Check that the statistics are computed correctly according to the stats_patterns property.

#     We compare with taking a straight min, mean, max, std of all the data in one pass (which we can do
#     because we are working with a small dataset).
#     """
#     cfg = init_hydra_config(
#         DEFAULT_CONFIG_PATH, overrides=["env=aloha", "env.task=sim_transfer_cube_human"]
#     )
#     dataset = make_dataset(cfg)
#     # Get all of the data.
#     all_data = dataset.data_dict
#     # Note: we set the batch size to be smaller than the whole dataset to make sure we are testing batched
#     # computation of the statistics. While doing this, we also make sure it works when we don't divide the
#     # dataset into even batches. 
#     computed_stats = buffer._compute_stats(batch_size=int(len(all_data) * 0.75))
#     for k, pattern in buffer.stats_patterns.items():
#         expected_mean = einops.reduce(all_data[k], pattern, "mean")
#         assert torch.allclose(computed_stats[k]["mean"], expected_mean)
#         assert torch.allclose(
#             computed_stats[k]["std"],
#             torch.sqrt(einops.reduce((all_data[k] - expected_mean) ** 2, pattern, "mean"))
#         )
#         assert torch.allclose(computed_stats[k]["min"], einops.reduce(all_data[k], pattern, "min"))
#         assert torch.allclose(computed_stats[k]["max"], einops.reduce(all_data[k], pattern, "max"))
Refactor configs to have env in seperate yaml + Fix training 2024-02-26 01:42:47 +08:00			`import pytest`
Add Aloha env and ACT policy WIP Aloha env tests pass Rendering works (fps look fast tho? TODO action bounding is too wide [-1,1]) Update README Copy past from act repo Remove download.py add a WIP for Simxarm Remove download.py add a WIP for Simxarm Add act yaml (TODO: try train.py) Training can runs (TODO: eval) Add tasks without end_effector that are compatible with dataset, Eval can run (TODO: training and pretrained model) Add AbstractEnv, Refactor AlohaEnv, Add rendering_hook in env, Minor modifications, (TODO: Refactor Pusht and Simxarm) poetry lock fix bug in compute_stats for action normalization fix more bugs in normalization fix training fix import PushtEnv inheriates AbstractEnv, Improve factory Normalization Add _make_env to EnvAbstract Add call_rendering_hooks to pusht env SimxarmEnv inherites from AbstractEnv (NOT TESTED) Add aloha tests artifacts + update pusht stats fix image normalization: before env was in [0,1] but dataset in [0,255], and now both in [0,255] Small fix on simxarm Add next to obs Add top camera to Aloha env (TODO: make it compatible with set of cameras) Add top camera to Aloha env (TODO: make it compatible with set of cameras) 2024-03-08 17:47:39 +08:00			`import torch`
Refactor configs to have env in seperate yaml + Fix training 2024-02-26 01:42:47 +08:00
revision 2024-03-28 02:33:48 +08:00			`from lerobot.common.utils import init_hydra_config`
WIP WIP WIP train.py works, loss going down WIP eval.py Fix WIP (eval running, TODO: verify results reproduced) Eval works! (testing reproducibility) WIP pretrained model pusht reproduces same results as torchrl pretrained model pusht reproduces same results as torchrl Remove AbstractPolicy, Move all queues in select_action WIP test_datasets passed (TODO: re-enable NormalizeTransform) 2024-03-31 23:05:25 +08:00			`import logging`
			`from lerobot.common.datasets.factory import make_dataset`
Refactor configs to have env in seperate yaml + Fix training 2024-02-26 01:42:47 +08:00
revision 2024-03-28 02:33:48 +08:00			`from .utils import DEVICE, DEFAULT_CONFIG_PATH`
Refactor configs to have env in seperate yaml + Fix training 2024-02-26 01:42:47 +08:00

			`@pytest.mark.parametrize(`
test_datasets.py are passing! 2024-04-08 22:02:03 +08:00			`"env_name,dataset_id,policy_name",`
Refactor configs to have env in seperate yaml + Fix training 2024-02-26 01:42:47 +08:00			`[`
Add gym-aloha, rename simxarm -> xarm, refactor 2024-04-08 22:18:53 +08:00			`("xarm", "xarm_lift_medium", "tdmpc"),`
test_datasets.py are passing! 2024-04-08 22:02:03 +08:00			`("pusht", "pusht", "diffusion"),`
			`("aloha", "aloha_sim_insertion_human", "act"),`
			`("aloha", "aloha_sim_insertion_scripted", "act"),`
			`("aloha", "aloha_sim_transfer_cube_human", "act"),`
			`("aloha", "aloha_sim_transfer_cube_scripted", "act"),`
Refactor configs to have env in seperate yaml + Fix training 2024-02-26 01:42:47 +08:00			`],`
			`)`
test_datasets.py are passing! 2024-04-08 22:02:03 +08:00			`def test_factory(env_name, dataset_id, policy_name):`
revision 2024-03-28 02:33:48 +08:00			`cfg = init_hydra_config(`
			`DEFAULT_CONFIG_PATH,`
test_datasets.py are passing! 2024-04-08 22:02:03 +08:00			`overrides=[f"env={env_name}", f"dataset_id={dataset_id}", f"policy={policy_name}", f"device={DEVICE}"]`
revision 2024-03-28 02:33:48 +08:00			`)`
WIP WIP WIP train.py works, loss going down WIP eval.py Fix WIP (eval running, TODO: verify results reproduced) Eval works! (testing reproducibility) WIP pretrained model pusht reproduces same results as torchrl pretrained model pusht reproduces same results as torchrl Remove AbstractPolicy, Move all queues in select_action WIP test_datasets passed (TODO: re-enable NormalizeTransform) 2024-03-31 23:05:25 +08:00			`dataset = make_dataset(cfg)`
test_datasets.py are passing! 2024-04-08 22:02:03 +08:00			`delta_timestamps = dataset.delta_timestamps`
			`image_keys = dataset.image_keys`
WIP WIP WIP train.py works, loss going down WIP eval.py Fix WIP (eval running, TODO: verify results reproduced) Eval works! (testing reproducibility) WIP pretrained model pusht reproduces same results as torchrl pretrained model pusht reproduces same results as torchrl Remove AbstractPolicy, Move all queues in select_action WIP test_datasets passed (TODO: re-enable NormalizeTransform) 2024-03-31 23:05:25 +08:00
			`item = dataset[0]`

test_datasets.py are passing! 2024-04-08 22:02:03 +08:00			`keys_ndim_required = [`
			`("action", 1, True),`
			`("episode", 0, True),`
			`("frame_id", 0, True),`
			`("timestamp", 0, True),`
			`# TODO(rcadene): should we rename it agent_pos?`
			`("observation.state", 1, True),`
			`("next.reward", 0, False),`
			`("next.done", 0, False),`
			`]`

			`for key in image_keys:`
			`keys_ndim_required.append(`
			`(key, 3, True),`
			`)`

			`# test number of dimensions`
			`for key, ndim, required in keys_ndim_required:`
			`if key not in item:`
			`if required:`
			`assert key in item, f"{key}"`
			`else:`
			`logging.warning(f'Missing key in dataset: "{key}" not in {dataset}.')`
			`continue`

			`if delta_timestamps is not None and key in delta_timestamps:`
			`assert item[key].ndim == ndim + 1, f"{key}"`
			`assert item[key].shape[0] == len(delta_timestamps[key]), f"{key}"`
			`else:`
			`assert item[key].ndim == ndim, f"{key}"`

			`if key in image_keys:`
			`assert item[key].dtype == torch.float32, f"{key}"`
			`# TODO(rcadene): we assume for now that image normalization takes place in the model`
			`assert item[key].max() <= 1.0, f"{key}"`
			`assert item[key].min() >= 0.0, f"{key}"`

			`if delta_timestamps is not None and key in delta_timestamps:`
			`# test t,c,h,w`
			`assert item[key].shape[1] == 3, f"{key}"`
			`else:`
			`# test c,h,w`
			`assert item[key].shape[0] == 3, f"{key}"`


			`if delta_timestamps is not None:`
			`# test missing keys in delta_timestamps`
			`for key in delta_timestamps:`
			`assert key in item, f"{key}"`
WIP WIP WIP train.py works, loss going down WIP eval.py Fix WIP (eval running, TODO: verify results reproduced) Eval works! (testing reproducibility) WIP pretrained model pusht reproduces same results as torchrl pretrained model pusht reproduces same results as torchrl Remove AbstractPolicy, Move all queues in select_action WIP test_datasets passed (TODO: re-enable NormalizeTransform) 2024-03-31 23:05:25 +08:00
fix stats computation 2024-04-02 23:40:33 +08:00
WIP stats (TODO: run tests on stats + cmpute them) 2024-04-05 00:36:03 +08:00			`# def test_compute_stats():`
			`# """Check that the statistics are computed correctly according to the stats_patterns property.`
fix stats computation 2024-04-02 23:40:33 +08:00
WIP stats (TODO: run tests on stats + cmpute them) 2024-04-05 00:36:03 +08:00			`# We compare with taking a straight min, mean, max, std of all the data in one pass (which we can do`
			`# because we are working with a small dataset).`
			`# """`
			`# cfg = init_hydra_config(`
			`# DEFAULT_CONFIG_PATH, overrides=["env=aloha", "env.task=sim_transfer_cube_human"]`
			`# )`
			`# dataset = make_dataset(cfg)`
			`# # Get all of the data.`
fix train.py, stats, eval.py (training is running) 2024-04-05 17:31:39 +08:00			`# all_data = dataset.data_dict`
WIP stats (TODO: run tests on stats + cmpute them) 2024-04-05 00:36:03 +08:00			`# # Note: we set the batch size to be smaller than the whole dataset to make sure we are testing batched`
			`# # computation of the statistics. While doing this, we also make sure it works when we don't divide the`
			`# # dataset into even batches.`
			`# computed_stats = buffer._compute_stats(batch_size=int(len(all_data) * 0.75))`
			`# for k, pattern in buffer.stats_patterns.items():`
			`# expected_mean = einops.reduce(all_data[k], pattern, "mean")`
			`# assert torch.allclose(computed_stats[k]["mean"], expected_mean)`
			`# assert torch.allclose(`
			`# computed_stats[k]["std"],`
			`# torch.sqrt(einops.reduce((all_data[k] - expected_mean) ** 2, pattern, "mean"))`
			`# )`
			`# assert torch.allclose(computed_stats[k]["min"], einops.reduce(all_data[k], pattern, "min"))`
			`# assert torch.allclose(computed_stats[k]["max"], einops.reduce(all_data[k], pattern, "max"))`