Dataset
NOTE: The joint efforts data in the robot state entry of the parquet files are wrong. This is because we do not store observations during our initial data collection, and all observations are collected through a round of "data replay" in which we restore sim state every step without stepping physics, and thus the joint effort reading is wrong. Please do not use them for training. They will be removed in the next dataset release.
Dataset Access
We host our dataset on Hugging Face:
Dataset URL: https://huggingface.co/datasets/behavior-1k/2025-challenge-demos
Rawdata URL: https://huggingface.co/datasets/behavior-1k/2025-challenge-rawdata
Data Format
For the 2025 NeurIPS challenge, we provide the following datasets:
-
2025-challenge-demos: 10000 human-collected teleoperation demos across 50 tasks. It follows the LeRobot format with some customizations for better data handling. The dataset has the following structure:
Folder Description annotations language annotations for each episode data low dim data, including proprioceptions, actions, privileged task info, etc. meta metadata folder containing episode-level information videos visual observations, including rgb, depth, seg_instance_id -
2025-challenge-rawdata: the original raw HDF5 data of the 10k teleoperation demos. These files contains everything needed to replay the exact trajectory in OmniGibson. We use this alongside with
OmniGibson/scripts/replay_obs.pyto replay the trajectory and collect additional visual observations.
Our demonstration data (2025-challenge-demos) is provided in LeRobot format, a widely-adopted format for robot learning datasets. LeRobot provides a unified interface for robot demonstration data, making it easy to load, process, and use the data for training policies.
To learn more about the LeRobot format, visit the official LeRobot repository. The whole dataset is ~1.5T, and we provide APIs to perform partial downloads based on task name, cameras, and modalities. We also provide functions to generate new modalities based on what's given by the dataset. Please refer to our tutorial notebooks about loading the dataset and generating custom data.
The dataset includes 3 visual modalities: RGB (rgb), Depth (depth_linear), and Mesh Segmentation (seg_instance_id):
|
RGB RGB image of the scene from the camera perspective. Size: (height, width, 4), numpy.uint8 Resolution: 720 x 720 for head camera, 480 x 480 for wrist cameras. Range: [0, 255] We provide RGBVideoLoader class for loading RGB mp4 video from demo dataset. |
|
|
Depth Linear Distance between the camera and everything else in the scene, where distance measurement is linearly proportional to the actual distance. Size: (height, width), numpy.float32 During data replay, we converted raw depth data to mp4 videos through a log quantization step. Our provided data loader will dequantize the video, and return (unnormalized) depth value within the range of [0, 10] meters. Please checkout quantize_depth and dequantize_depth for more details. We provide DepthVideoLoader class for loading depth mp4 video from demo dataset. |
|
|
Instance Segmentation ID Each pixel is assigned a label, indicating the specific object instance it belongs to (e.g., /World/table1/visuals, /World/chair2/visuals). Size: (height, width), numpy.uint32 Each integer corresponds to a unique instance id of a mesh, the id to prim path mapping can be found as ins_id_mapping in the episode metadata json file. We provide SegVideoLoader class for loading mesh segmentation mp4 video from demo dataset. |
|
Dataset Statistics
| Metric | Value |
|---|---|
| Total Trajectories | 10,000 |
| Total Tasks | 50 |
| Total Skills | 270,600 |
| Unique Skills | 31 |
| Avg. Skills per Trajectory | 27.06 |
| Avg. Trajectory Duration | 397.04 seconds / 6.6 minutes |
Show unique skills breakdown
- attach
- chop
- close door
- close drawer
- close lid
- hand over
- hang
- hold
- ignite
- insert
- move to
- open door
- open drawer
- open lid
- pick up from
- place in
- place in next to
- place on
- place on next to
- place under
- pour
- press
- push to
- release
- spray
- sweep surface
- tip over
- turn off switch
- turn on switch
- turn to
- wipe hard
Overall Demo Duration

Per Task Demo Duration
