Find robotics datasets by task, embodiment, modality, license, and format.
The catalog starts with high-value open robotics datasets and treats license awareness, format metadata, enrichment readiness, and source review as first-class fields.
DROID
DROID research consortium · v1.0.0
Large-scale in-the-wild robot manipulation dataset.
Type
In-the-wild robot manipulation
Size
76K+ trajectories, 350h
Format
TFDS / RLDS / LeRobot
License
CC-BY 4.0 or source-verified
BridgeData V2
UC Berkeley · v1.0.0
Low-cost robot manipulation dataset.
Type
Low-cost robot manipulation
Size
60K trajectories
Format
TFDS / raw
License
CC-BY 4.0
Open X-Embodiment
Open X-Embodiment collaboration · v1.0.0
Cross-robot dataset across many robot embodiments.
Type
Cross-robot embodied dataset
Size
1M+ episodes, 22 robot types, 500+ skills
Format
RLDS
License
Mixed
ALOHA
ALOHA project community · v1.0.0
Bimanual teleoperation and mobile manipulation datasets.
Type
Bimanual teleoperation
Size
Varies by subset
Format
HDF5 / LeRobot
License
Apache 2.0 for selected LeRobot-hosted subsets
LIBERO
LIBERO research project · v1.0.0
Lifelong robot learning benchmark.
Type
Lifelong robot learning benchmark
Size
130 tasks, 65K demos
Format
benchmark / simulation / HDF5
License
Open benchmark, verify before mirroring
RoboNet
RoboNet project · v1.0.0
Multi-robot manipulation dataset.
Type
Multi-robot manipulation
Size
15M frames, 7 robot platforms
Format
custom
License
Verify before mirroring
RoboMimic / MimicGen
RoboMimic and MimicGen communities · v1.0.0
Imitation learning framework and generated demonstration datasets.
Type
Imitation learning and generated demos
Size
50K+ demos
Format
HDF5
License
MIT for framework / verify dataset subset
Egocentric-100K
Egocentric data project · v1.0.0
Large-scale egocentric manual labor video dataset.
Type
Egocentric manual labor video
Size
100K+ hours, 10.8B frames
Format
WebDataset / MP4
License
Apache 2.0
HumanoidLayer does not claim ownership of third-party open datasets. We index, curate, normalize metadata, and provide access workflows according to each dataset's license. Some datasets may be link-only until licensing is verified. Commercial use depends on the original license and subset restrictions.