nkp37/OpenVid-1M

Text To Video · nkp37 · 71.5K downloads
cc-by-4.0 · 11 TB
Tags: task_categories:text-to-video, task_categories:image-to-video, language:en, license:cc-by-4.0, size_categories:1M<n<10M

# download instantly
mlforge datasets pull nkp37/OpenVid-1M

Dataset details

Task: Text To Video
Language: en
License: cc-by-4.0
Size: 11 TB
Rows / images: 1.5M
Creator: nkp37
Downloads: 71.5K
Source: huggingface_datasets
Updated: 2026-03-31

About nkp37/OpenVid-1M

Summary

This is the dataset proposed in our paper [[ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation](https://arxiv.org/abs/2407.02371). OpenVid-1M is a high-quality text-to-video dataset intended for research institutions working to improve the quality of generated video; its clips are selected for high aesthetics, clarity, and resolution. It can be used for direct training or as a quality-tuning complement to other video datasets. All videos in OpenVid-1M have resolutions of at least 512×512. In addition, we curate 433K 1080p videos from OpenVid-1M to create OpenVidHD, a subset aimed at advancing high-definition video generation.
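
The two resolution criteria mentioned above (a 512×512 floor for the full set, and 1080p for OpenVidHD) can be illustrated with a small filter. This is only a sketch under stated assumptions: the `ClipInfo` record and its field names are hypothetical and are not the dataset's actual schema, "at least 512×512" is read as both sides being at least 512 pixels, and "1080p" is read as at least 1920×1080 in either orientation. The authors' actual curation pipeline may differ.

```python
from dataclasses import dataclass


@dataclass
class ClipInfo:
    # Hypothetical metadata record; the real dataset schema may differ.
    name: str
    width: int
    height: int


def meets_min_resolution(clip: ClipInfo, min_side: int = 512) -> bool:
    """Assumed reading of 'at least 512x512': both sides >= min_side."""
    return min(clip.width, clip.height) >= min_side


def is_1080p(clip: ClipInfo) -> bool:
    """Assumed reading of '1080p': at least 1920x1080, either orientation."""
    long_side = max(clip.width, clip.height)
    short_side = min(clip.width, clip.height)
    return long_side >= 1920 and short_side >= 1080


# Made-up example records, for illustration only.
clips = [
    ClipInfo("a.mp4", 512, 512),
    ClipInfo("b.mp4", 1920, 1080),
    ClipInfo("c.mp4", 640, 360),
]

kept = [c.name for c in clips if meets_min_resolution(c)]
hd = [c.name for c in clips if is_1080p(c)]
print(kept)
print(hd)
```

With records like these, only clips passing the 512-pixel floor would stay in the general pool, and the stricter 1080p check would select the candidates for an HD subset.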