Name: nkp37/OpenVid-1M
Creator: nkp37
License: cc-by-4.0
Keywords: huggingface, task_categories:text-to-video, task_categories:image-to-video, language:en, license:cc-by-4.0, size_categories:1M<n<10M, format:csv, modality:tabular, modality:text, text-to-video, image-to-video

About nkp37/OpenVid-1M

Summary This is the dataset proposed in our paper [[ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation](https://arxiv.org/abs/2407.02371). OpenVid-1M is a high-quality text-to-video dataset designed for research institutions to enhance video quality, featuring high aesthetics, clarity, and resolution. It can be used for direct training or as a quality tuning complement to other video datasets. All videos in the OpenVid-1M dataset have resolutions of at least 512×512. Furthermore, we curate 433K 1080p videos from OpenVid-1M to create OpenVidHD, advancing high-definition video generation.

nkp37/OpenVid-1M

Dataset details

About nkp37/OpenVid-1M