L
mvp-lab/LLaVA-OneVision-2-Data
Video Text To Text · mvp-lab
· 173.7K
apache-2.0
61 TB
task_categories:video-text-to-texttask_categories:visual-question-answeringtask_categories:image-text-to-textlanguage:enlicense:apache-2.0
Training data for the LLaVA-OneVision-2 multimodal model family, covering large-scale video and spatial reasoning corpora used in mid-training.
# download instantly
mlforge datasets pull mvp-lab/LLaVA-OneVision-2-Data
Dataset details
Source
huggingface_datasets
About mvp-lab/LLaVA-OneVision-2-Data
Training data for the LLaVA-OneVision-2 multimodal model family, covering large-scale video and spatial reasoning corpora used in mid-training.