mvp-lab/LLaVA-OneVision-2-Data

Name: mvp-lab/LLaVA-OneVision-2-Data
Creator: mvp-lab
License: apache-2.0
Keywords: huggingface, task_categories:video-text-to-text, task_categories:visual-question-answering, task_categories:image-text-to-text, language:en, license:apache-2.0, size_categories:n<1K, format:parquet, format:optimized-parquet, video-text-to-text, visual-question-answering, image-text-to-text

Video Text To Text · mvp-lab· 173.7K

apache-2.0 61 TB task_categories:video-text-to-texttask_categories:visual-question-answeringtask_categories:image-text-to-textlanguage:enlicense:apache-2.0

Training data for the LLaVA-OneVision-2 multimodal model family, covering large-scale video and spatial reasoning corpora used in mid-training.

Open in MLForge Sign up free Desktop app

# download instantly
mlforge datasets pull mvp-lab/LLaVA-OneVision-2-Data

Dataset details

Task

Video Text To Text

Language

License

apache-2.0

Size

61 TB

Rows / images

Creator

mvp-lab

Downloads

173.7K

Source

huggingface_datasets

Updated

2026-05-11

About mvp-lab/LLaVA-OneVision-2-Data

Training data for the LLaVA-OneVision-2 multimodal model family, covering large-scale video and spatial reasoning corpora used in mid-training.