M
VLM2Vec/MSR-VTT
Text To Video · VLM2Vec
· 7.8K
Unknown
4.2 GB
task_categories:text-to-videotask_categories:text-retrievaltask_categories:video-classificationlanguage:ensize_categories:10K<n<100K
MSRVTT contains 10K video clips and 200K captions.
# download instantly
mlforge datasets pull VLM2Vec/MSR-VTT
Dataset details
Source
huggingface_datasets