ARTPARK-IISc/Vaani

Automatic Speech Recognition · ARTPARK-IISc · 60.2K downloads
cc-by-4.0 · 6.8 TB · task_categories:automatic-speech-recognition · task_categories:text-to-speech · task_categories:image-to-text · task_categories:text-to-image · language:ne

VAANI is an India-representative, multi-modal, multi-lingual dataset. The current version (phase 1: 80 districts; phase 2: 85 districts) contains ~31,255 hours of spontaneous, image-prompted speech by 156K speakers across 165 districts, talking about 288K images and covering 106 languages. Of this audio, 2,043 hours have been transcribed (text), spread almost evenly across the 165 districts. Project Vaani, by IISc, Bangalore and ARTPARK, is capturing the true diversity of India’s… See the full description on the dataset page: https://huggingface.co/datasets/ARTPARK-IISc/Vaani.
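To put the headline figures in proportion, here is a minimal sketch that derives a few ratios from the numbers quoted in the description. The constant and function names are illustrative and not part of the dataset or any tool. The source figures are rounded (for example "~31,255 hours" and "156K speakers"), so the results are approximate averages only.

```python
# Back-of-the-envelope ratios from the figures quoted in the description.
# The inputs are rounded in the source, so every result is approximate.

TOTAL_HOURS = 31_255        # "~31,255 hours" of speech
TRANSCRIBED_HOURS = 2_043   # hours with transcripts
SPEAKERS = 156_000          # "156K speakers"
DISTRICTS_PHASE_1 = 80
DISTRICTS_PHASE_2 = 85


def summarize() -> dict[str, float]:
    """Return average ratios implied by the quoted totals."""
    districts = DISTRICTS_PHASE_1 + DISTRICTS_PHASE_2
    return {
        "districts": districts,
        # Share of the audio that has a transcript, in percent.
        "transcribed_pct": 100 * TRANSCRIBED_HOURS / TOTAL_HOURS,
        # Mean audio hours per district.
        "hours_per_district": TOTAL_HOURS / districts,
        # Mean transcribed hours per district.
        "transcribed_hours_per_district": TRANSCRIBED_HOURS / districts,
        # Mean speech per speaker, in minutes.
        "minutes_per_speaker": 60 * TOTAL_HOURS / SPEAKERS,
    }


if __name__ == "__main__":
    for name, value in summarize().items():
        print(f"{name}: {value:.2f}")
```

With the quoted figures, the two phases add up to the stated 165 districts, roughly 6.5% of the audio is transcribed, each district averages about 189 hours of audio and about 12 hours of transcripts, and each speaker contributes about 12 minutes of speech on average.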

# download instantly
mlforge datasets pull ARTPARK-IISc/Vaani

Dataset details

Task: Automatic Speech Recognition
Language: ne
License: cc-by-4.0
Size: 6.8 TB
Creator: ARTPARK-IISc
Downloads: 60.2K
Source: huggingface_datasets
Updated: 2026-05-04