ai4bharat/indicvoices_r

Text To Speech · ai4bharat · 28.2K downloads
cc-by-4.0 · 976 GB · task_categories:text-to-speech · language:as · language:bn · language:gu · language:hi

IndicVoices-R: Multilingual, Multi-Speaker Speech Corpus for Indian TTS

Dataset Summary

IndicVoices-R (IV-R) is the largest multilingual Indian text-to-speech (TTS) dataset derived from an automatic speech recognition (ASR) dataset. It contains 1,704 hours of high-quality speech from 10,496 speakers across 22 Indian languages. This dataset is designed to enhance the development of robust Indian TTS models by providing diverse speaker demographics, natural conversational…

See the full description on the dataset page: https://huggingface.co/datasets/ai4bharat/indicvoices_r.

# download instantly
mlforge datasets pull ai4bharat/indicvoices_r
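
Since the listed source is huggingface_datasets, the corpus can probably also be inspected from Python without pulling all 976 GB. The following is a minimal sketch, not a documented recipe for this dataset: it assumes the Hugging Face `datasets` package is installed, that the repository exposes at least one configuration and split, and that access does not require prior authentication. The helper name `peek_first_record` is hypothetical. Config and split names are discovered at run time rather than guessed.

```python
REPO_ID = "ai4bharat/indicvoices_r"  # repository id taken from the listing above


def peek_first_record(repo_id: str = REPO_ID):
    """Stream a single record and report which fields it contains.

    Assumption: the repo has at least one config and one split; their
    names are looked up instead of being hard-coded.
    """
    # Imported lazily so that defining this helper does not require the package.
    from datasets import (
        get_dataset_config_names,
        get_dataset_split_names,
        load_dataset,
    )

    config = get_dataset_config_names(repo_id)[0]
    split = get_dataset_split_names(repo_id, config)[0]

    # streaming=True reads records lazily instead of downloading the whole corpus.
    stream = load_dataset(repo_id, config, split=split, streaming=True)

    # Fetching a record may decode audio, which can need extra audio dependencies.
    first = next(iter(stream))
    return config, split, sorted(first.keys())


if __name__ == "__main__":
    config, split, fields = peek_first_record()
    print(f"config={config} split={split}")
    print("fields:", fields)
```

Streaming is used here because the listing reports a size of 976 GB; inspecting one record first makes it easier to decide which languages to download in full.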

Dataset details

Task: Text To Speech
Language: as
License: cc-by-4.0
Size: 976 GB
Creator: ai4bharat
Downloads: 28.2K
Source: huggingface_datasets
Updated: 2025-03-06