Name: ai4bharat/indicvoices_r
Creator: ai4bharat
License: cc-by-4.0
Keywords: huggingface, task_categories:text-to-speech, language:as, language:bn, language:gu, language:hi, language:kn, language:ks, language:ml, text-to-speech

IndicVoices-R: Multilingual, Multi-Speaker Speech Corpus for Indian TTS

Dataset Summary

IndicVoices-R (IV-R) is the largest multilingual Indian text-to-speech (TTS) dataset derived from an automatic speech recognition (ASR) dataset. It contains 1,704 hours of high-quality speech from 10,496 speakers across 22 Indian languages. This dataset is designed to enhance the development of robust Indian TTS models by providing diverse speaker demographics, natural conversational… See the full description on the dataset page: https://huggingface.co/datasets/ai4bharat/indicvoices_r.
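For a rough sense of scale, the totals quoted in the summary can be turned into averages. This is only a back-of-the-envelope sketch: it assumes speech is spread evenly across speakers and languages, which the summary does not claim, and the variable names are illustrative.

```python
# Totals as stated in the dataset summary.
TOTAL_HOURS = 1704
TOTAL_SPEAKERS = 10496
TOTAL_LANGUAGES = 22

# Averages under the (unverified) assumption of an even distribution.
minutes_per_speaker = TOTAL_HOURS * 60 / TOTAL_SPEAKERS
hours_per_language = TOTAL_HOURS / TOTAL_LANGUAGES
speakers_per_language = TOTAL_SPEAKERS / TOTAL_LANGUAGES

print(f"Average speech per speaker: {minutes_per_speaker:.1f} minutes")
print(f"Average speech per language: {hours_per_language:.1f} hours")
print(f"Average speakers per language: {speakers_per_language:.0f}")
```

The real per-language and per-speaker distribution is likely uneven, so treat these numbers as orientation only and consult the dataset page for the actual breakdown.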


Dataset details