sumedh/wav2vec2-large-xlsr-marathi

Automatic Speech Recognition · sumedh · 642.7K downloads · 2 likes
Tags: transformers · apache-2.0 · 315.5M params · dataset:openslr · base_model:facebook/wav2vec2-large-xlsr-53 · base_model:finetune:facebook/wav2vec2-large-xlsr-53


# pull & run locally
pip install mlforge-sdk && mlforge pull sumedh/wav2vec2-large-xlsr-marathi

Model details

Task: Automatic Speech Recognition
Provider: sumedh
Framework: transformers
Parameters: 315.5M
Size: 2.4 GB
License: apache-2.0
Downloads: 642.7K
Likes: 2
Updated: 2025-02-22

About sumedh/wav2vec2-large-xlsr-marathi

Wav2Vec2-Large-XLSR-53-Marathi

Fine-tuned facebook/wav2vec2-large-xlsr-53 on Marathi using the OpenSLR SLR64 dataset. When using this model, make sure that your speech input is sampled at 16 kHz. The training data contains only female voices, but the model also works well for male voices. Trained on Google Colab Pro on a Tesla P100 16GB GPU.

WER (Word Error Rate) on the test set: 12.70%

Usage

The model can be used directly, without a language model, given that your dataset has Marathi actual_text and path_in_folder columns.
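The usage code from the original model card is not reproduced on this page. What follows is a minimal sketch of how such a model is typically run, not the author's script. It assumes the `transformers`, `torch`, and `torchaudio` packages are installed. The helper names `load_model`, `transcribe`, and `word_error_rate` are hypothetical. It resamples audio to the 16 kHz rate the model expects, decodes greedily (no language model), and includes a plain word-level edit-distance WER for comparing a prediction with a reference transcript.

```python
MODEL_ID = "sumedh/wav2vec2-large-xlsr-marathi"
TARGET_SR = 16_000  # the model card asks for 16 kHz input


def load_model(model_id=MODEL_ID):
    """Load the processor and CTC model once; reuse them across files."""
    # Heavy imports are kept local so word_error_rate() works without them.
    from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

    processor = Wav2Vec2Processor.from_pretrained(model_id)
    model = Wav2Vec2ForCTC.from_pretrained(model_id)
    model.eval()
    return processor, model


def transcribe(path, processor, model):
    """Greedy CTC transcription of one audio file, without a language model."""
    import torch
    import torchaudio

    waveform, sr = torchaudio.load(path)  # shape: (channels, samples)
    if sr != TARGET_SR:
        waveform = torchaudio.functional.resample(waveform, sr, TARGET_SR)
    speech = waveform.mean(dim=0).numpy()  # downmix to mono

    inputs = processor(
        speech, sampling_rate=TARGET_SR, return_tensors="pt", padding=True
    )
    with torch.no_grad():
        logits = model(**inputs).logits
    predicted_ids = torch.argmax(logits, dim=-1)  # most likely token per frame
    return processor.batch_decode(predicted_ids)[0]


def word_error_rate(reference, hypothesis):
    """Word-level Levenshtein distance divided by the reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    if not ref:
        raise ValueError("reference transcript must contain at least one word")
    # prev[j] holds the edit distance between ref[:i-1] and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1] / len(ref)
```

To apply this to a dataset, call `load_model()` once, pass each row's audio path to `transcribe`, and compare the result with that row's transcript using `word_error_rate`. This helper scores one utterance at a time; a corpus-level figure like the 12.70% above is normally computed by summing errors and reference words over the whole test set, and the exact text normalization used for that number is not stated here.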
