HomeModelsAutomatic Speech Recognitionalvanlii/wav2vec2-BERT-cantonese
W

alvanlii/wav2vec2-BERT-cantonese

Automatic Speech Recognition·alvanlii· 532.7K· 6
transformers apache-2.0 608.4M params dataset:mozilla-foundation/common_voice_16_0arxiv:2201.02419license:apache-2.0region:us

This model is a fine-tuned version of facebook/w2v-bert-2.0. This has a CER of 10.27 on Common Voice 16 (yue) test set (without punctuations).

Open in MLForge Sign up free Desktop app Source ↗
# pull & run locally
pip install mlforge-sdk && mlforge pull alvanlii/wav2vec2-BERT-cantonese

Model details

Task
Automatic Speech Recognition
Provider
alvanlii
Framework
transformers
Parameters
608.4M
Size
27 GB
License
apache-2.0
Downloads
532.7K
Likes
6
Paper
arXiv:2201.02419
Updated
2024-04-05

About alvanlii/wav2vec2-BERT-cantonese

This model is a fine-tuned version of facebook/w2v-bert-2.0. This has a CER of 10.27 on Common Voice 16 (yue) test set (without punctuations).

Related Automatic Speech Recognition

S pyannote/speaker-diarization-3.1 Automatic Speech Recognition 8.2M 2.5K 🤗 HF W argmaxinc/whisperkit-coreml Automatic Speech Recognition 8.1M 193 🤗 HF W openai/whisper-large-v3-turbo Automatic Speech Recognition ·808.9M params 7.1M 3.1K 🤗 HF W openai/whisper-base Automatic Speech Recognition ·72.6M params 6.2M 274 🤗 HF W jonatasgrosman/wav2vec2-large-xlsr-53-japanese Automatic Speech Recognition 6.1M 61 🤗 HF W openai/whisper-large-v3 Automatic Speech Recognition ·1.5B params 5.7M 5.9K 🤗 HF