automatic speech recognition · transformers model
This model is facebook/wav2vec2-large-xlsr-53 fine-tuned on Estonian using the Common Voice dataset. When using this model, make sure that your speech input is sampled at 16 kHz.
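
The sampling-rate requirement can be met by resampling audio before it reaches the model. The following is a minimal sketch, not the author's reference code: it assumes `torch`, `torchaudio`, and `transformers` are installed, that the checkpoint loads with `Wav2Vec2Processor` and `Wav2Vec2ForCTC`, and that greedy CTC decoding is acceptable. The helper names `needs_resampling` and `transcribe`, the `model_id` argument, and the file path are hypothetical; the model's repository ID is not stated here, so it is left as a parameter.

```python
TARGET_SAMPLE_RATE = 16_000  # the model expects 16 kHz input


def needs_resampling(sample_rate: int) -> bool:
    """Return True when audio at `sample_rate` must be converted to 16 kHz."""
    return sample_rate != TARGET_SAMPLE_RATE


def transcribe(path: str, model_id: str) -> str:
    """Load an audio file, resample it to 16 kHz if needed, and transcribe it.

    `model_id` is a placeholder for this model's Hub ID (an assumption:
    the ID is not given in the text above).
    """
    # Heavy dependencies are imported lazily so the helper above stays usable
    # without them.
    import torch
    import torchaudio
    from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

    waveform, sample_rate = torchaudio.load(path)  # shape: (channels, samples)
    waveform = waveform.mean(dim=0)                # mix down to mono

    if needs_resampling(sample_rate):
        waveform = torchaudio.functional.resample(
            waveform, orig_freq=sample_rate, new_freq=TARGET_SAMPLE_RATE
        )

    processor = Wav2Vec2Processor.from_pretrained(model_id)
    model = Wav2Vec2ForCTC.from_pretrained(model_id)

    inputs = processor(
        waveform.numpy(),
        sampling_rate=TARGET_SAMPLE_RATE,
        return_tensors="pt",
        padding=True,
    )
    with torch.no_grad():
        logits = model(**inputs).logits

    # Greedy decoding: pick the most likely token at each frame.
    predicted_ids = torch.argmax(logits, dim=-1)
    return processor.batch_decode(predicted_ids)[0]
```

A call would look like `transcribe("sample.wav", model_id)`, where `model_id` is this model's Hub ID and `sample.wav` is any local recording; both are placeholders.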