HomeDatasetsgoogle/fleurs
F

google/fleurs

Automatic Speech Recognition · google· 72.8K
["cc-by-4.0"] 830 GB task_categories:automatic-speech-recognitionannotations_creators:expert-generatedannotations_creators:crowdsourcedannotations_creators:machine-generatedlanguage_creators:crowdsourced

- Fine-Tuning script: pytorch/speech-recognition - Paper: FLEURS: Few-shot Learning Evaluation of Universal Representations of Speech - Total amount of disk used: ca. 350 GB

Open in MLForge Sign up free Desktop app
# download instantly
mlforge datasets pull google/fleurs

Dataset details

Task
Automatic Speech Recognition
Language
afr
License
["cc-by-4.0"]
Size
830 GB
Rows / images
768.1K
Creator
google
Downloads
72.8K
Source
huggingface_datasets
Updated
2026-05-15

About google/fleurs

--- annotationscreators: - expert-generated - crowdsourced - machine-generated languagecreators: - crowdsourced - expert-generated language: - afr - amh - ara - asm - ast - azj - bel - ben - bos - cat - ceb - cmn - ces - cym - dan - deu - ell - eng - spa - est - fas - ful - fin - tgl - fra - gle - glg - guj - hau - heb - hin - hrv - hun - hye - ind - ibo - isl - ita - jpn - jav - kat - kam - kea - kaz - khm - kan - kor - ckb - kir - ltz - lug - lin - lao - lit - luo - lav - mri - mkd - mal - mon - mar - msa - mlt - mya - nob - npi - nld - nso - nya - oci - orm - ory - pan - pol - pus - por - ron - rus - bul - snd - slk - slv - sna - som - srp - swe - swh - tam - tel - tgk - tha - tur - ukr - umb - urd - uzb - vie - wol - xho - yor - yue - zul license: - cc-by-4.0 multilinguality: - multilingual sizecategories: - 10K<n<100K taskcategories: - automatic-speech-recognition taskids: [] prettyname: 'The Cross-lingual TRansfer Evaluation of Multilingual Encoders for Speech (XTREME-S) benchmark is a benchmark designed to evaluate speech representations across languages, tasks, domains and data regimes. It covers 102 languages from 10+ language families, 3 different domains and 4 task families: speech recognition, translation, classification and retrieval.' tags: - speech-recognition datasetinfo: - configname: afza features: - name: id dtype: int32 - name: numsamples dtype: int32 - name: path dtype: string - name: audio dtype: audio: samplingrate: 16000 - name: transcription dtype: string - name: rawtranscription dtype: str