HomeDatasetsjapanese-asr/whisper_transcriptions.reazon_speech_all.wer_10.0.vectorized
W

japanese-asr/whisper_transcriptions.reazon_speech_all.wer_10.0.vectorized

General · japanese-asr· 53.3K
Unknown 1.4 TB size_categories:1M<n<10Mformat:parquetlibrary:datasetslibrary:dasklibrary:mlcroissant

HuggingFace Dataset

Open in MLForge Sign up free Desktop app
# download instantly
mlforge datasets pull japanese-asr/whisper_transcriptions.reazon_speech_all.wer_10.0.vectorized

Dataset details

Task
General
License
Unknown
Size
1.4 TB
Rows / images
689.7K
Creator
japanese-asr
Downloads
53.3K
Source
huggingface_datasets
Updated
2024-09-17

About japanese-asr/whisper_transcriptions.reazon_speech_all.wer_10.0.vectorized

--- datasetinfo: - configname: subset0 features: - name: transcription sequence: int64 - name: transcription/engpt3.5 sequence: int64 - name: whispertranscription sequence: int64 - name: whispertranscription/engpt3.5 sequence: int64 - name: inputfeatures sequence: sequence: float32 splits: - name: train numbytes: 44407083236 numexamples: 28889 downloadsize: 6430216790 datasetsize: 44407083236 - configname: subset1 features: - name: transcription sequence: int64 - name: transcription/engpt3.5 sequence: int64 - name: whispertranscription sequence: int64 - name: whispertranscription/engpt3.5 sequence: int64 - name: inputfeatures sequence: sequence: float32 splits: - name: train numbytes: 44089216600 numexamples: 28682 downloadsize: 6385763048 datasetsize: 44089216600 - configname: subset10 features: - name: transcription sequence: int64 - name: transcription/engpt3.5 sequence: int64 - name: whispertranscription sequence: int64 - name: whispertranscription/engpt3.5 sequence: int64 - name: inputfeatures sequence: sequence: float32 splits: - name: train numbytes: 43927652252 numexamples: 28577 downloadsize: 6336100250 datasetsize: 43927652252 - configname: subset100 features: - name: transcription sequence: int64 - name: transcription/engpt3.5 sequence: int64 - name: whispertranscription sequence: int64 - name: whispertranscription/engpt3.5 sequence: int64 - name: inputfeatures sequence: