japanese-asr/whisper_transcriptions.reazon_speech_all.wer_10.0.vectorized
HuggingFace Dataset
mlforge datasets pull japanese-asr/whisper_transcriptions.reazon_speech_all.wer_10.0.vectorized
Dataset details
About japanese-asr/whisper_transcriptions.reazon_speech_all.wer_10.0.vectorized
--- datasetinfo: - configname: subset0 features: - name: transcription sequence: int64 - name: transcription/engpt3.5 sequence: int64 - name: whispertranscription sequence: int64 - name: whispertranscription/engpt3.5 sequence: int64 - name: inputfeatures sequence: sequence: float32 splits: - name: train numbytes: 44407083236 numexamples: 28889 downloadsize: 6430216790 datasetsize: 44407083236 - configname: subset1 features: - name: transcription sequence: int64 - name: transcription/engpt3.5 sequence: int64 - name: whispertranscription sequence: int64 - name: whispertranscription/engpt3.5 sequence: int64 - name: inputfeatures sequence: sequence: float32 splits: - name: train numbytes: 44089216600 numexamples: 28682 downloadsize: 6385763048 datasetsize: 44089216600 - configname: subset10 features: - name: transcription sequence: int64 - name: transcription/engpt3.5 sequence: int64 - name: whispertranscription sequence: int64 - name: whispertranscription/engpt3.5 sequence: int64 - name: inputfeatures sequence: sequence: float32 splits: - name: train numbytes: 43927652252 numexamples: 28577 downloadsize: 6336100250 datasetsize: 43927652252 - configname: subset100 features: - name: transcription sequence: int64 - name: transcription/engpt3.5 sequence: int64 - name: whispertranscription sequence: int64 - name: whispertranscription/engpt3.5 sequence: int64 - name: inputfeatures sequence: