HomeDatasetsbigscience/P3
P

bigscience/P3

Other · bigscience· 41.9K
["apache-2.0"] 176 GB task_categories:otherannotations_creators:crowdsourcedannotations_creators:expert-generatedmultilinguality:monolinguallanguage:en

Table of Contents - Table of Contents - Dataset Description - Dataset Summary - Supported Tasks and Leaderboards - Languages - Dataset Structure - Data Instances - Data Fields - Data Splits - Dataset Creation - Curation Rationale - Source Data - Annotations - Additional Information - Licensing Information - Citation Information - Contributions

Open in MLForge Sign up free Desktop app
# download instantly
mlforge datasets pull bigscience/P3

Dataset details

Task
Other
Language
en
License
["apache-2.0"]
Size
176 GB
Rows / images
122.0M
Creator
bigscience
Downloads
41.9K
Source
huggingface_datasets
Updated
2024-03-04

About bigscience/P3

--- annotationscreators: - crowdsourced - expert-generated language: - en license: - apache-2.0 multilinguality: - monolingual sizecategories: - 100M<n<1B taskcategories: - other prettyname: P3 datasetinfo: - configname: adversarialqadbertanswerthefollowingq features: - name: inputs sequence: int32 - name: inputspretokenized dtype: string - name: targets sequence: int32 - name: targetspretokenized dtype: string splits: - name: train numbytes: 18313753 numexamples: 10000 - name: validation numbytes: 1791034 numexamples: 1000 downloadsize: 6288641 datasetsize: 20104787 - configname: adversarialqadbertbasedon features: - name: inputs sequence: int32 - name: inputspretokenized dtype: string - name: targets sequence: int32 - name: targetspretokenized dtype: string splits: - name: train numbytes: 17580553 numexamples: 10000 - name: validation numbytes: 1717566 numexamples: 1000 downloadsize: 6206744 datasetsize: 19298119 - configname: adversarialqadbertgeneratequestion features: - name: inputs sequence: int32 - name: inputspretokenized dtype: string - name: targets sequence: int32 - name: targetspretokenized dtype: string splits: - name: train numbytes: 18552810 numexamples: 10000 - name: validation numbytes: 1824231 numexamples: 1000 - name: test numbytes: 1954952 numexamples: 1000 downloadsize: 5882604 datasetsize: 22331993 - configname: adversarialqadbertquestioncontextanswer features: - name: inputs seque