bigscience/P3
Table of Contents
- Dataset Description
  - Dataset Summary
  - Supported Tasks and Leaderboards
  - Languages
- Dataset Structure
  - Data Instances
  - Data Fields
  - Data Splits
- Dataset Creation
  - Curation Rationale
  - Source Data
  - Annotations
- Additional Information
  - Licensing Information
  - Citation Information
  - Contributions
mlforge datasets pull bigscience/P3
Dataset details
About bigscience/P3
---
annotations_creators:
- crowdsourced
- expert-generated
language:
- en
license:
- apache-2.0
multilinguality:
- monolingual
size_categories:
- 100M<n<1B
task_categories:
- other
pretty_name: P3
dataset_info:
- config_name: adversarial_qa_dbert_answer_the_following_q
  features:
  - name: inputs
    sequence: int32
  - name: inputs_pretokenized
    dtype: string
  - name: targets
    sequence: int32
  - name: targets_pretokenized
    dtype: string
  splits:
  - name: train
    num_bytes: 18313753
    num_examples: 10000
  - name: validation
    num_bytes: 1791034
    num_examples: 1000
  download_size: 6288641
  dataset_size: 20104787
- config_name: adversarial_qa_dbert_based_on
  features:
  - name: inputs
    sequence: int32
  - name: inputs_pretokenized
    dtype: string
  - name: targets
    sequence: int32
  - name: targets_pretokenized
    dtype: string
  splits:
  - name: train
    num_bytes: 17580553
    num_examples: 10000
  - name: validation
    num_bytes: 1717566
    num_examples: 1000
  download_size: 6206744
  dataset_size: 19298119
- config_name: adversarial_qa_dbert_generate_question
  features:
  - name: inputs
    sequence: int32
  - name: inputs_pretokenized
    dtype: string
  - name: targets
    sequence: int32
  - name: targets_pretokenized
    dtype: string
  splits:
  - name: train
    num_bytes: 18552810
    num_examples: 10000
  - name: validation
    num_bytes: 1824231
    num_examples: 1000
  - name: test
    num_bytes: 1954952
    num_examples: 1000
  download_size: 5882604
  dataset_size: 22331993
- config_name: adversarial_qa_dbert_question_context_answer
  features:
  - name: inputs
    seque