openai/gsm8k

Text Generation · openai · 880.9K
MIT · 4.8 MB
Tags: benchmark:official, benchmark:eval-yaml, task_categories:text-generation, annotations_creators:crowdsourced, language_creators:crowdsourced

Dataset Card for GSM8K

Dataset Summary

GSM8K (Grade School Math 8K) is a dataset of 8.5K high-quality, linguistically diverse grade school math word problems. The dataset was created to support the task of question answering on basic mathematical problems that require multi-step reasoning. These problems take between 2 and 8 steps to solve. Solutions primarily involve performing a sequence of elementary calculations using basic arithmetic operations (+ − × ÷) to reach the…

See the full description on the dataset page: https://huggingface.co/datasets/openai/gsm8k.
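To make the "sequence of elementary calculations" concrete, here is a minimal Python sketch that pulls the intermediate steps and the final result out of a solution string. This page does not describe the record layout, so the `question` and `answer` field names, the `<<expression=result>>` step annotations, and the `#### ` final-answer marker are all assumptions to check against the full dataset card. The sample record is invented for illustration and is not taken from the dataset.

```python
import re

# Hypothetical record, written for illustration only (not from the dataset).
# Field names and annotation format are assumptions; verify on the dataset card.
SAMPLE = {
    "question": (
        "A box holds 12 pencils. A teacher buys 3 boxes and hands out "
        "20 pencils. How many pencils are left?"
    ),
    "answer": (
        "The teacher buys 12*3 = <<12*3=36>>36 pencils.\n"
        "After handing out 20, 36-20 = <<36-20=16>>16 pencils remain.\n"
        "#### 16"
    ),
}

# Assumed step annotation: <<expression=result>>
CALC_PATTERN = re.compile(r"<<([^<>=]+)=([^<>]+)>>")
# Assumed final-answer marker: "#### " followed by a number
FINAL_PATTERN = re.compile(r"####\s*(-?[\d,]*\.?\d+)")


def extract_steps(answer):
    """Return (expression, result) pairs for each annotated calculation."""
    return [
        (expr.strip(), result.strip())
        for expr, result in CALC_PATTERN.findall(answer)
    ]


def extract_final_answer(answer):
    """Return the final numeric answer as a string, or None if no marker is found."""
    match = FINAL_PATTERN.search(answer)
    if match is None:
        return None
    # Drop thousands separators so "1,000" and "1000" compare equal.
    return match.group(1).replace(",", "")


if __name__ == "__main__":
    for expr, result in extract_steps(SAMPLE["answer"]):
        print(f"{expr} = {result}")
    print("final:", extract_final_answer(SAMPLE["answer"]))
```

The final answer is kept as a string so that it can be compared exactly against a model's output; convert it to a number only if the comparison calls for it.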

# download instantly
mlforge datasets pull openai/gsm8k

Dataset details

Task
Text Generation
Language
en
License
MIT
Size
4.8 MB
Creator
openai
Downloads
880.9K
Source
huggingface_datasets
Updated
2026-03-23

About openai/gsm8k

Table of Contents

- Dataset Description
  - Dataset Summary
  - Supported Tasks
  - Languages
- Dataset Structure
  - Data Instances
  - Data Fields
  - Data Splits
- Dataset Creation
  - Curation Rationale
  - Source Data
  - Annotations
  - Personal and Sensitive Information
- Considerations for Using the Data
  - Social Impact of Dataset
  - Discussion of Biases
  - Other Known Limitations
- Additional Information
  - Dataset Curators
  - Licensing Information
  - Citation Information