open-r1/OpenR1-Math-220k

General · open-r1 · 39.2K downloads
apache-2.0 · 14 GB · language:en · license:apache-2.0 · size_categories:100K<n<1M · format:parquet · modality:text

# download instantly
mlforge datasets pull open-r1/OpenR1-Math-220k

Dataset details

Task: General
Language: en
License: apache-2.0
Size: 14 GB
Rows / images: 450.3K
Creator: open-r1
Downloads: 39.2K
Source: huggingface_datasets
Updated: 2025-02-18

About open-r1/OpenR1-Math-220k

Dataset description

OpenR1-Math-220k is a large-scale dataset for mathematical reasoning. It consists of 220k math problems taken from NuminaMath 1.5, each with two to four reasoning traces generated by DeepSeek R1. Most samples were verified using Math Verify, and 12% of the samples were verified using Llama-3.3-70B-Instruct as a judge. Each problem contains at least one reasoning trace with a correct answer.
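
Because each problem carries several traces and at least one is correct, a common first step is to pick a verified trace per problem. The following is a minimal sketch, not the dataset authors' tooling. It assumes the Hugging Face `datasets` library is installed, that the split is named `train`, and that each row stores its traces in a `generations` list with a parallel list of booleans in `correctness_math_verify`. Those column names and the split name are assumptions; check them against the actual schema before relying on them.

```python
from itertools import islice
from typing import Any, Mapping, Optional


def first_correct_trace(
    row: Mapping[str, Any],
    traces_key: str = "generations",            # assumed column name
    flags_key: str = "correctness_math_verify",  # assumed column name
) -> Optional[str]:
    """Return the first trace whose correctness flag is truthy, else None."""
    traces = row.get(traces_key) or []
    flags = row.get(flags_key) or []
    for trace, is_correct in zip(traces, flags):
        if is_correct:
            return trace
    return None


def sample_rows(limit: int = 3) -> list:
    """Stream a few rows without downloading the full 14 GB."""
    # Imported here so the helper above works without the library installed.
    from datasets import load_dataset  # pip install datasets

    stream = load_dataset(
        "open-r1/OpenR1-Math-220k",
        split="train",   # assumed split name
        streaming=True,
    )
    return list(islice(stream, limit))


if __name__ == "__main__":
    for row in sample_rows():
        trace = first_correct_trace(row)
        print("verified trace found" if trace else "no verified trace")
```

Streaming avoids pulling the whole dataset up front, which is convenient for inspecting the schema. Rows judged only by Llama-3.3-70B-Instruct may need a different flag column, which can be passed through `flags_key`.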