OpenSQZ/AutoMathText-V2

Text Generation · OpenSQZ · 122.2K downloads
License: Unknown · Size: 7.1 TB
Tags: task_categories:text-generation, task_categories:question-answering, language:en, language:zh, size_categories:100M<n<1B

🚀 AutoMathText-V2: A 2.46 Trillion Token AI-Curated STEM Pretraining Dataset

# download instantly
mlforge datasets pull OpenSQZ/AutoMathText-V2

Dataset details

Task: Text Generation
Language: en
License: Unknown
Size: 7.1 TB
Rows / images: 276.4M
Creator: OpenSQZ
Downloads: 122.2K
Source: huggingface_datasets
Updated: 2026-06-09

About OpenSQZ/AutoMathText-V2
