A
OpenSQZ/AutoMathText-V2
Text Generation · OpenSQZ
· 122.2K
Unknown
7.1 TB
task_categories:text-generationtask_categories:question-answeringlanguage:enlanguage:zhsize_categories:100M<n<1B
🚀 AutoMathText-V2: A 2.46 Trillion Token AI-Curated STEM Pretraining Dataset
# download instantly
mlforge datasets pull OpenSQZ/AutoMathText-V2
Dataset details
Source
huggingface_datasets
About OpenSQZ/AutoMathText-V2
🚀 AutoMathText-V2: A 2.46 Trillion Token AI-Curated STEM Pretraining Dataset