HomeDatasetsallenai/dolma3_pool
D

allenai/dolma3_pool

Text Generation · allenai· 37.8K
odc-by 16 TB task_categories:text-generationlanguage:enlicense:odc-byarxiv:2512.13961region:us

This is the Dolma 3 pool, pre–quality upsampling and mixing. If you are interested in the data used to train Olmo 3 7B and Olmo 3 32B, visit allenai/dolma3mix-6T-1025.

Open in MLForge Sign up free Desktop app
# download instantly
mlforge datasets pull allenai/dolma3_pool

Dataset details

Task
Text Generation
Language
en
License
odc-by
Size
16 TB
Creator
allenai
Downloads
37.8K
Source
huggingface_datasets
Updated
2026-02-24

About allenai/dolma3_pool

This is the Dolma 3 pool, pre–quality upsampling and mixing. If you are interested in the data used to train Olmo 3 7B and Olmo 3 32B, visit allenai/dolma3mix-6T-1025.