
allenai/dolma3_dolmino_mix-100B-1025

Text Generation · allenai · 23.5K
odc-by · 195 GB
Tags: task_categories:text-generation, language:en, license:odc-by, size_categories:10M<n<100M, modality:text

The Dolma 3 Dolmino Mix (100B) is the mixture of high-quality data used for the second stage of training the Olmo 3 7B model.

# download instantly
mlforge datasets pull allenai/dolma3_dolmino_mix-100B-1025
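
Since the listing gives huggingface_datasets as the source, the data can probably also be inspected without pulling all 195 GB. The following is a minimal sketch, assuming the Hugging Face `datasets` library is installed and that the repository exposes a split named `train`. The split name and the helper `stream_examples` are illustrative assumptions, not something this page states.

```python
from itertools import islice
from typing import Any, Dict, Iterator

# Dataset identifier exactly as shown on this page.
DATASET_ID = "allenai/dolma3_dolmino_mix-100B-1025"


def stream_examples(limit: int = 3, split: str = "train") -> Iterator[Dict[str, Any]]:
    """Yield up to `limit` records without downloading the full dataset.

    The default split name "train" is an assumption; check the dataset
    card for the splits and configurations that actually exist.
    """
    # Imported lazily so this module loads even where `datasets` is missing.
    from datasets import load_dataset

    # streaming=True returns an iterable dataset that fetches records on demand.
    dataset = load_dataset(DATASET_ID, split=split, streaming=True)
    yield from islice(dataset, limit)
```

Iterating with `for record in stream_examples(): print(sorted(record))` would list the field names of the first few records. The page does not document the record schema, so the sketch prints keys rather than assuming a particular text field.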

Dataset details

Task: Text Generation
Language: en
License: odc-by
Size: 195 GB
Rows / images: 2.8M
Creator: allenai
Downloads: 23.5K
Source: huggingface_datasets
Updated: 2026-01-05
