HomeDatasetsHuggingFaceFW/fineweb
F

HuggingFaceFW/fineweb

Text Generation · HuggingFaceFW· 318.9K
odc-by 107 TB task_categories:text-generationlanguage:enlicense:odc-bysize_categories:10B<n<100Bmodality:tabular

15 trillion tokens of the finest data the 🌐 web has to offer

Open in MLForge Sign up free Desktop app
# download instantly
mlforge datasets pull HuggingFaceFW/fineweb

Dataset details

Task
Text Generation
Language
en
License
odc-by
Size
107 TB
Rows / images
52.5B
Creator
HuggingFaceFW
Downloads
318.9K
Source
huggingface_datasets
Updated
2025-07-11

About HuggingFaceFW/fineweb

15 trillion tokens of the finest data the 🌐 web has to offer