HuggingFaceFW/fineweb-edu

Text Generation · HuggingFaceFW · 399.4K downloads
odc-by · 5.6 TB
Tags: task_categories:text-generation, language:en, license:odc-by, size_categories:1B<n<10B, format:parquet

1.3 trillion tokens of the finest educational data the 🌐 web has to offer

# download instantly
mlforge datasets pull HuggingFaceFW/fineweb-edu

Dataset details

Task: Text Generation
Language: en
License: odc-by
Size: 5.6 TB
Rows / images: 3.5B
Creator: HuggingFaceFW
Downloads: 399.4K
Source: huggingface_datasets
Updated: 2025-07-11
