HomeDatasetsDATA-MASK/FineWeb-Mask
F

DATA-MASK/FineWeb-Mask

Text Generation · DATA-MASK· 34.0K
apache-2.0 5.9 TB task_categories:text-generationlanguage:enlicense:apache-2.0size_categories:n>1Tarxiv:2512.24265

📜 DATAMASK Paper 💻 GitHub Repository 📦 Fineweb-Mask Dataset

Open in MLForge Sign up free Desktop app
# download instantly
mlforge datasets pull DATA-MASK/FineWeb-Mask

Dataset details

Task
Text Generation
Language
en
License
apache-2.0
Size
5.9 TB
Creator
DATA-MASK
Downloads
34.0K
Source
huggingface_datasets
Updated
2026-01-19

About DATA-MASK/FineWeb-Mask

📜 DATAMASK Paper 💻 GitHub Repository 📦 Fineweb-Mask Dataset