HomeDatasetsm-a-p/FineFineWeb-sample
F

m-a-p/FineFineWeb-sample

Text Classification · m-a-p· 12.0K
apache-2.0 959 GB task_categories:text-classificationtask_categories:text-generationlanguage:enlicense:apache-2.0size_categories:100M<n<1B

FineFineWeb: A Comprehensive Study on Fine-Grained Domain Web Corpus

Open in MLForge Sign up free Desktop app
# download instantly
mlforge datasets pull m-a-p/FineFineWeb-sample

Dataset details

Task
Text Classification
Language
en
License
apache-2.0
Size
959 GB
Rows / images
1.7M
Creator
m-a-p
Downloads
12.0K
Source
huggingface_datasets
Updated
2024-12-19

About m-a-p/FineFineWeb-sample

FineFineWeb: A Comprehensive Study on Fine-Grained Domain Web Corpus