HomeDatasetsjhu-clsp/ettin-pretraining-data
E

jhu-clsp/ettin-pretraining-data

Text Generation · jhu-clsp· 88.3K
mit 2.4 TB task_categories:text-generationtask_categories:fill-masktask_categories:text-classificationlanguage:enlicense:mit

[](https://opensource.org/licenses/MIT) [](https://arxiv.org/abs/2507.11412) [](https://huggingface.co/jhu-clsp) [](https://github.com/jhu-clsp/ettin-encoder-vs-decoder)

Open in MLForge Sign up free Desktop app
# download instantly
mlforge datasets pull jhu-clsp/ettin-pretraining-data

Dataset details

Task
Text Generation
Language
en
License
mit
Size
2.4 TB
Creator
jhu-clsp
Downloads
88.3K
Source
huggingface_datasets
Updated
2025-07-18

About jhu-clsp/ettin-pretraining-data

[](https://opensource.org/licenses/MIT) [](https://arxiv.org/abs/2507.11412) [](https://huggingface.co/jhu-clsp) [](https://github.com/jhu-clsp/ettin-encoder-vs-decoder)