HomeModelsText GenerationEleutherAI/pythia-70m-deduped
P

EleutherAI/pythia-70m-deduped

Text Generation·EleutherAI· 1.9M· 28
transformers apache-2.0 95.6M params dataset:EleutherAI/the_pile_deduplicatedarxiv:2304.01373arxiv:2101.00027arxiv:2201.07311

text generation · transformers model

Open in MLForge Sign up free Desktop app Source ↗
# pull & run locally
pip install mlforge-sdk && mlforge pull EleutherAI/pythia-70m-deduped

Model details

Task
Text Generation
Provider
EleutherAI
Framework
transformers
Parameters
95.6M
License
apache-2.0
Downloads
1.9M
Likes
28
Paper
arXiv:2304.01373
Updated
2023-07-09

About EleutherAI/pythia-70m-deduped

The Pythia Scaling Suite is a collection of models developed to facilitate interpretability research (see paper). It contains two sets of eight models of sizes 70M, 160M, 410M, 1B, 1.4B, 2.8B, 6.9B, and 12B. For each size, there are two models: one trained on the Pile, and one trained on the Pile after the dataset has been globally deduplicated. All 8 model sizes are trained on the exact same data, in the exact same order. We also provide 154 intermediate checkpoints per model, hosted on Hugging Face as branches.

Related Text Generation

Q Qwen/Qwen3-0.6B Text Generation ·751.6M params 27.8M 1.4K 🤗 HF Q Qwen/Qwen3-4B Text Generation ·4.0B params 16.4M 641 🤗 HF G openai-community/gpt2 Text Generation ·137.0M params 13.3M 3.3K 🤗 HF Q Qwen/Qwen3-8B Text Generation ·8.2B params 13.0M 1.2K 🤗 HF Q Qwen/Qwen2.5-7B-Instruct Text Generation ·7.6B params 12.8M 1.4K 🤗 HF O facebook/opt-125m Text Generation 12.3M 267 🤗 HF