MMLForge Search models, datasets, tasks… Start free

Home › Models › Text Generation › Qwen/Qwen3-8B-Base

Q

Qwen/Qwen3-8B-Base

Text Generation·Qwen· 427.2K· 110

transformers apache-2.0 8.2B params arxiv:2505.09388license:apache-2.0region:us

Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Building upon extensive advancements in training data, model architecture, and optimization techniques, Qwen3 delivers the following key improvements over the previously released Qwen2.5:

Open in MLForge Sign up free Desktop app Source ↗

# pull & run locally
pip install mlforge-sdk && mlforge pull Qwen/Qwen3-8B-Base

Model details

Task

Text Generation

Provider

Qwen

Framework

transformers

Parameters

8.2B

Size

15 GB

License

apache-2.0

Downloads

427.2K

Likes

110

Paper

arXiv:2505.09388

Updated

2025-05-21

About Qwen/Qwen3-8B-Base

Qwen3 is the latest generation of large language models in Qwen series, offering a comprehensive suite of dense and mixture-of-experts (MoE) models. Building upon extensive advancements in training data, model architecture, and optimization techniques, Qwen3 delivers the following key improvements over the previously released Qwen2.5:

Related Text Generation

Q Qwen/Qwen3-0.6B Text Generation ·751.6M params 27.8M 1.4K 🤗 HF Q Qwen/Qwen3-4B Text Generation ·4.0B params 16.4M 641 🤗 HF G openai-community/gpt2 Text Generation ·137.0M params 13.3M 3.3K 🤗 HF Q Qwen/Qwen3-8B Text Generation ·8.2B params 13.0M 1.2K 🤗 HF Q Qwen/Qwen2.5-7B-Instruct Text Generation ·7.6B params 12.8M 1.4K 🤗 HF O facebook/opt-125m Text Generation 12.3M 267 🤗 HF