HomeModelsText GenerationQwen/Qwen3-4B-Instruct-2507-FP8
Q

Qwen/Qwen3-4B-Instruct-2507-FP8

Text Generation·Qwen· 873.2K· 78
transformers apache-2.0 4.4B params arxiv:2505.09388base_model:Qwen/Qwen3-4B-Instruct-2507base_model:quantized:Qwen/Qwen3-4B-Instruct-2507license:apache-2.0

We introduce the updated version of the Qwen3-4B-FP8 non-thinking mode, named Qwen3-4B-Instruct-2507-FP8, featuring the following key enhancements:

Open in MLForge Sign up free Desktop app Source ↗
# pull & run locally
pip install mlforge-sdk && mlforge pull Qwen/Qwen3-4B-Instruct-2507-FP8

Model details

Task
Text Generation
Provider
Qwen
Framework
transformers
Parameters
4.4B
Size
4.8 GB
License
apache-2.0
Downloads
873.2K
Likes
78
Paper
arXiv:2505.09388
Updated
2025-09-17

About Qwen/Qwen3-4B-Instruct-2507-FP8

We introduce the updated version of the Qwen3-4B-FP8 non-thinking mode, named Qwen3-4B-Instruct-2507-FP8, featuring the following key enhancements:

Related Text Generation

Q Qwen/Qwen3-0.6B Text Generation ·751.6M params 27.8M 1.4K 🤗 HF Q Qwen/Qwen3-4B Text Generation ·4.0B params 16.4M 641 🤗 HF G openai-community/gpt2 Text Generation ·137.0M params 13.3M 3.3K 🤗 HF Q Qwen/Qwen3-8B Text Generation ·8.2B params 13.0M 1.2K 🤗 HF Q Qwen/Qwen2.5-7B-Instruct Text Generation ·7.6B params 12.8M 1.4K 🤗 HF O facebook/opt-125m Text Generation 12.3M 267 🤗 HF