HomeModelsText To SpeechQwen/Qwen3-TTS-12Hz-1.7B-CustomVoice
Q

Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice

Text To Speech·Qwen· 2.0M· 1.6K
apache-2.0 1.9B params arxiv:2601.15621license:apache-2.0region:us

text to speech model

Open in MLForge Sign up free Desktop app Source ↗
# pull & run locally
pip install mlforge-sdk && mlforge pull Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice

Model details

Task
Text To Speech
Provider
Qwen
Parameters
1.9B
License
apache-2.0
Downloads
2.0M
Likes
1.6K
Paper
arXiv:2601.15621
Updated
2026-01-29

About Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice

Qwen3-TTS covers 10 major languages (Chinese, English, Japanese, Korean, German, French, Russian, Portuguese, Spanish, and Italian) as well as multiple dialectal voice profiles to meet global application needs. In addition, the models feature strong contextual understanding, enabling adaptive control of tone, speaking rate, and emotional expression based on instructions and text semantics, and they show markedly improved robustness to noisy input text. Key features:

Related Text To Speech

K hexgrad/Kokoro-82M Text To Speech 15.5M 6.4K 🤗 HF X coqui/XTTS-v2 Text To Speech 9.2M 3.6K 🤗 HF C ResembleAI/chatterbox Text To Speech 2.2M 1.7K 🤗 HF Q Qwen/Qwen3-TTS-12Hz-0.6B-CustomVoice Text To Speech ·905.8M params 1.2M 163 🤗 HF O k2-fsa/OmniVoice Text To Speech ·612.6M params 1.0M 1.1K 🤗 HF F SWivid/F5-TTS Text To Speech 800.8K 1.2K 🤗 HF