
k2-fsa/OmniVoice


# pull & run locally
pip install mlforge-sdk && mlforge pull k2-fsa/OmniVoice

Model details

Task: Text To Speech
Provider: k2-fsa
Framework: omnivoice
Parameters: 612.6M
License: apache-2.0
Downloads: 1.0M
Likes: 1.1K
Paper: arXiv:2604.00688
Updated: 2026-05-07

About k2-fsa/OmniVoice

OmniVoice is a massively multilingual zero-shot text-to-speech (TTS) model that supports more than 600 languages. Built on a novel diffusion language model-style architecture, it delivers high-quality speech with fast inference and supports both voice cloning and voice design.

Related Text To Speech

hexgrad/Kokoro-82M · Text To Speech · 15.5M downloads · 6.4K likes
coqui/XTTS-v2 · Text To Speech · 9.2M downloads · 3.6K likes
ResembleAI/chatterbox · Text To Speech · 2.2M downloads · 1.7K likes
Qwen/Qwen3-TTS-12Hz-1.7B-CustomVoice · Text To Speech · 1.9B params · 2.0M downloads · 1.6K likes
Qwen/Qwen3-TTS-12Hz-0.6B-CustomVoice · Text To Speech · 905.8M params · 1.2M downloads · 163 likes
SWivid/F5-TTS · Text To Speech · 800.8K downloads · 1.2K likes