HomeModelsAudio Text To Textfixie-ai/ultravox-v0_6-llama-3_1-8b
U

fixie-ai/ultravox-v0_6-llama-3_1-8b

Audio Text To Text·fixie-ai· 645.4K· 6
transformers mit 687.3M params

Ultravox is a multimodal Speech LLM built around a pretrained LLM (Llama, Gemma, Qwen, etc) and a speech encoder (whisper-large-v3-turbo) backbone.

Open in MLForge Sign up free Desktop app Source ↗
# pull & run locally
pip install mlforge-sdk && mlforge pull fixie-ai/ultravox-v0_6-llama-3_1-8b

Model details

Task
Audio Text To Text
Provider
fixie-ai
Framework
transformers
Parameters
687.3M
Size
1.3 GB
License
mit
Downloads
645.4K
Likes
6
Updated
2025-07-05

About fixie-ai/ultravox-v0_6-llama-3_1-8b

Ultravox is a multimodal Speech LLM built around a pretrained LLM (Llama, Gemma, Qwen, etc) and a speech encoder (whisper-large-v3-turbo) backbone.

Related Audio Text To Text

U fixie-ai/ultravox-v0_5-llama-3_2-1b Audio Text To Text ·683.1M params 1.1M 88 🤗 HF Q Qwen/Qwen2-Audio-7B-Instruct Audio Text To Text ·8.4B params 670.1K 543 🤗 HF V microsoft/VibeVoice-ASR-HF Audio Text To Text ·8.3B params 665.1K 156 🤗 HF