HomeModelsAudio Text To Textfixie-ai/ultravox-v0_5-llama-3_2-1b
U

fixie-ai/ultravox-v0_5-llama-3_2-1b

Audio Text To Text·fixie-ai· 1.1M· 88
transformers mit 683.1M params

Ultravox is a multimodal Speech LLM built around a pretrained Llama3.2-1B-Instruct and whisper-large-v3-turbo backbone.

Open in MLForge Sign up free Desktop app Source ↗
# pull & run locally
pip install mlforge-sdk && mlforge pull fixie-ai/ultravox-v0_5-llama-3_2-1b

Model details

Task
Audio Text To Text
Provider
fixie-ai
Framework
transformers
Parameters
683.1M
Size
1.7 GB
License
mit
Downloads
1.1M
Likes
88
Updated
2026-03-11

About fixie-ai/ultravox-v0_5-llama-3_2-1b

Ultravox is a multimodal Speech LLM built around a pretrained Llama3.2-1B-Instruct and whisper-large-v3-turbo backbone.

Related Audio Text To Text

Q Qwen/Qwen2-Audio-7B-Instruct Audio Text To Text ·8.4B params 670.1K 543 🤗 HF V microsoft/VibeVoice-ASR-HF Audio Text To Text ·8.3B params 665.1K 156 🤗 HF U fixie-ai/ultravox-v0_6-llama-3_1-8b Audio Text To Text ·687.3M params 645.4K 6 🤗 HF