HomeModelsAny To AnyQwen/Qwen2.5-Omni-3B
Q

Qwen/Qwen2.5-Omni-3B

Any To Any·Qwen· 1.7M· 336
transformers other 5.5B params arxiv:2503.20215license:otherregion:us

any to any · transformers model

Open in MLForge Sign up free Desktop app Source ↗
# pull & run locally
pip install mlforge-sdk && mlforge pull Qwen/Qwen2.5-Omni-3B

Model details

Task
Any To Any
Provider
Qwen
Framework
transformers
Parameters
5.5B
License
other
Downloads
1.7M
Likes
336
Paper
arXiv:2503.20215
Updated
2025-04-30

About Qwen/Qwen2.5-Omni-3B

Overview Introduction Qwen2.5-Omni is an end-to-end multimodal model designed to perceive diverse modalities, including text, images, audio, and video, while simultaneously generating text and natural speech responses in a streaming manner.

Related Any To Any

G google/gemma-4-E4B-it Any To Any ·8.0B params 6.0M 1.3K 🤗 HF G google/gemma-4-E2B-it Any To Any ·5.1B params 2.3M 777 🤗 HF G google/gemma-4-12B-it Any To Any ·12.0B params 2.2M 1.2K 🤗 HF Q Qwen/Qwen3-Omni-30B-A3B-Instruct Any To Any ·35.3B params 2.0M 944 🤗 HF N nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-NVFP4 Any To Any ·18.3B params 1.7M 145 🤗 HF G google/gemma-4-12B-it-qat-w4a16-ct Any To Any ·13.3B params 1.7M 33 🤗 HF