Qwen/Qwen2.5-Omni-7B

Any To Any·Qwen· 647.0K· 1.9K

transformers other 10.7B params arxiv:2503.20215license:otherregion:us

Overview Introduction Qwen2.5-Omni is an end-to-end multimodal model designed to perceive diverse modalities, including text, images, audio, and video, while simultaneously generating text and natural speech responses in a streaming manner.

Open in MLForge Sign up free Desktop app Source ↗

# pull & run locally
pip install mlforge-sdk && mlforge pull Qwen/Qwen2.5-Omni-7B

Model details

Task

Any To Any

Provider

Qwen

Framework

transformers

Parameters

10.7B

Size

21 GB

License

other

Downloads

647.0K

Likes

1.9K

Paper

arXiv:2503.20215

Updated

2025-04-30

About Qwen/Qwen2.5-Omni-7B

Related Any To Any

G google/gemma-4-E4B-it Any To Any ·8.0B params 6.0M 1.3K 🤗 HF G google/gemma-4-E2B-it Any To Any ·5.1B params 2.3M 777 🤗 HF G google/gemma-4-12B-it Any To Any ·12.0B params 2.2M 1.2K 🤗 HF Q Qwen/Qwen3-Omni-30B-A3B-Instruct Any To Any ·35.3B params 2.0M 944 🤗 HF N nvidia/Nemotron-3-Nano-Omni-30B-A3B-Reasoning-NVFP4 Any To Any ·18.3B params 1.7M 145 🤗 HF G google/gemma-4-12B-it-qat-w4a16-ct Any To Any ·13.3B params 1.7M 33 🤗 HF