HomeModelsAudio Text To TextQwen/Qwen2-Audio-7B-Instruct
Q

Qwen/Qwen2-Audio-7B-Instruct

Audio Text To Text·Qwen· 670.1K· 543
transformers apache-2.0 8.4B params arxiv:2407.10759arxiv:2311.07919license:apache-2.0

Qwen2-Audio is the new series of Qwen large audio-language models. Qwen2-Audio is capable of accepting various audio signal inputs and performing audio analysis or direct textual responses with regard to speech instructions. We introduce two distinct audio interaction modes:

Open in MLForge Sign up free Desktop app Source ↗
# pull & run locally
pip install mlforge-sdk && mlforge pull Qwen/Qwen2-Audio-7B-Instruct

Model details

Task
Audio Text To Text
Provider
Qwen
Framework
transformers
Parameters
8.4B
Size
16 GB
License
apache-2.0
Downloads
670.1K
Likes
543
Paper
arXiv:2407.10759
Updated
2025-01-12

About Qwen/Qwen2-Audio-7B-Instruct

Qwen2-Audio is the new series of Qwen large audio-language models. Qwen2-Audio is capable of accepting various audio signal inputs and performing audio analysis or direct textual responses with regard to speech instructions. We introduce two distinct audio interaction modes:

Related Audio Text To Text

U fixie-ai/ultravox-v0_5-llama-3_2-1b Audio Text To Text ·683.1M params 1.1M 88 🤗 HF V microsoft/VibeVoice-ASR-HF Audio Text To Text ·8.3B params 665.1K 156 🤗 HF U fixie-ai/ultravox-v0_6-llama-3_1-8b Audio Text To Text ·687.3M params 645.4K 6 🤗 HF