Qwen/Qwen2-Audio-7B-Instruct

Audio Text To Text·Qwen· 670.1K· 543

transformers apache-2.0 8.4B params arxiv:2407.10759arxiv:2311.07919license:apache-2.0

Qwen2-Audio is the new series of Qwen large audio-language models. Qwen2-Audio is capable of accepting various audio signal inputs and performing audio analysis or direct textual responses with regard to speech instructions. We introduce two distinct audio interaction modes:

Open in MLForge Sign up free Desktop app Source ↗

# pull & run locally
pip install mlforge-sdk && mlforge pull Qwen/Qwen2-Audio-7B-Instruct

Model details

Task

Audio Text To Text

Provider

Qwen

Framework

transformers

Parameters

8.4B

Size

16 GB

License

apache-2.0

Downloads

670.1K

Likes

543

Paper

arXiv:2407.10759

Updated

2025-01-12

About Qwen/Qwen2-Audio-7B-Instruct

Related Audio Text To Text

U fixie-ai/ultravox-v0_5-llama-3_2-1b Audio Text To Text ·683.1M params 1.1M 88 🤗 HF V microsoft/VibeVoice-ASR-HF Audio Text To Text ·8.3B params 665.1K 156 🤗 HF U fixie-ai/ultravox-v0_6-llama-3_1-8b Audio Text To Text ·687.3M params 645.4K 6 🤗 HF