Qwen/Qwen3-VL-30B-A3B-Instruct

Image Text To Text · Qwen · 548.8K downloads · 582 likes

transformers apache-2.0 31.1B params arxiv:2505.09388 arxiv:2502.13923 arxiv:2409.12191 arxiv:2308.12966 license:apache-2.0 deploy:azure

Meet Qwen3-VL — the most powerful vision-language model in the Qwen series to date.

# pull & run locally
pip install mlforge-sdk && mlforge pull Qwen/Qwen3-VL-30B-A3B-Instruct

Model details

Task: Image Text To Text
Provider: Qwen
Framework: transformers
Parameters: 31.1B
Size: 58 GB
License: apache-2.0
Downloads: 548.8K
Likes: 582
Paper: arXiv:2505.09388
Updated: 2025-11-26
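
The Parameters and Size figures above can be sanity-checked with a little arithmetic. The sketch below assumes the weights are stored at 2 bytes per parameter (for example bfloat16) and that the listed size uses binary units (GiB). Neither assumption is stated on this page, and `weight_footprint` is a hypothetical helper written for illustration, not part of any library.

```python
def weight_footprint(num_params: float, bytes_per_param: int = 2) -> tuple[float, float]:
    """Return (decimal GB, binary GiB) needed to store the weights alone.

    bytes_per_param=2 is an assumption (16-bit weights such as bfloat16);
    the listing does not state the storage dtype.
    """
    total_bytes = num_params * bytes_per_param
    return total_bytes / 1e9, total_bytes / 2**30


# 31.1B parameters, as listed under "Parameters"
gb, gib = weight_footprint(31.1e9)
print(f"{gb:.1f} GB (decimal), {gib:.1f} GiB (binary)")
```

Under these assumptions the estimate comes to about 62.2 GB in decimal units, or about 57.9 GiB, which is in line with the listed 58 GB. This covers the weights only; running the model also needs memory for activations and the key-value cache.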
