qwen

Qwen: Qwen3 VL 30B A3B Instruct

Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It excels in perception...

Quality Score
99/100
price + capability + benchmarks
Input Price
$0.13
per 1M tokens
Output Price
$0.52
per 1M tokens
Context Window
262,144
tokens
Model ID
qwen/qwen3-vl-30b-a3b-instruct
Vendor
qwen
Tokenizer
Qwen3
Input Modalities
text, image
Output Modalities
text
Max Output
32,768 tokens
Tool Calling
✓ supported
Structured Output
✓ supported
Reasoning Mode
not supported
Vision
✓ accepts images
Audio
no
Moderated
no

Similar models