qwen
Qwen: Qwen3 VL 30B A3B Instruct
Qwen3-VL-30B-A3B-Instruct is a multimodal model that unifies strong text generation with visual understanding for images and videos. Its Instruct variant optimizes instruction-following for general multimodal tasks. It excels in perception...
Quality Score
99/100
price + capability + benchmarks
Input Price
$0.13
per 1M tokens
Output Price
$0.52
per 1M tokens
Context Window
262,144
tokens
- Model ID
- qwen/qwen3-vl-30b-a3b-instruct
- Vendor
- qwen
- Tokenizer
- Qwen3
- Input Modalities
- text, image
- Output Modalities
- text
- Max Output
- 32,768 tokens
- Tool Calling
- ✓ supported
- Structured Output
- ✓ supported
- Reasoning Mode
- not supported
- Vision
- ✓ accepts images
- Audio
- no
- Moderated
- no
Similar models
qwen
Qwen: Qwen3 VL 32B Instruct
$0.10 in / $0.42 out
262,144 ctx
99
qwen
Qwen: Qwen3 VL 8B Instruct
$0.08 in / $0.50 out
256,000 ctx
99
qwen
Qwen: Qwen3 Next 80B A3B Thinking
$0.10 in / $0.78 out
262,144 ctx
99
qwen
Qwen: Qwen3 235B A22B Thinking 2507
$0.10 in / $0.10 out
262,144 ctx
99
qwen
Qwen: Qwen3 VL 235B A22B Instruct
$0.20 in / $0.88 out
262,144 ctx
99
qwen
Qwen: Qwen Plus 0728 (thinking)
$0.26 in / $0.78 out
1,000,000 ctx
99