What is Qwen: Qwen3.5-Flash's context window?

1,000,000 tokens, roughly 1,500 pages of text in a single request.

Does Qwen: Qwen3.5-Flash support tool calling?

It supports tool calling, structured output, a reasoning mode.

qwen

Qwen: Qwen3.5-Flash

Qwen3.5-Flash is a multimodal model from Qwen that accepts text, image, and video inputs, making it applicable to tasks that involve mixed media content. It supports a context window of up to one million tokens, tool use, and reasoning, which positions it for agentic workflows and long-document tasks. Structured output support is unconfirmed. Maximum output is capped at 65,536 tokens per response. At $0.065 per million input tokens and $0.26 per million output tokens, this model sits at the budget end of the multimodal market, which is its clearest selling point. However, it carries zero independent benchmark coverage, so there is no external evidence to validate its reasoning or task performance claims. Buyers who prioritize low cost and need video input support may find it worth testing, but teams requiring verified quality baselines before committing should treat Qwen3.5-Flash as unproven until coverage appears.

Query via API → View on qwen → Estimate cost

Quality Score

100/100

price + capability + benchmarks

Input Price

$0.07

per 1M tokens

Output Price

$0.26

per 1M tokens

Context Window

1,000,000

tokens

Model ID: qwen/qwen3.5-flash-02-23
Vendor: qwen
Released: February 2026
Tokenizer: Qwen3
Input Modalities: text, image, video
Output Modalities: text
Max Output: 65,536 tokens
Tool Calling: ✓ supported
Structured Output: ✓ supported
Reasoning Mode: ✓ supported
Vision: ✓ accepts images
Audio: no
Moderated: no

What it costs in practice

Computed from the current $0.07/M input and $0.26/M output rates. Run your own numbers →

Job	Tokens	Cost
Summarize a 50-page report	30k in / 1.5k out	under $0.01
Classify 1,000 customer emails	500k in / 50k out	$0.05
A month of a busy support chatbot	5M in / 2M out	$0.84

Category rankings

Where Qwen: Qwen3.5-Flash places across the 5 categories it ranks in. How we rank →

#	Category	Score
#10	Self-Hosted / LocalCost · of 25 ranked	117
#11	Real-Time ChatLatency · of 25 ranked	118
#13	Cheap Bulk InferenceCost · of 25 ranked	137
#14	Social Media PostsWriting · of 25 ranked	119
#14	Voice Assistant BackendVoice · of 25 ranked	123

Similar models

qwen

Quick answers

How much does Qwen: Qwen3.5-Flash cost?: $0.07 per million input tokens and $0.26 per million output tokens.
What is Qwen: Qwen3.5-Flash's context window?: 1,000,000 tokens, roughly 1,500 pages of text in a single request.
Does Qwen: Qwen3.5-Flash support tool calling?: It supports tool calling, structured output, a reasoning mode.
Can Qwen: Qwen3.5-Flash process images?: Yes, it accepts image input.

Qwen: Qwen3.5-Flash

What it costs in practice

Category rankings

Similar models

Qwen: Qwen3.7 Flash

Qwen: Qwen3.7 Plus

Qwen: Qwen3.5 Plus 2026-04-20

Qwen: Qwen3.6 Flash

Qwen: Qwen3.6 35B A3B

Qwen: Qwen3.6 27B

Quick answers