qwen

Qwen: Qwen3 8B

Qwen3 8B is a text-only model from Qwen with a 131K-token context window and a maximum of 8,192 output tokens per response. It supports tool use and reasoning, which makes it viable for multi-step and agentic workflows. Structured output support is unconfirmed from available data, so developers who depend on it should test before committing. At $0.05 per million input tokens and $0.40 per million output tokens, this model sits at the low end of the pricing spectrum, making it worth considering for high-volume or cost-sensitive deployments. Its blended benchmark score of 18.2 across four benchmarks is modest, with particular weakness in coding (11.7) and relative strength in agentic tasks (19.1). Teams running lightweight agentic pipelines on a tight budget have the clearest reason to shortlist it, but those needing strong coding assistance or validated structured output should weigh those gaps carefully.

Quality Score
91/100
price + capability + benchmarks
Input Price
$0.05
per 1M tokens
Output Price
$0.40
per 1M tokens
Context Window
131,072
tokens
Model ID
qwen/qwen3-8b
Vendor
qwen
Tokenizer
Qwen3
Input Modalities
text
Output Modalities
text
Max Output
8,192 tokens
Tool Calling
✓ supported
Structured Output
✓ supported
Reasoning Mode
✓ supported
Vision
text only
Audio
no
Moderated
no

Similar models