Qwen: Qwen3 8B
Qwen3 8B is a text-only model from Qwen with a 131K-token context window and a maximum of 8,192 output tokens per response. It supports tool use and reasoning, which makes it viable for multi-step and agentic workflows. The specification list marks structured output as supported, but that support is unconfirmed from available data, so developers who depend on it should test before committing. At $0.05 per million input tokens and $0.40 per million output tokens, the model sits at the low end of the pricing spectrum and is worth considering for high-volume or cost-sensitive deployments. Its blended score of 18.2 across four benchmarks is modest, with particular weakness in coding (11.7) and relative strength in agentic tasks (19.1). Teams running lightweight agentic pipelines on a tight budget have the clearest reason to shortlist it; those needing strong coding assistance or validated structured output should weigh those gaps carefully.
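To make the pricing concrete, here is a minimal sketch that turns the two listed per-token prices into a per-request cost. The function name and the workload numbers (2,000 input tokens, 500 output tokens, one million requests) are hypothetical and chosen only for illustration; actual billing may include factors not described here.

```python
# Prices quoted in the description, in USD per million tokens.
INPUT_PRICE_PER_M = 0.05
OUTPUT_PRICE_PER_M = 0.40


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request at the listed prices."""
    weighted = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return weighted / 1_000_000


# Hypothetical workload: 2,000 input and 500 output tokens per request.
per_request = estimate_cost_usd(2_000, 500)
per_million_requests = per_request * 1_000_000
print(f"per request: ${per_request:.6f}")
print(f"per million requests: ${per_million_requests:,.2f}")
```

Under these assumed numbers, output tokens account for two thirds of the cost despite being a fifth of the volume, because the output price is eight times the input price.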
- Model ID: qwen/qwen3-8b
- Vendor: qwen
- Tokenizer: Qwen3
- Input Modalities: text
- Output Modalities: text
- Max Output: 8,192 tokens
- Tool Calling: ✓ supported
- Structured Output: ✓ supported
- Reasoning Mode: ✓ supported
- Vision: text only
- Audio: no
- Moderated: no
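The output cap and context window together bound how many tokens a single request can ask for. The sketch below clamps a requested output budget against both limits. It rests on two assumptions not stated in the listing: that "131K" means 131,072 tokens, and that prompt and output tokens share the same context window. The function name is hypothetical.

```python
# Assumption: "131K" is read as 131,072 tokens; the listing gives no exact figure.
CONTEXT_WINDOW = 131_072
# From the specification list.
MAX_OUTPUT_TOKENS = 8_192


def clamp_output_budget(prompt_tokens: int, requested_output: int) -> int:
    """Return the largest output budget that respects both limits.

    Assumes prompt and output tokens draw on the same context window.
    Raises ValueError if the prompt leaves no room for output.
    """
    if prompt_tokens < 0 or requested_output < 0:
        raise ValueError("token counts must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    if remaining <= 0:
        raise ValueError("prompt leaves no room for output")
    return min(requested_output, MAX_OUTPUT_TOKENS, remaining)


print(clamp_output_budget(1_000, 20_000))   # capped by the output limit
print(clamp_output_budget(130_000, 4_000))  # capped by the remaining context
```

A check like this is best run with token counts from the model's own tokenizer, since counts from other tokenizers may differ.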