Qwen2.5 72B Instruct
Qwen2.5 72B Instruct is a text-only model from Qwen with a 131,072-token context window and a 16,384-token output ceiling. It supports tool use, which makes it usable in agentic or function-calling workflows. It does not support reasoning mode, and structured output support is unconfirmed, so workflows that depend on guaranteed JSON schemas should be tested carefully before committing. At $0.36 per million input tokens and $0.40 per million output tokens, the pricing sits in the budget-to-mid range. Its blended benchmark score of 31.1 is drawn from only one benchmark, so the performance picture is thin; treat that number as a starting signal rather than a settled verdict. Teams looking for a tool-capable, long-context model at a modest cost may find it worth trialing, but the sparse benchmark coverage means head-to-head comparisons with better-tested alternatives should include your own task-specific evaluation.
- Model ID
- qwen/qwen-2.5-72b-instruct
- Vendor
- qwen
- Tokenizer
- Qwen
- Input Modalities
- text
- Output Modalities
- text
- Max Output
- 16,384 tokens
- Tool Calling
- ✓ supported
- Structured Output
- ✓ supported
- Reasoning Mode
- not supported
- Vision
- text only
- Audio
- no
- Moderated
- no