inclusionAI: Ling-2.6-flash
Ling-2.6-flash is a text-in, text-out model from inclusionAI with a 262,144-token context window and a 32,768-token output ceiling. It supports tool use, which makes it usable for function-calling workflows, but it does not offer native reasoning or structured output. The large context window suits document-heavy tasks where fitting long inputs in a single pass matters. At $0.01 per million input tokens and $0.03 per million output tokens, it sits at the very low end of the pricing spectrum, making cost the clearest reason to shortlist it. Its blended benchmark score of 44.8 across only three benchmarks is modest, and the narrow coverage means general performance is not well established. The agentic subscore of 62.8 is its strongest result, so teams running tool-calling or agentic pipelines on a tight budget have the most concrete reason to consider inclusionAI: Ling-2.6-flash, while teams prioritizing verified general capability should weigh the limited benchmark evidence carefully.
- Model ID
- inclusionai/ling-2.6-flash
- Vendor
- inclusionai
- Tokenizer
- Other
- Input Modalities
- text
- Output Modalities
- text
- Max Output
- 32,768 tokens
- Tool Calling
- ✓ supported
- Structured Output
- ✓ supported
- Reasoning Mode
- not supported
- Vision
- text only
- Audio
- no
- Moderated
- no