inclusionai

inclusionAI: Ling-2.6-flash

Ling-2.6-flash is a text-in, text-out model from inclusionAI with a 262,144-token context window and a 32,768-token output ceiling. It supports tool use, which makes it usable for function-calling workflows, but it does not offer native reasoning or structured output. The large context window suits document-heavy tasks where fitting long inputs in a single pass matters. At $0.01 per million input tokens and $0.03 per million output tokens, it sits at the very low end of the pricing spectrum, making cost the clearest reason to shortlist it. Its blended benchmark score of 44.8 across only three benchmarks is modest, and the narrow coverage means general performance is not well established. The agentic subscore of 62.8 is its strongest result, so teams running tool-calling or agentic pipelines on a tight budget have the most concrete reason to consider inclusionAI: Ling-2.6-flash, while teams prioritizing verified general capability should weigh the limited benchmark evidence carefully.

Quality Score
95/100
price + capability + benchmarks
Input Price
$0.01
per 1M tokens
Output Price
$0.03
per 1M tokens
Context Window
262,144
tokens
Model ID
inclusionai/ling-2.6-flash
Vendor
inclusionai
Tokenizer
Other
Input Modalities
text
Output Modalities
text
Max Output
32,768 tokens
Tool Calling
✓ supported
Structured Output
✓ supported
Reasoning Mode
not supported
Vision
text only
Audio
no
Moderated
no

Similar models