How much does IBM: Granite 4.1 8B cost?

$0.05 per million input tokens and $0.10 per million output tokens.

What is IBM: Granite 4.1 8B's context window?

131,072 tokens, roughly 196 pages of text in a single request.

Does IBM: Granite 4.1 8B support tool calling?

It supports tool calling, structured output.

Can IBM: Granite 4.1 8B process images?

No, it is text-only on the input side.

ibm-granite

IBM: Granite 4.1 8B

The IBM Granite 4.1 8B model is designed for handling text-based tasks with a context length of up to 131,072 tokens and can generate completions up to the same limit. It supports tools integration but lacks reasoning capabilities and structured output options. This model should be considered by users requiring extensive input handling and tool usage but who do not prioritize advanced reasoning or structured outputs. With a blended benchmark score of 10.4 across two independent tests, it performs moderately well; however, the pricing at $0.05 per million input tokens and $0.1 per million output tokens may make it less attractive for cost-sensitive applications, especially given its lower ranking in reasoning benchmarks.

Query via API → View on ibm-granite → Estimate cost

Quality Score

86/100

price + capability + benchmarks

Input Price

$0.05

per 1M tokens

Output Price

$0.10

per 1M tokens

Context Window

131,072

tokens

Benchmark results

Independent, published benchmarks. Blended score 10.4 across 2 benchmarks, last refreshed 2026-07-31. How scoring works →

Benchmark	Measures	Score
AI Index	broad capability composite	11.0
AI Index Coding	software engineering tasks	15.7

Model ID: ibm-granite/granite-4.1-8b
Vendor: ibm-granite
Released: April 2026
Tokenizer: Other
Input Modalities: text
Output Modalities: text
Max Output: 131,072 tokens
Tool Calling: ✓ supported
Structured Output: ✓ supported
Reasoning Mode: not supported
Vision: text only
Audio: no
Moderated: no

What it costs in practice

Computed from the current $0.05/M input and $0.10/M output rates. Run your own numbers →

Job	Tokens	Cost
Summarize a 50-page report	30k in / 1.5k out	under $0.01
Classify 1,000 customer emails	500k in / 50k out	$0.03
A month of a busy support chatbot	5M in / 2M out	$0.45

Similar models

ibm-granite

IBM: Granite 4.0 Micro

$0.02 in / $0.11 out

131,000 ctx

76

Quick answers

How much does IBM: Granite 4.1 8B cost?: $0.05 per million input tokens and $0.10 per million output tokens.
What is IBM: Granite 4.1 8B's context window?: 131,072 tokens, roughly 196 pages of text in a single request.
Does IBM: Granite 4.1 8B support tool calling?: It supports tool calling, structured output.
Can IBM: Granite 4.1 8B process images?: No, it is text-only on the input side.