ibm-granite

IBM: Granite 4.1 8B

Granite 4.1 8B is a text-in, text-out model from IBM with a 131,072-token context window and full tool-use support. It does not support reasoning mode, and structured output support is unconfirmed. The long context ceiling makes it technically capable of processing large documents or extended conversations in a single pass, and tool calling opens it to agentic workflow integration. At $0.05 per million input tokens and $0.10 per million output tokens, this is one of the cheaper options on the market, which is its clearest argument. However, its blended benchmark score of 13.1 across only three benchmarks gives buyers limited evidence to work from; performance on coding tasks scores 12.0 and agentic tasks 17.6, but coverage is too thin to draw confident conclusions. Teams with strict budget constraints and low-stakes or internal workloads may find the price worth a trial, but anyone selecting a model for demanding production use should wait for broader benchmark data.

Quality Score
86/100
price + capability + benchmarks
Input Price
$0.05
per 1M tokens
Output Price
$0.10
per 1M tokens
Context Window
131,072
tokens
Model ID
ibm-granite/granite-4.1-8b
Vendor
ibm-granite
Tokenizer
Other
Input Modalities
text
Output Modalities
text
Max Output
131,072 tokens
Tool Calling
✓ supported
Structured Output
✓ supported
Reasoning Mode
not supported
Vision
text only
Audio
no
Moderated
no

Similar models