google

Google: Nano Banana Pro (Gemini 3 Pro Image)

Google's Nano Banana Pro is a multimodal model from Google that accepts both text and image inputs and returns up to 32,768 tokens per response within a 65,536-token context window. It supports tool use and reasoning, which makes it suitable for agentic workflows and multi-step tasks. Structured output support is unconfirmed at this time. At $2.00 per million input tokens and $12.00 per million output tokens, this model sits at a mid-to-higher price tier for output costs, so the economics favor use cases where reasoning and image understanding justify the spend. The catch for comparison shoppers is that there is currently no independent benchmark coverage, meaning performance claims are unverified. Buyers who need a proven track record before committing budget should treat this as an unproven option and wait for third-party evaluations before deploying it in production.

Quality Score
81/100
price + capability + benchmarks
Input Price
$2.00
per 1M tokens
Output Price
$12.00
per 1M tokens
Context Window
65,536
tokens
Model ID
google/gemini-3-pro-image
Vendor
google
Tokenizer
Gemini
Input Modalities
image, text
Output Modalities
image, text
Max Output
32,768 tokens
Tool Calling
✓ supported
Structured Output
✓ supported
Reasoning Mode
✓ supported
Vision
✓ accepts images
Audio
no
Moderated
no

Category rankings

Where Google: Nano Banana Pro (Gemini 3 Pro Image) places across the 1 category it ranks in. How we rank →

#CategoryScore
#6 Image GenerationVision · of 8 ranked 96

Similar models