Google: Nano Banana Pro (Gemini 3 Pro Image)
Nano Banana Pro is a multimodal model from Google that accepts text and image inputs and can return both text and images. It produces up to 32,768 tokens per response within a 65,536-token context window. The spec sheet below lists tool calling, structured output, and a reasoning mode as supported, which makes the model suitable for agentic workflows and multi-step tasks. At $2.00 per million input tokens and $12.00 per million output tokens, it sits in the mid-to-higher price tier for output costs, so the economics favor use cases where reasoning and image understanding justify the spend. The catch for comparison shoppers is that there is currently no independent benchmark coverage, so performance claims are unverified. Buyers who need a proven track record before committing budget should treat this as an unproven option and wait for third-party evaluations before deploying it in production.
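To make the pricing concrete, here is a minimal sketch of a per-request cost estimate using the list prices and token limits quoted above. The function name `estimate_cost_usd` is hypothetical, and the sketch assumes that the 65,536-token context window covers input and output together and that every output token (including any image output) is billed at the flat $12.00 rate; neither assumption is confirmed by this page.

```python
# Figures taken from the listing above.
INPUT_USD_PER_MILLION = 2.00
OUTPUT_USD_PER_MILLION = 12.00
CONTEXT_WINDOW_TOKENS = 65_536
MAX_OUTPUT_TOKENS = 32_768


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in US dollars (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    if output_tokens > MAX_OUTPUT_TOKENS:
        raise ValueError("output exceeds the 32,768-token response limit")
    # Assumption: the context window is shared by input and output tokens.
    if input_tokens + output_tokens > CONTEXT_WINDOW_TOKENS:
        raise ValueError("request exceeds the 65,536-token context window")
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


if __name__ == "__main__":
    # 10,000 input tokens and 2,000 output tokens.
    print(f"${estimate_cost_usd(10_000, 2_000):.4f}")
```

In this example the output side accounts for more than half of the total even though it uses one fifth as many tokens, which is why the output rate dominates the economics for long responses.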
- Model ID: google/gemini-3-pro-image
- Vendor: Google
- Tokenizer: Gemini
- Input Modalities: image, text
- Output Modalities: image, text
- Max Output: 32,768 tokens
- Tool Calling: ✓ supported
- Structured Output: ✓ supported
- Reasoning Mode: ✓ supported
- Vision: ✓ accepts images
- Audio: no
- Moderated: no
Category rankings
Where Google: Nano Banana Pro (Gemini 3 Pro Image) places in the one category it ranks in.
| # | Category | Score |
|---|---|---|
| #6 of 8 | Image Generation (Vision) | 96 |