google
Google: Gemma 4 31B (free)
Gemma 4 31B Instruct is Google DeepMind's 30.7B dense multimodal model supporting text and image input with text output. Features a 256K token context window, configurable thinking/reasoning mode, native function...
Quality Score
100/100
price + capability + benchmarks
Input Price
Free
per 1M tokens
Output Price
Free
per 1M tokens
Context Window
262,144
tokens
- Model ID
- google/gemma-4-31b-it:free
- Vendor
- Tokenizer
- Gemma
- Input Modalities
- image, text, video
- Output Modalities
- text
- Max Output
- 32,768 tokens
- Tool Calling
- ✓ supported
- Structured Output
- ✓ supported
- Reasoning Mode
- ✓ supported
- Vision
- ✓ accepts images
- Audio
- no
- Moderated
- no
Strong choice for
Writing
Short-Form Summarization
TL;DRs of articles and emails at scale.
Writing
Social Media Posts
Tweets, LinkedIn posts, captions in the right voice.
Voice
Voice Assistant Backend
Real-time voice agent backbones.
Personal
Chat Companion
General-purpose conversation.
Cost
Cheap Bulk Inference
Lowest cost-per-million for high-volume jobs.
Cost
Self-Hosted / Local
Open-weights models you can run yourself.
Similar models
google
Google: Gemini 3.5 Flash
$1.50 in / $9.00 out
1,048,576 ctx
100
google
Google: Gemini 3.1 Flash Lite
$0.25 in / $1.50 out
1,048,576 ctx
100
google
Google: Gemma 4 26B A4B (free)
Free
262,144 ctx
100
google
Google: Gemma 4 26B A4B
$0.06 in / $0.33 out
262,144 ctx
100
google
Google: Gemma 4 31B
$0.12 in / $0.36 out
262,144 ctx
100
google
Google: Gemini 3.1 Flash Lite Preview
$0.25 in / $1.50 out
1,048,576 ctx
100