head-to-head

Anthropic: Claude Sonnet 5 vs Google: Gemma 4 31B

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-07-01.

                    Anthropic: Claude Sonnet 5    Google: Gemma 4 31B
Vendor              anthropic                     google
Quality Score       100                           100
Benchmark Score     -                             57.2
Input Price         $2.00/M                       $0.12/M
Output Price        $10.00/M                      $0.35/M
Context Window      1,000,000                     262,144
Max Output          128,000                       262,144
Tool Calling
Structured Output
Reasoning Mode
Vision
Audio               -                             -

Benchmark Scores
ai_index            -                             48.4
ai_index_agentic    -                             23.8
ai_index_coding     -                             71.7
eqbench             -                             70.8
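
To make the price gap concrete, here is a minimal sketch that turns the listed per-million-token prices into a per-request cost. The prices come from the table above; the function name `request_cost` and the 10,000-in / 2,000-out workload are assumptions chosen for illustration, not figures from this page.

```python
# Prices in USD per million tokens, copied from the comparison table.
PRICES = {
    "Claude Sonnet 5": {"input": 2.00, "output": 10.00},
    "Gemma 4 31B": {"input": 0.12, "output": 0.35},
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request at the listed per-million-token prices."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 10,000 input tokens and 2,000 output tokens.
    for model in PRICES:
        print(f"{model}: ${request_cost(model, 10_000, 2_000):.4f}")
```

Actual bills may differ if a provider applies caching discounts, batch pricing, or tiered rates, none of which are listed here.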

Who wins by task?

Task                           Anthropic: Claude Sonnet 5    Google: Gemma 4 31B
SQL Generation                 132                           157
Code Review                    132                           154
Code Completion                117                           132
Code Refactoring               136                           152
Bug Fixing                     136                           161
Unit Test Generation           124                           143
Code Documentation             129                           138
Regex Writing                  117                           131
CI/CD Pipelines                120                           136
Frontend Component Design      122                           138
Data Analysis                  124                           152
CSV / Spreadsheet Cleanup      132                           146
ETL Scripting                  128                           144
JSON Extraction                121                           143
Bulk Data Labeling             118                           133
OCR / Document Parsing         131                           140
Table Extraction from PDFs     131                           140
Long-Document Summarization    136                           150
Short-Form Summarization       113                           129
Blog Post Writing              120                           134
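
As a rough way to read the task table, the sketch below computes the per-task gap between the two models and sorts tasks by that gap. The scores are copied from the table; the helper name `margins` is hypothetical. Because the page says these scores blend capability match, benchmark data, and pricing, a larger gap should not be read as a pure capability difference.

```python
# (task, Claude Sonnet 5 score, Gemma 4 31B score), copied from the task table.
SCORES = [
    ("SQL Generation", 132, 157),
    ("Code Review", 132, 154),
    ("Code Completion", 117, 132),
    ("Code Refactoring", 136, 152),
    ("Bug Fixing", 136, 161),
    ("Unit Test Generation", 124, 143),
    ("Code Documentation", 129, 138),
    ("Regex Writing", 117, 131),
    ("CI/CD Pipelines", 120, 136),
    ("Frontend Component Design", 122, 138),
    ("Data Analysis", 124, 152),
    ("CSV / Spreadsheet Cleanup", 132, 146),
    ("ETL Scripting", 128, 144),
    ("JSON Extraction", 121, 143),
    ("Bulk Data Labeling", 118, 133),
    ("OCR / Document Parsing", 131, 140),
    ("Table Extraction from PDFs", 131, 140),
    ("Long-Document Summarization", 136, 150),
    ("Short-Form Summarization", 113, 129),
    ("Blog Post Writing", 120, 134),
]


def margins(rows):
    """Return (task, gemma_score - claude_score) pairs, widest gap first."""
    gaps = [(task, gemma - claude) for task, claude, gemma in rows]
    return sorted(gaps, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    for task, gap in margins(SCORES):
        print(f"{task}: {gap:+d}")
```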

Scores reflect capability match, benchmark data, and pricing for each task.

Related comparisons

Anthropic: Claude Sonnet 5 vs MoonshotAI: Kimi K2.7 Code
Anthropic: Claude Sonnet 5 vs Qwen: Qwen3.7 Plus
Anthropic: Claude Sonnet 5 vs MiniMax: MiniMax M3
Anthropic: Claude Sonnet 5 vs StepFun: Step 3.7 Flash
Anthropic: Claude Sonnet 5 vs xAI: Grok Build 0.1
Anthropic: Claude Sonnet 5 vs Google: Gemini 3.5 Flash
Anthropic: Claude Sonnet 5 vs Google: Gemini 3.1 Flash Lite
Anthropic: Claude Sonnet 5 vs xAI: Grok 4.3