head-to-head

Google: Gemma 4 31B vs Anthropic: Claude Sonnet 4.6

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-05-12.

Who wins by task?

Task	Google: Gemma 4 31B	Anthropic: Claude Sonnet 4.6
SQL Generation	131	181
Code Review	126	177
Code Completion	129	118
Code Refactoring	127	172
Bug Fixing	130	194
Unit Test Generation	121	163
Code Documentation	126	144
Regex Writing	119	139
CI/CD Pipelines	117	152
Frontend Component Design	122	153
Data Analysis	124	184
CSV / Spreadsheet Cleanup	128	158
ETL Scripting	122	162
JSON Extraction	131	141
Bulk Data Labeling	129	123
OCR / Document Parsing	128	150
Table Extraction from PDFs	128	150
Long-Document Summarization	129	166
Short-Form Summarization	123	123
Blog Post Writing	119	145

Scores reflect capability match + benchmark data + pricing for each task. Methodology →