head-to-head

Anthropic: Claude Sonnet 5 vs Google: Gemma 4 31B

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-07-01.

Who wins by task?

Task	Anthropic: Claude Sonnet 5	Google: Gemma 4 31B
SQL Generation	132	157
Code Review	132	154
Code Completion	117	132
Code Refactoring	136	152
Bug Fixing	136	161
Unit Test Generation	124	143
Code Documentation	129	138
Regex Writing	117	131
CI/CD Pipelines	120	136
Frontend Component Design	122	138
Data Analysis	124	152
CSV / Spreadsheet Cleanup	132	146
ETL Scripting	128	144
JSON Extraction	121	143
Bulk Data Labeling	118	133
OCR / Document Parsing	131	140
Table Extraction from PDFs	131	140
Long-Document Summarization	136	150
Short-Form Summarization	113	129
Blog Post Writing	120	134

Scores reflect capability match + benchmark data + pricing for each task. Methodology →