head-to-head

Google: Gemma 4 31B vs Qwen: Qwen3.5-9B

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-20.

Who wins by task?

Task	Google: Gemma 4 31B	Qwen: Qwen3.5-9B
SQL Generation	157	144
Code Review	154	139
Code Completion	132	130
Code Refactoring	152	138
Bug Fixing	161	144
Unit Test Generation	143	132
Code Documentation	138	131
Regex Writing	131	126
CI/CD Pipelines	136	126
Frontend Component Design	138	131
Data Analysis	152	138
CSV / Spreadsheet Cleanup	146	137
ETL Scripting	144	132
JSON Extraction	143	139
Bulk Data Labeling	133	132
OCR / Document Parsing	140	135
Table Extraction from PDFs	140	135
Long-Document Summarization	150	138
Short-Form Summarization	129	127
Blog Post Writing	134	126

Scores reflect capability match + benchmark data + pricing for each task. Methodology →