head-to-head
Qwen: Qwen3.5 Plus 2026-04-20 vs Google: Gemma 4 31B
Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-16.
| Qwen: Qwen3.5 Plus 2026-04-20 | Google: Gemma 4 31B | |
|---|---|---|
| Vendor | qwen | |
| Quality Score | 100 | 100 |
| Benchmark Score | - | 68.5 |
| Input Price | $0.30/M | $0.12/M |
| Output Price | $1.80/M | $0.35/M |
| Context Window | 1,000,000 | 262,144 |
| Max Output | 65,536 | 262,144 |
| Tool Calling | ✓ | ✓ |
| Structured Output | ✓ | ✓ |
| Reasoning Mode | ✓ | ✓ |
| Vision | ✓ | ✓ |
| Audio | - | - |
| Benchmark Scores | ||
| ai_index | - | 64.7 |
| ai_index_agentic | - | 67.6 |
| ai_index_coding | - | 63.9 |
| eqbench | - | 70.8 |
Who wins by task?
| Task | Qwen: Qwen3.5 Plus 2026-04-20 | Google: Gemma 4 31B |
|---|---|---|
| SQL Generation | 133 | 164 |
| Code Review | 132 | 161 |
| Code Completion | 131 | 132 |
| Code Refactoring | 136 | 157 |
| Bug Fixing | 136 | 173 |
| Unit Test Generation | 124 | 148 |
| Code Documentation | 131 | 141 |
| Regex Writing | 119 | 135 |
| CI/CD Pipelines | 120 | 140 |
| Frontend Component Design | 122 | 143 |
| Data Analysis | 124 | 163 |
| CSV / Spreadsheet Cleanup | 133 | 146 |
| ETL Scripting | 128 | 149 |
| JSON Extraction | 131 | 143 |
| Bulk Data Labeling | 129 | 133 |
| OCR / Document Parsing | 131 | 141 |
| Table Extraction from PDFs | 131 | 141 |
| Long-Document Summarization | 137 | 155 |
| Short-Form Summarization | 123 | 131 |
| Blog Post Writing | 121 | 138 |
Scores reflect capability match + benchmark data + pricing for each task. Methodology →
Related comparisons
MoonshotAI: Kimi K2.7 Code vs Qwen: Qwen3.5 Plus 2026-04-20
MoonshotAI: Kimi K2.7 Code vs Google: Gemma 4 31B
Qwen: Qwen3.7 Plus vs Qwen: Qwen3.5 Plus 2026-04-20
Qwen: Qwen3.7 Plus vs Google: Gemma 4 31B
MiniMax: MiniMax M3 vs Qwen: Qwen3.5 Plus 2026-04-20
MiniMax: MiniMax M3 vs Google: Gemma 4 31B
StepFun: Step 3.7 Flash vs Qwen: Qwen3.5 Plus 2026-04-20
StepFun: Step 3.7 Flash vs Google: Gemma 4 31B