
xAI: Grok 4.3 vs Google: Gemma 4 31B

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-16.

Spec               xAI: Grok 4.3   Google: Gemma 4 31B
Vendor             x-ai            google
Quality Score      100             100
Benchmark Score    84.8            68.5
Input Price        $1.25/M         $0.12/M
Output Price       $2.50/M         $0.35/M
Context Window     1,000,000       262,144
Max Output         -               262,144
Tool Calling
Structured Output
Reasoning Mode
Vision
Audio              -               -
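
To make the price gap concrete, here is a minimal sketch that estimates the cost of one workload from the listed input and output prices. It assumes "/M" means per million tokens; the function name `estimate_cost` and the sample workload (2,000,000 input tokens, 500,000 output tokens) are hypothetical and chosen only for illustration.

```python
# Prices copied from the spec table; "/M" is ASSUMED to mean "per million tokens".
PRICES_PER_MILLION = {
    "xAI: Grok 4.3": {"input": 1.25, "output": 2.50},
    "Google: Gemma 4 31B": {"input": 0.12, "output": 0.35},
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in dollars for one workload (hypothetical helper)."""
    price = PRICES_PER_MILLION[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


# Illustrative workload, not taken from the source page.
for model_name in PRICES_PER_MILLION:
    cost = estimate_cost(model_name, 2_000_000, 500_000)
    print(f"{model_name}: ${cost:.3f}")
```

Under these assumptions the sample workload costs about $3.75 on Grok 4.3 and about $0.42 on Gemma 4 31B, roughly a ninefold difference.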

Benchmark Scores

Benchmark          xAI: Grok 4.3   Google: Gemma 4 31B
ai_index           87.8            64.7
ai_index_agentic   100.0           67.6
ai_index_coding    67.7            63.9
eqbench            -               70.8

Who wins by task?

Task                          xAI: Grok 4.3   Google: Gemma 4 31B
SQL Generation                169             164
Code Review                   166             161
Code Completion               121             132
Code Refactoring              163             157
Bug Fixing                    181             173
Unit Test Generation          152             148
Code Documentation            143             141
Regex Writing                 136             135
CI/CD Pipelines               143             140
Frontend Component Design     145             143
Data Analysis                 170             163
CSV / Spreadsheet Cleanup     150             146
ETL Scripting                 153             149
JSON Extraction               136             143
Bulk Data Labeling            125             133
OCR / Document Parsing        144             141
Table Extraction from PDFs    144             141
Long-Document Summarization   160             155
Short-Form Summarization      124             131
Blog Post Writing             140             138
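
The per-task scores can be tallied to see which model leads more often. The sketch below copies the scores from the task table as (Grok 4.3, Gemma 4 31B) pairs; the names `TASK_SCORES` and `tally_wins` are hypothetical, and the tally counts wins only, without weighting tasks by importance.

```python
# (xAI: Grok 4.3, Google: Gemma 4 31B) scores copied from the task table.
TASK_SCORES = {
    "SQL Generation": (169, 164),
    "Code Review": (166, 161),
    "Code Completion": (121, 132),
    "Code Refactoring": (163, 157),
    "Bug Fixing": (181, 173),
    "Unit Test Generation": (152, 148),
    "Code Documentation": (143, 141),
    "Regex Writing": (136, 135),
    "CI/CD Pipelines": (143, 140),
    "Frontend Component Design": (145, 143),
    "Data Analysis": (170, 163),
    "CSV / Spreadsheet Cleanup": (150, 146),
    "ETL Scripting": (153, 149),
    "JSON Extraction": (136, 143),
    "Bulk Data Labeling": (125, 133),
    "OCR / Document Parsing": (144, 141),
    "Table Extraction from PDFs": (144, 141),
    "Long-Document Summarization": (160, 155),
    "Short-Form Summarization": (124, 131),
    "Blog Post Writing": (140, 138),
}


def tally_wins(scores):
    """Group task names by which model has the higher score (hypothetical helper)."""
    wins = {"grok": [], "gemma": [], "tie": []}
    for task, (grok, gemma) in scores.items():
        if grok > gemma:
            wins["grok"].append(task)
        elif gemma > grok:
            wins["gemma"].append(task)
        else:
            wins["tie"].append(task)
    return wins


result = tally_wins(TASK_SCORES)
print(f"Grok 4.3 leads on {len(result['grok'])} tasks")
print(f"Gemma 4 31B leads on {len(result['gemma'])} tasks: {result['gemma']}")
```

On the listed scores, Grok 4.3 leads on 16 of the 20 tasks. Gemma 4 31B leads on Code Completion, JSON Extraction, Bulk Data Labeling, and Short-Form Summarization.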

Task scores reflect capability match, benchmark data, and pricing for each task.

Related comparisons

MoonshotAI: Kimi K2.7 Code vs xAI: Grok 4.3
MoonshotAI: Kimi K2.7 Code vs Google: Gemma 4 31B
Qwen: Qwen3.7 Plus vs xAI: Grok 4.3
Qwen: Qwen3.7 Plus vs Google: Gemma 4 31B
MiniMax: MiniMax M3 vs xAI: Grok 4.3
MiniMax: MiniMax M3 vs Google: Gemma 4 31B
StepFun: Step 3.7 Flash vs xAI: Grok 4.3
StepFun: Step 3.7 Flash vs Google: Gemma 4 31B