xAI: Grok 4.3 vs xAI: Grok 4.20

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-16.

Spec                 xAI: Grok 4.3    xAI: Grok 4.20
Vendor               x-ai             x-ai
Quality Score        100              100
Benchmark Score      84.8             74.7
Input Price          $1.25/M          $1.25/M
Output Price         $2.50/M          $2.50/M
Context Window       1,000,000        2,000,000
Max Output           -                -
Tool Calling
Structured Output
Reasoning Mode
Vision
Audio                -                -

Benchmark Scores

Benchmark            xAI: Grok 4.3    xAI: Grok 4.20
ai_index             87.8             81.4
ai_index_agentic     100.0            88.9
ai_index_coding      67.7             69.6
eqbench              -                55.8
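
The Input Price and Output Price rows can be turned into a rough per-request cost estimate. The sketch below assumes that "/M" means "per million tokens" and that both models bill input and output tokens separately at the listed rates; the function name and the token counts in the usage line are hypothetical and chosen only for illustration.

```python
from decimal import Decimal

# Prices copied from the comparison table; both models list the same rates.
# Assumption: "/M" means USD per one million tokens.
INPUT_PRICE_PER_M = Decimal("1.25")
OUTPUT_PRICE_PER_M = Decimal("2.50")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_request_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost of one request (hypothetical helper)."""
    input_cost = Decimal(input_tokens) * INPUT_PRICE_PER_M
    output_cost = Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    return (input_cost + output_cost) / TOKENS_PER_M


# Hypothetical request: 200,000 input tokens and 4,000 output tokens.
cost = estimate_request_cost(200_000, 4_000)
print(f"${cost:.2f}")
```

Because the listed prices are identical, this estimate is the same for either model; on price alone the two differ only in how much context a single request can carry.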

Who wins by task?

Task                           xAI: Grok 4.3    xAI: Grok 4.20
SQL Generation                 169              171
Code Review                    166              170
Code Completion                121              122
Code Refactoring               163              168
Bug Fixing                     181              185
Unit Test Generation           152              154
Code Documentation             143              147
Regex Writing                  136              137
CI/CD Pipelines                143              146
Frontend Component Design      145              146
Data Analysis                  170              170
CSV / Spreadsheet Cleanup      150              153
ETL Scripting                  153              157
Bulk Data Labeling             125              125
OCR / Document Parsing         144              145
Table Extraction from PDFs     144              145
Long-Document Summarization    160              166
Short-Form Summarization       124              124
Blog Post Writing              140              143

Each task score combines capability match, benchmark data, and pricing.
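
The task table does not state an overall winner, so one way to read it is to count how many tasks each model leads. The sketch below copies the scores from the table and assumes that a higher score is better and that equal scores count as a tie; the names `TASK_SCORES`, `tally_wins`, and `largest_gap` are hypothetical.

```python
# (task, Grok 4.3 score, Grok 4.20 score), copied from the task table.
TASK_SCORES = [
    ("SQL Generation", 169, 171),
    ("Code Review", 166, 170),
    ("Code Completion", 121, 122),
    ("Code Refactoring", 163, 168),
    ("Bug Fixing", 181, 185),
    ("Unit Test Generation", 152, 154),
    ("Code Documentation", 143, 147),
    ("Regex Writing", 136, 137),
    ("CI/CD Pipelines", 143, 146),
    ("Frontend Component Design", 145, 146),
    ("Data Analysis", 170, 170),
    ("CSV / Spreadsheet Cleanup", 150, 153),
    ("ETL Scripting", 153, 157),
    ("Bulk Data Labeling", 125, 125),
    ("OCR / Document Parsing", 144, 145),
    ("Table Extraction from PDFs", 144, 145),
    ("Long-Document Summarization", 160, 166),
    ("Short-Form Summarization", 124, 124),
    ("Blog Post Writing", 140, 143),
]


def tally_wins(rows):
    """Count tasks led by each model; assumes higher score is better."""
    counts = {"xAI: Grok 4.3": 0, "xAI: Grok 4.20": 0, "tie": 0}
    for _task, score_43, score_420 in rows:
        if score_43 > score_420:
            counts["xAI: Grok 4.3"] += 1
        elif score_420 > score_43:
            counts["xAI: Grok 4.20"] += 1
        else:
            counts["tie"] += 1
    return counts


def largest_gap(rows):
    """Return (task, absolute score difference) for the widest margin."""
    task, a, b = max(rows, key=lambda row: abs(row[1] - row[2]))
    return task, abs(a - b)


print(tally_wins(TASK_SCORES))
print(largest_gap(TASK_SCORES))
```

Under these assumptions, Grok 4.20 leads 16 of the 19 listed tasks and the remaining 3 are ties. The margins are small: the widest gap is 6 points, on Long-Document Summarization.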

Related comparisons

MoonshotAI: Kimi K2.7 Code vs xAI: Grok 4.3
MoonshotAI: Kimi K2.7 Code vs xAI: Grok 4.20
Qwen: Qwen3.7 Plus vs xAI: Grok 4.3
Qwen: Qwen3.7 Plus vs xAI: Grok 4.20
MiniMax: MiniMax M3 vs xAI: Grok 4.3
MiniMax: MiniMax M3 vs xAI: Grok 4.20
StepFun: Step 3.7 Flash vs xAI: Grok 4.3
StepFun: Step 3.7 Flash vs xAI: Grok 4.20