head-to-head

xAI: Grok 4.3 vs xAI: Grok 4.20

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-16.

Who wins by task?

Task	xAI: Grok 4.3	xAI: Grok 4.20
SQL Generation	169	171
Code Review	166	170
Code Completion	121	122
Code Refactoring	163	168
Bug Fixing	181	185
Unit Test Generation	152	154
Code Documentation	143	147
Regex Writing	136	137
CI/CD Pipelines	143	146
Frontend Component Design	145	146
Data Analysis	170	170
CSV / Spreadsheet Cleanup	150	153
ETL Scripting	153	157
Bulk Data Labeling	125	125
OCR / Document Parsing	144	145
Table Extraction from PDFs	144	145
Long-Document Summarization	160	166
Short-Form Summarization	124	124
Blog Post Writing	140	143

Scores combine capability match, benchmark data, and pricing for each task.
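The table lists scores but does not name a winner per task. The sketch below tallies one, assuming a higher score is better (the page does not state this explicitly). The scores are copied from the table; the names `SCORES`, `tally`, and `largest_gap` are hypothetical helpers introduced for illustration only.

```python
# Scores copied from the table above: (task, Grok 4.3, Grok 4.20).
SCORES = [
    ("SQL Generation", 169, 171),
    ("Code Review", 166, 170),
    ("Code Completion", 121, 122),
    ("Code Refactoring", 163, 168),
    ("Bug Fixing", 181, 185),
    ("Unit Test Generation", 152, 154),
    ("Code Documentation", 143, 147),
    ("Regex Writing", 136, 137),
    ("CI/CD Pipelines", 143, 146),
    ("Frontend Component Design", 145, 146),
    ("Data Analysis", 170, 170),
    ("CSV / Spreadsheet Cleanup", 150, 153),
    ("ETL Scripting", 153, 157),
    ("Bulk Data Labeling", 125, 125),
    ("OCR / Document Parsing", 144, 145),
    ("Table Extraction from PDFs", 144, 145),
    ("Long-Document Summarization", 160, 166),
    ("Short-Form Summarization", 124, 124),
    ("Blog Post Writing", 140, 143),
]


def tally(scores):
    """Count wins per model and ties, assuming a higher score is better."""
    counts = {"Grok 4.3": 0, "Grok 4.20": 0, "tie": 0}
    for _task, grok_43, grok_420 in scores:
        if grok_43 > grok_420:
            counts["Grok 4.3"] += 1
        elif grok_420 > grok_43:
            counts["Grok 4.20"] += 1
        else:
            counts["tie"] += 1
    return counts


def largest_gap(scores):
    """Return the row with the biggest absolute score difference."""
    return max(scores, key=lambda row: abs(row[2] - row[1]))


if __name__ == "__main__":
    print(tally(SCORES))
    task, grok_43, grok_420 = largest_gap(SCORES)
    print(f"Largest gap: {task} ({grok_43} vs {grok_420})")
```

Under that assumption, Grok 4.20 leads on 16 of the 19 tasks and the remaining 3 (Data Analysis, Bulk Data Labeling, Short-Form Summarization) are ties. The widest margin is 6 points, on Long-Document Summarization.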