head-to-head

Mistral: Mistral Medium 3.5 vs xAI: Grok 4.20

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-19.

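| Spec | Mistral: Mistral Medium 3.5 | xAI: Grok 4.20 |
| --- | --- | --- |
| Vendor | mistralai | x-ai |
| Quality Score | 100 | 100 |
| Benchmark Score | 63.3 | 69.3 |
| Input Price | $1.50/M | $1.25/M |
| Output Price | $7.50/M | $2.50/M |
| Context Window | 262,144 | 2,000,000 |
| Max Output | - | - |
| Tool Calling | | |
| Structured Output | | |
| Reasoning Mode | | |
| Vision | | |
| Audio | - | - |

Benchmark Scores

| Benchmark | Mistral: Mistral Medium 3.5 | xAI: Grok 4.20 |
| --- | --- | --- |
| ai_index | 49.4 | 61.0 |
| ai_index_agentic | 87.7 | 88.9 |
| ai_index_coding | 58.4 | 69.6 |
| eqbench | - | 55.8 |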
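To make the price gap concrete, here is a minimal sketch that turns the Input Price and Output Price rows into a per-request cost. The per-million-token prices are copied from the table; the function name `request_cost` and the workload of 10,000 input tokens and 2,000 output tokens are hypothetical, chosen only for illustration.

```python
# Prices in USD per million tokens, copied from the spec table.
PRICES = {
    "Mistral: Mistral Medium 3.5": {"input": 1.50, "output": 7.50},
    "xAI: Grok 4.20": {"input": 1.25, "output": 2.50},
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request at the listed per-million prices."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


if __name__ == "__main__":
    # Hypothetical workload: 10,000 input tokens, 2,000 output tokens.
    for model in PRICES:
        cost = request_cost(model, 10_000, 2_000)
        print(f"{model}: ${cost:.4f} per request")
```

Because the output price differs by a factor of three while the input price differs by only 20%, the gap between the two models widens as a workload becomes more output-heavy.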

Who wins by task?

| Task | Mistral: Mistral Medium 3.5 | xAI: Grok 4.20 |
| --- | --- | --- |
| SQL Generation | 158 | 169 |
| Code Review | 151 | 166 |
| Code Completion | 116 | 122 |
| Code Refactoring | 147 | 165 |
| Bug Fixing | 165 | 181 |
| Unit Test Generation | 142 | 152 |
| Code Documentation | 132 | 145 |
| Regex Writing | 130 | 135 |
| CI/CD Pipelines | 134 | 143 |
| Frontend Component Design | 139 | 144 |
| Data Analysis | 160 | 167 |
| CSV / Spreadsheet Cleanup | 141 | 152 |
| ETL Scripting | 140 | 154 |
| JSON Extraction | 133 | 136 |
| Bulk Data Labeling | 123 | 125 |
| OCR / Document Parsing | 138 | 144 |
| Table Extraction from PDFs | 138 | 144 |
| Long-Document Summarization | 144 | 162 |
| Short-Form Summarization | 120 | 123 |
| Blog Post Writing | 131 | 141 |

Scores reflect capability match, benchmark data, and pricing for each task.
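As a rough way to read the task table, the sketch below tallies which model scores higher per task and ranks the tasks by margin. The score pairs are copied from the table (Mistral first, Grok second); the names `TASK_SCORES` and `margins` are hypothetical, and the margin is a plain difference of the published scores, not part of the site's methodology.

```python
# (Mistral Medium 3.5 score, Grok 4.20 score) per task, copied from the table.
TASK_SCORES = {
    "SQL Generation": (158, 169),
    "Code Review": (151, 166),
    "Code Completion": (116, 122),
    "Code Refactoring": (147, 165),
    "Bug Fixing": (165, 181),
    "Unit Test Generation": (142, 152),
    "Code Documentation": (132, 145),
    "Regex Writing": (130, 135),
    "CI/CD Pipelines": (134, 143),
    "Frontend Component Design": (139, 144),
    "Data Analysis": (160, 167),
    "CSV / Spreadsheet Cleanup": (141, 152),
    "ETL Scripting": (140, 154),
    "JSON Extraction": (133, 136),
    "Bulk Data Labeling": (123, 125),
    "OCR / Document Parsing": (138, 144),
    "Table Extraction from PDFs": (138, 144),
    "Long-Document Summarization": (144, 162),
    "Short-Form Summarization": (120, 123),
    "Blog Post Writing": (131, 141),
}

# Margin = Grok score minus Mistral score; positive means Grok leads.
margins = {task: grok - mistral for task, (mistral, grok) in TASK_SCORES.items()}
grok_wins = sum(1 for m in margins.values() if m > 0)
mistral_wins = sum(1 for m in margins.values() if m < 0)

if __name__ == "__main__":
    print(f"Grok leads on {grok_wins} tasks, Mistral on {mistral_wins}")
    # List tasks from widest to narrowest gap.
    for task, margin in sorted(margins.items(), key=lambda kv: -kv[1]):
        print(f"{task}: {margin:+d}")
```

On these numbers Grok 4.20 leads on every listed task; the widest gaps are in Code Refactoring and Long-Document Summarization, and the narrowest is in Bulk Data Labeling.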

Related comparisons

MoonshotAI: Kimi K2.7 Code vs Mistral: Mistral Medium 3.5
MoonshotAI: Kimi K2.7 Code vs xAI: Grok 4.20
Qwen: Qwen3.7 Plus vs Mistral: Mistral Medium 3.5
Qwen: Qwen3.7 Plus vs xAI: Grok 4.20
MiniMax: MiniMax M3 vs Mistral: Mistral Medium 3.5
MiniMax: MiniMax M3 vs xAI: Grok 4.20
StepFun: Step 3.7 Flash vs Mistral: Mistral Medium 3.5
StepFun: Step 3.7 Flash vs xAI: Grok 4.20