head-to-head
xAI: Grok Build 0.1 vs Mistral: Mistral Small 4
Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-17.
| | xAI: Grok Build 0.1 | Mistral: Mistral Small 4 |
|---|---|---|
| Vendor | x-ai | mistralai |
| Quality Score | 100 | 100 |
| Benchmark Score | - | 6.1 |
| Input Price | $1.00/M | $0.15/M |
| Output Price | $2.00/M | $0.60/M |
| Context Window | 256,000 | 262,144 |
| Max Output | - | - |
| Tool Calling | ✓ | ✓ |
| Structured Output | ✓ | ✓ |
| Reasoning Mode | ✓ | ✓ |
| Vision | ✓ | ✓ |
| Audio | - | - |
| Benchmark Scores | | |
| ai_index | - | 7.7 |
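The per-million-token prices in the table translate into a per-request cost with simple arithmetic. The sketch below is only an illustration: the prices are copied from the table, while the `request_cost` helper and the 10,000-input / 2,000-output token workload are hypothetical and chosen just to show the calculation.

```python
# Prices in USD per million tokens, copied from the comparison table.
PRICES = {
    "xAI: Grok Build 0.1": {"input": 1.00, "output": 2.00},
    "Mistral: Mistral Small 4": {"input": 0.15, "output": 0.60},
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the USD cost of one request (hypothetical helper)."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000  # prices are quoted per million tokens


if __name__ == "__main__":
    # Assumed workload, not from the source: 10,000 input and 2,000 output tokens.
    for model in PRICES:
        print(f"{model}: ${request_cost(model, 10_000, 2_000):.4f}")
```

For this assumed workload the request costs about $0.0140 on Grok Build 0.1 and about $0.0027 on Mistral Small 4. Real costs depend on the actual token mix, and the sketch ignores any caching or batch discounts a provider may offer.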
Who wins by task?
| Task | xAI: Grok Build 0.1 | Mistral: Mistral Small 4 |
|---|---|---|
| SQL Generation | 130 | 132 |
| Code Review | 126 | 127 |
| Code Completion | 116 | 129 |
| Code Refactoring | 127 | 129 |
| Bug Fixing | 130 | 131 |
| Unit Test Generation | 121 | 122 |
| Code Documentation | 125 | 127 |
| Regex Writing | 119 | 120 |
| CI/CD Pipelines | 117 | 118 |
| Frontend Component Design | 122 | 122 |
| Data Analysis | 124 | 125 |
| CSV / Spreadsheet Cleanup | 127 | 128 |
| ETL Scripting | 122 | 123 |
| JSON Extraction | 123 | 131 |
| Bulk Data Labeling | 121 | 129 |
| OCR / Document Parsing | 128 | 128 |
| Table Extraction from PDFs | 128 | 128 |
| Long-Document Summarization | 129 | 130 |
| Short-Form Summarization | 115 | 124 |
| Blog Post Writing | 118 | 120 |
Scores reflect capability match, benchmark data, and pricing for each task.
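To read the task table at a glance, the rows can be tallied into wins and ties. This is a minimal sketch that uses the scores exactly as listed; the `tally` and `largest_gap` helpers are hypothetical names, and the sketch makes no assumption about how the scores themselves are computed.

```python
# (Grok Build 0.1 score, Mistral Small 4 score) per task, copied from the table.
SCORES = {
    "SQL Generation": (130, 132),
    "Code Review": (126, 127),
    "Code Completion": (116, 129),
    "Code Refactoring": (127, 129),
    "Bug Fixing": (130, 131),
    "Unit Test Generation": (121, 122),
    "Code Documentation": (125, 127),
    "Regex Writing": (119, 120),
    "CI/CD Pipelines": (117, 118),
    "Frontend Component Design": (122, 122),
    "Data Analysis": (124, 125),
    "CSV / Spreadsheet Cleanup": (127, 128),
    "ETL Scripting": (122, 123),
    "JSON Extraction": (123, 131),
    "Bulk Data Labeling": (121, 129),
    "OCR / Document Parsing": (128, 128),
    "Table Extraction from PDFs": (128, 128),
    "Long-Document Summarization": (129, 130),
    "Short-Form Summarization": (115, 124),
    "Blog Post Writing": (118, 120),
}


def tally(scores: dict[str, tuple[int, int]]) -> dict[str, int]:
    """Count tasks where each model scores higher, plus ties."""
    result = {"grok": 0, "mistral": 0, "tie": 0}
    for grok, mistral in scores.values():
        if grok > mistral:
            result["grok"] += 1
        elif mistral > grok:
            result["mistral"] += 1
        else:
            result["tie"] += 1
    return result


def largest_gap(scores: dict[str, tuple[int, int]]) -> tuple[str, int]:
    """Return the task with the widest score difference and that difference."""
    task = max(scores, key=lambda t: abs(scores[t][0] - scores[t][1]))
    return task, abs(scores[task][0] - scores[task][1])


if __name__ == "__main__":
    print(tally(SCORES))
    print(largest_gap(SCORES))
```

On the listed scores, Mistral Small 4 is ahead on 17 of the 20 tasks, the two models tie on 3, and the widest gap is 13 points on Code Completion. Most of the other margins are only one or two points.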
Related comparisons
MoonshotAI: Kimi K2.7 Code vs xAI: Grok Build 0.1
MoonshotAI: Kimi K2.7 Code vs Mistral: Mistral Small 4
Qwen: Qwen3.7 Plus vs xAI: Grok Build 0.1
Qwen: Qwen3.7 Plus vs Mistral: Mistral Small 4
MiniMax: MiniMax M3 vs xAI: Grok Build 0.1
MiniMax: MiniMax M3 vs Mistral: Mistral Small 4
StepFun: Step 3.7 Flash vs xAI: Grok Build 0.1
StepFun: Step 3.7 Flash vs Mistral: Mistral Small 4