head-to-head
Z.ai: GLM 5V Turbo vs xAI: Grok 4.20
Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-23.
| Z.ai: GLM 5V Turbo | xAI: Grok 4.20 | |
|---|---|---|
| Vendor | z-ai | x-ai |
| Quality Score | 100 | 100 |
| Benchmark Score | 56.8 | 61.5 |
| Input Price | $1.20/M | $1.25/M |
| Output Price | $4.00/M | $2.50/M |
| Context Window | 202,752 | 2,000,000 |
| Max Output | 131,072 | - |
| Tool Calling | ✓ | ✓ |
| Structured Output | ✓ | ✓ |
| Reasoning Mode | ✓ | ✓ |
| Vision | ✓ | ✓ |
| Audio | - | - |
| Benchmark Scores | ||
| ai_index | 56.9 | 61.0 |
| eqbench | - | 55.8 |
Who wins by task?
| Task | Z.ai: GLM 5V Turbo | xAI: Grok 4.20 |
|---|---|---|
| SQL Generation | 136 | 144 |
| Code Review | 134 | 150 |
| Code Completion | 116 | 122 |
| Code Refactoring | 133 | 153 |
| Bug Fixing | 138 | 154 |
| Unit Test Generation | 126 | 135 |
| Code Documentation | 128 | 141 |
| Regex Writing | 124 | 127 |
| CI/CD Pipelines | 122 | 131 |
| Frontend Component Design | 128 | 131 |
| Data Analysis | 132 | 136 |
| CSV / Spreadsheet Cleanup | 127 | 139 |
| ETL Scripting | 128 | 142 |
| JSON Extraction | 123 | 123 |
| Bulk Data Labeling | 120 | 120 |
| OCR / Document Parsing | 129 | 135 |
| Table Extraction from PDFs | 129 | 135 |
| Long-Document Summarization | 134 | 154 |
| Short-Form Summarization | 117 | 119 |
| Blog Post Writing | 124 | 132 |
Scores reflect capability match + benchmark data + pricing for each task. Methodology →
Related comparisons
MoonshotAI: Kimi K2.7 Code vs Z.ai: GLM 5V Turbo
MoonshotAI: Kimi K2.7 Code vs xAI: Grok 4.20
Qwen: Qwen3.7 Plus vs Z.ai: GLM 5V Turbo
Qwen: Qwen3.7 Plus vs xAI: Grok 4.20
MiniMax: MiniMax M3 vs Z.ai: GLM 5V Turbo
MiniMax: MiniMax M3 vs xAI: Grok 4.20
StepFun: Step 3.7 Flash vs Z.ai: GLM 5V Turbo
StepFun: Step 3.7 Flash vs xAI: Grok 4.20