head-to-head

Qwen: Qwen3.5 Plus 2026-04-20 vs xAI: Grok 4.20

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-20.

Qwen: Qwen3.5 Plus 2026-04-20 xAI: Grok 4.20
Vendorqwenx-ai
Quality Score100100
Benchmark Score-61.5
Input Price$0.30/M$1.25/M
Output Price$1.80/M$2.50/M
Context Window1,000,0002,000,000
Max Output65,536-
Tool Calling
Structured Output
Reasoning Mode
Vision
Audio--
Benchmark Scores
ai_index-61.0
eqbench-55.8

Who wins by task?

TaskQwen: Qwen3.5 Plus 2026-04-20xAI: Grok 4.20
SQL Generation 133 144
Code Review 132 150
Code Completion 131 122
Code Refactoring 136 153
Bug Fixing 136 154
Unit Test Generation 124 135
Code Documentation 131 141
Regex Writing 119 127
CI/CD Pipelines 120 131
Frontend Component Design 122 131
Data Analysis 124 136
CSV / Spreadsheet Cleanup 133 139
ETL Scripting 128 142
JSON Extraction 131 123
Bulk Data Labeling 129 120
OCR / Document Parsing 131 135
Table Extraction from PDFs 131 135
Long-Document Summarization 137 154
Short-Form Summarization 123 119
Blog Post Writing 121 132

Scores reflect capability match + benchmark data + pricing for each task. Methodology →

Related comparisons

MoonshotAI: Kimi K2.7 Code vs Qwen: Qwen3.5 Plus 2026-04-20 MoonshotAI: Kimi K2.7 Code vs xAI: Grok 4.20 Qwen: Qwen3.7 Plus vs Qwen: Qwen3.5 Plus 2026-04-20 Qwen: Qwen3.7 Plus vs xAI: Grok 4.20 MiniMax: MiniMax M3 vs Qwen: Qwen3.5 Plus 2026-04-20 MiniMax: MiniMax M3 vs xAI: Grok 4.20 StepFun: Step 3.7 Flash vs Qwen: Qwen3.5 Plus 2026-04-20 StepFun: Step 3.7 Flash vs xAI: Grok 4.20