
xAI: Grok 4.20 vs Qwen: Qwen3.5-27B

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-22.

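| | xAI: Grok 4.20 | Qwen: Qwen3.5-27B |
|---|---|---|
| Vendor | x-ai | qwen |
| Quality Score | 100 | 100 |
| Benchmark Score | 61.5 | 55.6 |
| Input Price | $1.25/M | $0.20/M |
| Output Price | $2.50/M | $1.56/M |
| Context Window | 2,000,000 | 262,144 |
| Max Output | - | 65,536 |
| Tool Calling | | |
| Structured Output | | |
| Reasoning Mode | | |
| Vision | | |
| Audio | - | - |

Benchmark Scores

| | xAI: Grok 4.20 | Qwen: Qwen3.5-27B |
|---|---|---|
| ai_index | 61.0 | 55.7 |
| eqbench | 55.8 | - |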
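
To make the price difference concrete, here is a minimal sketch that estimates the cost of a single request from the listed prices. It assumes "/M" means US dollars per one million tokens; the function name `request_cost` and the workload of 10,000 input tokens and 1,000 output tokens are hypothetical, chosen only for illustration.

```python
# Listed prices, assumed to be USD per one million tokens ("/M").
PRICES = {
    "xAI: Grok 4.20": {"input": 1.25, "output": 2.50},
    "Qwen: Qwen3.5-27B": {"input": 0.20, "output": 1.56},
}


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated cost in USD of one request (hypothetical helper)."""
    price = PRICES[model]
    total = input_tokens * price["input"] + output_tokens * price["output"]
    return total / 1_000_000


# Hypothetical workload: 10,000 input tokens and 1,000 output tokens.
for model in PRICES:
    print(f"{model}: ${request_cost(model, 10_000, 1_000):.5f}")
```

For this workload the sketch gives roughly $0.015 per request for Grok 4.20 and $0.00356 for Qwen3.5-27B. The gap narrows as the share of output tokens grows, because the output prices are much closer to each other than the input prices.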

Who wins by task?

| Task | xAI: Grok 4.20 | Qwen: Qwen3.5-27B |
|---|---|---|
| SQL Generation | 144 | 137 |
| Code Review | 150 | 137 |
| Code Completion | 122 | 130 |
| Code Refactoring | 153 | 136 |
| Bug Fixing | 154 | 141 |
| Unit Test Generation | 135 | 127 |
| Code Documentation | 141 | 131 |
| Regex Writing | 127 | 125 |
| CI/CD Pipelines | 131 | 123 |
| Frontend Component Design | 131 | 128 |
| Data Analysis | 136 | 132 |
| CSV / Spreadsheet Cleanup | 139 | 130 |
| ETL Scripting | 142 | 130 |
| JSON Extraction | 123 | 131 |
| Bulk Data Labeling | 120 | 129 |
| OCR / Document Parsing | 135 | 131 |
| Table Extraction from PDFs | 135 | 131 |
| Long-Document Summarization | 154 | 138 |
| Short-Form Summarization | 119 | 126 |
| Blog Post Writing | 132 | 125 |

Scores reflect capability match, benchmark data, and pricing for each task.
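
As a rough way to summarize the task table, the sketch below counts how many tasks each model leads. The scores are copied from the table; the helper name `tally_wins` is hypothetical, and the tally treats every task as equally important, which may not match any particular workload.

```python
# Task scores copied from the table: (Grok 4.20, Qwen3.5-27B).
SCORES = {
    "SQL Generation": (144, 137),
    "Code Review": (150, 137),
    "Code Completion": (122, 130),
    "Code Refactoring": (153, 136),
    "Bug Fixing": (154, 141),
    "Unit Test Generation": (135, 127),
    "Code Documentation": (141, 131),
    "Regex Writing": (127, 125),
    "CI/CD Pipelines": (131, 123),
    "Frontend Component Design": (131, 128),
    "Data Analysis": (136, 132),
    "CSV / Spreadsheet Cleanup": (139, 130),
    "ETL Scripting": (142, 130),
    "JSON Extraction": (123, 131),
    "Bulk Data Labeling": (120, 129),
    "OCR / Document Parsing": (135, 131),
    "Table Extraction from PDFs": (135, 131),
    "Long-Document Summarization": (154, 138),
    "Short-Form Summarization": (119, 126),
    "Blog Post Writing": (132, 125),
}


def tally_wins(scores):
    """Return (grok_wins, qwen_wins, ties) across all tasks (hypothetical helper)."""
    grok = sum(1 for g, q in scores.values() if g > q)
    qwen = sum(1 for g, q in scores.values() if q > g)
    ties = len(scores) - grok - qwen
    return grok, qwen, ties


grok_wins, qwen_wins, ties = tally_wins(SCORES)
print(f"Grok 4.20 leads {grok_wins} tasks, Qwen3.5-27B leads {qwen_wins}, ties: {ties}")

# Tasks where Qwen3.5-27B scores higher.
print([task for task, (g, q) in SCORES.items() if q > g])
```

On the listed scores, Grok 4.20 leads 16 of the 20 tasks and Qwen3.5-27B leads 4: Code Completion, JSON Extraction, Bulk Data Labeling, and Short-Form Summarization.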

Related comparisons

MoonshotAI: Kimi K2.7 Code vs xAI: Grok 4.20
MoonshotAI: Kimi K2.7 Code vs Qwen: Qwen3.5-27B
Qwen: Qwen3.7 Plus vs xAI: Grok 4.20
Qwen: Qwen3.7 Plus vs Qwen: Qwen3.5-27B
MiniMax: MiniMax M3 vs xAI: Grok 4.20
MiniMax: MiniMax M3 vs Qwen: Qwen3.5-27B
StepFun: Step 3.7 Flash vs xAI: Grok 4.20
StepFun: Step 3.7 Flash vs Qwen: Qwen3.5-27B