head-to-head

xAI: Grok 4.20 vs Qwen: Qwen3.5-35B-A3B

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-18.

xAI: Grok 4.20 Qwen: Qwen3.5-35B-A3B
Vendorx-aiqwen
Quality Score100100
Benchmark Score69.355.8
Input Price$1.25/M$0.14/M
Output Price$2.50/M$1.00/M
Context Window2,000,000262,144
Max Output-81,920
Tool Calling
Structured Output
Reasoning Mode
Vision
Audio--
Benchmark Scores
ai_index61.048.3
ai_index_agentic88.972.8
ai_index_coding69.649.9
eqbench55.8-

Who wins by task?

TaskxAI: Grok 4.20Qwen: Qwen3.5-35B-A3B
SQL Generation 169 155
Code Review 166 148
Code Completion 122 130
Code Refactoring 165 145
Bug Fixing 181 160
Unit Test Generation 152 139
Code Documentation 145 133
Regex Writing 135 131
CI/CD Pipelines 143 132
Frontend Component Design 144 137
Data Analysis 167 155
CSV / Spreadsheet Cleanup 152 140
ETL Scripting 154 138
JSON Extraction 136 141
Bulk Data Labeling 125 132
OCR / Document Parsing 144 137
Table Extraction from PDFs 144 137
Long-Document Summarization 162 143
Short-Form Summarization 123 129
Blog Post Writing 141 131

Scores reflect capability match + benchmark data + pricing for each task. Methodology →

Related comparisons

MoonshotAI: Kimi K2.7 Code vs xAI: Grok 4.20 MoonshotAI: Kimi K2.7 Code vs Qwen: Qwen3.5-35B-A3B Qwen: Qwen3.7 Plus vs xAI: Grok 4.20 Qwen: Qwen3.7 Plus vs Qwen: Qwen3.5-35B-A3B MiniMax: MiniMax M3 vs xAI: Grok 4.20 MiniMax: MiniMax M3 vs Qwen: Qwen3.5-35B-A3B StepFun: Step 3.7 Flash vs xAI: Grok 4.20 StepFun: Step 3.7 Flash vs Qwen: Qwen3.5-35B-A3B