head-to-head

xAI: Grok Build 0.1 vs Qwen: Qwen3.5-9B

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-17.

xAI: Grok Build 0.1 Qwen: Qwen3.5-9B
Vendorx-aiqwen
Quality Score100100
Benchmark Score-47.1
Input Price$1.00/M$0.10/M
Output Price$2.00/M$0.15/M
Context Window256,000262,144
Max Output-262,144
Tool Calling
Structured Output
Reasoning Mode
Vision
Audio--
Benchmark Scores
ai_index-41.2
ai_index_agentic-61.7
ai_index_coding-41.8

Who wins by task?

TaskxAI: Grok Build 0.1Qwen: Qwen3.5-9B
SQL Generation 130 151
Code Review 126 145
Code Completion 116 130
Code Refactoring 127 142
Bug Fixing 130 156
Unit Test Generation 121 137
Code Documentation 125 132
Regex Writing 119 129
CI/CD Pipelines 117 130
Frontend Component Design 122 135
Data Analysis 124 150
CSV / Spreadsheet Cleanup 127 138
ETL Scripting 122 136
JSON Extraction 123 140
Bulk Data Labeling 121 132
OCR / Document Parsing 128 135
Table Extraction from PDFs 128 135
Long-Document Summarization 129 141
Short-Form Summarization 115 128
Blog Post Writing 118 129

Scores reflect capability match + benchmark data + pricing for each task. Methodology →

Related comparisons

MoonshotAI: Kimi K2.7 Code vs xAI: Grok Build 0.1 MoonshotAI: Kimi K2.7 Code vs Qwen: Qwen3.5-9B Qwen: Qwen3.7 Plus vs xAI: Grok Build 0.1 Qwen: Qwen3.7 Plus vs Qwen: Qwen3.5-9B MiniMax: MiniMax M3 vs xAI: Grok Build 0.1 MiniMax: MiniMax M3 vs Qwen: Qwen3.5-9B StepFun: Step 3.7 Flash vs xAI: Grok Build 0.1 StepFun: Step 3.7 Flash vs Qwen: Qwen3.5-9B