head-to-head

Anthropic: Claude Sonnet 4.6 vs Qwen: Qwen3.5 397B A17B

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-05-12.

Anthropic: Claude Sonnet 4.6 Qwen: Qwen3.5 397B A17B
Vendoranthropicqwen
Quality Score100100
Benchmark Score80.0-
Input Price$3.00/M$0.39/M
Output Price$15.00/M$2.34/M
Context Window1,000,000262,144
Max Output128,00065,536
Tool Calling
Structured Output
Reasoning Mode
Vision
Audio--
Benchmark Scores
ai_index85.3-
ai_index_agentic100.0-
ai_index_coding84.1-
aider_polyglot61.3-

Who wins by task?

TaskAnthropic: Claude Sonnet 4.6Qwen: Qwen3.5 397B A17B
SQL Generation 181 131
Code Review 177 126
Code Completion 118 128
Code Refactoring 172 127
Bug Fixing 194 130
Unit Test Generation 163 121
Code Documentation 144 125
Regex Writing 139 119
CI/CD Pipelines 152 117
Frontend Component Design 153 122
Data Analysis 184 124
CSV / Spreadsheet Cleanup 158 128
ETL Scripting 162 122
JSON Extraction 141 131
Bulk Data Labeling 123 129
OCR / Document Parsing 150 128
Table Extraction from PDFs 150 128
Long-Document Summarization 166 129
Short-Form Summarization 123 123
Blog Post Writing 145 119

Scores reflect capability match + benchmark data + pricing for each task. Methodology →

Related comparisons

Google: Gemini 3.1 Flash Lite vs Anthropic: Claude Sonnet 4.6 Google: Gemini 3.1 Flash Lite vs Qwen: Qwen3.5 397B A17B xAI: Grok 4.3 vs Anthropic: Claude Sonnet 4.6 xAI: Grok 4.3 vs Qwen: Qwen3.5 397B A17B Mistral: Mistral Medium 3.5 vs Anthropic: Claude Sonnet 4.6 Mistral: Mistral Medium 3.5 vs Qwen: Qwen3.5 397B A17B NVIDIA: Nemotron 3 Nano Omni (free) vs Anthropic: Claude Sonnet 4.6 NVIDIA: Nemotron 3 Nano Omni (free) vs Qwen: Qwen3.5 397B A17B