head-to-head

Anthropic: Claude Sonnet 4.6 vs Qwen: Qwen3.5 Plus 2026-02-15

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-05-12.

Anthropic: Claude Sonnet 4.6 Qwen: Qwen3.5 Plus 2026-02-15
Vendoranthropicqwen
Quality Score100100
Benchmark Score80.0-
Input Price$3.00/M$0.26/M
Output Price$15.00/M$1.56/M
Context Window1,000,0001,000,000
Max Output128,00065,536
Tool Calling
Structured Output
Reasoning Mode
Vision
Audio--
Benchmark Scores
ai_index85.3-
ai_index_agentic100.0-
ai_index_coding84.1-
aider_polyglot61.3-

Who wins by task?

TaskAnthropic: Claude Sonnet 4.6Qwen: Qwen3.5 Plus 2026-02-15
SQL Generation 181 133
Code Review 177 132
Code Completion 118 131
Code Refactoring 172 136
Bug Fixing 194 136
Unit Test Generation 163 124
Code Documentation 144 131
Regex Writing 139 119
CI/CD Pipelines 152 120
Frontend Component Design 153 122
Data Analysis 184 124
CSV / Spreadsheet Cleanup 158 133
ETL Scripting 162 128
JSON Extraction 141 131
Bulk Data Labeling 123 129
OCR / Document Parsing 150 131
Table Extraction from PDFs 150 131
Long-Document Summarization 166 137
Short-Form Summarization 123 123
Blog Post Writing 145 121

Scores reflect capability match + benchmark data + pricing for each task. Methodology →

Related comparisons

Google: Gemini 3.1 Flash Lite vs Anthropic: Claude Sonnet 4.6 Google: Gemini 3.1 Flash Lite vs Qwen: Qwen3.5 Plus 2026-02-15 xAI: Grok 4.3 vs Anthropic: Claude Sonnet 4.6 xAI: Grok 4.3 vs Qwen: Qwen3.5 Plus 2026-02-15 Mistral: Mistral Medium 3.5 vs Anthropic: Claude Sonnet 4.6 Mistral: Mistral Medium 3.5 vs Qwen: Qwen3.5 Plus 2026-02-15 NVIDIA: Nemotron 3 Nano Omni (free) vs Anthropic: Claude Sonnet 4.6 NVIDIA: Nemotron 3 Nano Omni (free) vs Qwen: Qwen3.5 Plus 2026-02-15