head-to-head
OpenAI: GPT-5.4 vs Qwen: Qwen3.5-27B
Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-22.
| OpenAI: GPT-5.4 | Qwen: Qwen3.5-27B | |
|---|---|---|
| Vendor | openai | qwen |
| Quality Score | 100 | 100 |
| Benchmark Score | 90.4 | 55.6 |
| Input Price | $2.50/M | $0.20/M |
| Output Price | $15.00/M | $1.56/M |
| Context Window | 1,050,000 | 262,144 |
| Max Output | 128,000 | 65,536 |
| Tool Calling | ✓ | ✓ |
| Structured Output | ✓ | ✓ |
| Reasoning Mode | ✓ | ✓ |
| Vision | ✓ | ✓ |
| Audio | - | - |
| Benchmark Scores | ||
| ai_index | 84.8 | 55.7 |
| ai_index_agentic | 67.8 | - |
| ai_index_coding | 100.0 | - |
| eqbench | 82.4 | - |
Who wins by task?
| Task | OpenAI: GPT-5.4 | Qwen: Qwen3.5-27B |
|---|---|---|
| SQL Generation | 174 | 137 |
| Code Review | 175 | 137 |
| Code Completion | 120 | 130 |
| Code Refactoring | 174 | 136 |
| Bug Fixing | 188 | 141 |
| Unit Test Generation | 159 | 127 |
| Code Documentation | 146 | 131 |
| Regex Writing | 136 | 125 |
| CI/CD Pipelines | 149 | 123 |
| Frontend Component Design | 149 | 128 |
| Data Analysis | 173 | 132 |
| CSV / Spreadsheet Cleanup | 157 | 130 |
| ETL Scripting | 161 | 130 |
| JSON Extraction | 137 | 131 |
| Bulk Data Labeling | 122 | 129 |
| OCR / Document Parsing | 149 | 131 |
| Table Extraction from PDFs | 149 | 131 |
| Long-Document Summarization | 168 | 138 |
| Short-Form Summarization | 122 | 126 |
| Blog Post Writing | 144 | 125 |
Scores reflect capability match + benchmark data + pricing for each task. Methodology →
Related comparisons
MoonshotAI: Kimi K2.7 Code vs OpenAI: GPT-5.4
MoonshotAI: Kimi K2.7 Code vs Qwen: Qwen3.5-27B
Qwen: Qwen3.7 Plus vs OpenAI: GPT-5.4
Qwen: Qwen3.7 Plus vs Qwen: Qwen3.5-27B
MiniMax: MiniMax M3 vs OpenAI: GPT-5.4
MiniMax: MiniMax M3 vs Qwen: Qwen3.5-27B
StepFun: Step 3.7 Flash vs OpenAI: GPT-5.4
StepFun: Step 3.7 Flash vs Qwen: Qwen3.5-27B