head-to-head
StepFun: Step 3.7 Flash vs Qwen: Qwen3.6 27B
Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-12.
| StepFun: Step 3.7 Flash | Qwen: Qwen3.6 27B | |
|---|---|---|
| Vendor | stepfun | qwen |
| Quality Score | 100 | 100 |
| Benchmark Score | 74.4 | 77.0 |
| Input Price | $0.20/M | $0.29/M |
| Output Price | $1.15/M | $2.40/M |
| Context Window | 256,000 | 262,144 |
| Max Output | 256,000 | 131,072 |
| Tool Calling | ✓ | ✓ |
| Structured Output | ✓ | ✓ |
| Reasoning Mode | ✓ | ✓ |
| Vision | ✓ | ✓ |
| Audio | - | - |
| Benchmark Scores | ||
| ai_index | 70.3 | 75.6 |
| ai_index_agentic | 98.2 | 100.0 |
| ai_index_coding | 61.2 | 60.2 |
Who wins by task?
| Task | StepFun: Step 3.7 Flash | Qwen: Qwen3.6 27B |
|---|---|---|
| SQL Generation | 163 | 164 |
| Code Review | 156 | 157 |
| Code Completion | 130 | 130 |
| Code Refactoring | 151 | 152 |
| Bug Fixing | 171 | 173 |
| Unit Test Generation | 146 | 146 |
| Code Documentation | 136 | 137 |
| Regex Writing | 135 | 135 |
| CI/CD Pipelines | 137 | 138 |
| Frontend Component Design | 142 | 143 |
| Data Analysis | 166 | 167 |
| CSV / Spreadsheet Cleanup | 143 | 143 |
| ETL Scripting | 144 | 145 |
| JSON Extraction | 143 | 143 |
| Bulk Data Labeling | 133 | 133 |
| OCR / Document Parsing | 139 | 140 |
| Table Extraction from PDFs | 139 | 140 |
| Long-Document Summarization | 148 | 150 |
| Blog Post Writing | 135 | 136 |
Scores reflect capability match + benchmark data + pricing for each task. Methodology →
Related comparisons
Qwen: Qwen3.7 Plus vs StepFun: Step 3.7 Flash
Qwen: Qwen3.7 Plus vs Qwen: Qwen3.6 27B
MiniMax: MiniMax M3 vs StepFun: Step 3.7 Flash
MiniMax: MiniMax M3 vs Qwen: Qwen3.6 27B
StepFun: Step 3.7 Flash vs xAI: Grok Build 0.1
StepFun: Step 3.7 Flash vs Google: Gemini 3.5 Flash
StepFun: Step 3.7 Flash vs Google: Gemini 3.1 Flash Lite
StepFun: Step 3.7 Flash vs xAI: Grok 4.3