StepFun: Step 3.7 Flash vs Qwen: Qwen3.6 27B

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-12.

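| Spec | StepFun: Step 3.7 Flash | Qwen: Qwen3.6 27B |
| --- | --- | --- |
| Vendor | stepfun | qwen |
| Quality Score | 100 | 100 |
| Benchmark Score | 74.4 | 77.0 |
| Input Price | $0.20/M | $0.29/M |
| Output Price | $1.15/M | $2.40/M |
| Context Window | 256,000 | 262,144 |
| Max Output | 256,000 | 131,072 |
| Tool Calling | | |
| Structured Output | | |
| Reasoning Mode | | |
| Vision | | |
| Audio | - | - |

Benchmark Scores

| Benchmark | StepFun: Step 3.7 Flash | Qwen: Qwen3.6 27B |
| --- | --- | --- |
| ai_index | 70.3 | 75.6 |
| ai_index_agentic | 98.2 | 100.0 |
| ai_index_coding | 61.2 | 60.2 |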
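
The listed per-million-token prices can be turned into a rough cost estimate for a given workload. The sketch below is illustrative only: the `Pricing` class and the workload of 10M input tokens and 2M output tokens are assumptions made for this example, not figures from the comparison, and real bills may differ (caching, tiered pricing, and provider routing are not modeled).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Prices in US dollars per one million tokens (hypothetical helper)."""
    input_per_million: float
    output_per_million: float

    def cost(self, input_tokens: int, output_tokens: int) -> float:
        """Return the estimated cost in dollars for the given token counts."""
        total = (input_tokens * self.input_per_million
                 + output_tokens * self.output_per_million)
        return total / 1_000_000


# Prices taken from the spec comparison above.
STEP_37_FLASH = Pricing(input_per_million=0.20, output_per_million=1.15)
QWEN36_27B = Pricing(input_per_million=0.29, output_per_million=2.40)

# Assumed workload, chosen only for illustration.
INPUT_TOKENS = 10_000_000
OUTPUT_TOKENS = 2_000_000

step_cost = STEP_37_FLASH.cost(INPUT_TOKENS, OUTPUT_TOKENS)
qwen_cost = QWEN36_27B.cost(INPUT_TOKENS, OUTPUT_TOKENS)

print(f"Step 3.7 Flash: ${step_cost:.2f}")
print(f"Qwen3.6 27B:    ${qwen_cost:.2f}")
```

Under this assumed workload the output price dominates the difference, so the gap between the two models grows as the share of output tokens grows.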

Who wins by task?

| Task | StepFun: Step 3.7 Flash | Qwen: Qwen3.6 27B |
| --- | --- | --- |
| SQL Generation | 163 | 164 |
| Code Review | 156 | 157 |
| Code Completion | 130 | 130 |
| Code Refactoring | 151 | 152 |
| Bug Fixing | 171 | 173 |
| Unit Test Generation | 146 | 146 |
| Code Documentation | 136 | 137 |
| Regex Writing | 135 | 135 |
| CI/CD Pipelines | 137 | 138 |
| Frontend Component Design | 142 | 143 |
| Data Analysis | 166 | 167 |
| CSV / Spreadsheet Cleanup | 143 | 143 |
| ETL Scripting | 144 | 145 |
| JSON Extraction | 143 | 143 |
| Bulk Data Labeling | 133 | 133 |
| OCR / Document Parsing | 139 | 140 |
| Table Extraction from PDFs | 139 | 140 |
| Long-Document Summarization | 148 | 150 |
| Blog Post Writing | 135 | 136 |

Scores reflect capability match, benchmark data, and pricing for each task.
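
To summarize the per-task scores, one option is to count wins and ties directly from the task table. The sketch below does only that: the `tally` function and its labels are hypothetical names introduced for this example, and it does not attempt to reproduce how the page's scores were computed.

```python
from collections import Counter

# (Step 3.7 Flash score, Qwen3.6 27B score) per task, copied from the task table.
TASK_SCORES = {
    "SQL Generation": (163, 164),
    "Code Review": (156, 157),
    "Code Completion": (130, 130),
    "Code Refactoring": (151, 152),
    "Bug Fixing": (171, 173),
    "Unit Test Generation": (146, 146),
    "Code Documentation": (136, 137),
    "Regex Writing": (135, 135),
    "CI/CD Pipelines": (137, 138),
    "Frontend Component Design": (142, 143),
    "Data Analysis": (166, 167),
    "CSV / Spreadsheet Cleanup": (143, 143),
    "ETL Scripting": (144, 145),
    "JSON Extraction": (143, 143),
    "Bulk Data Labeling": (133, 133),
    "OCR / Document Parsing": (139, 140),
    "Table Extraction from PDFs": (139, 140),
    "Long-Document Summarization": (148, 150),
    "Blog Post Writing": (135, 136),
}


def tally(scores: dict[str, tuple[int, int]]) -> Counter:
    """Count how many tasks each model wins outright and how many are tied."""
    counts: Counter = Counter()
    for step_score, qwen_score in scores.values():
        if step_score > qwen_score:
            counts["Step 3.7 Flash"] += 1
        elif qwen_score > step_score:
            counts["Qwen3.6 27B"] += 1
        else:
            counts["tie"] += 1
    return counts


result = tally(TASK_SCORES)
largest_gap = max(abs(s - q) for s, q in TASK_SCORES.values())

print(dict(result))
print("largest gap:", largest_gap)
```

On these numbers Qwen3.6 27B leads in 13 of the 19 tasks and the remaining 6 are tied, with no gap larger than 2 points, so the per-task differences are small relative to the price difference.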
