head-to-head

StepFun: Step 3.7 Flash vs Qwen: Qwen3.6 Flash

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-12.

StepFun: Step 3.7 Flash Qwen: Qwen3.6 Flash
Vendorstepfunqwen
Quality Score100100
Benchmark Score74.4-
Input Price$0.20/M$0.19/M
Output Price$1.15/M$1.12/M
Context Window256,0001,000,000
Max Output256,00065,536
Tool Calling
Structured Output
Reasoning Mode
Vision
Audio--
Benchmark Scores
ai_index70.3-
ai_index_agentic98.2-
ai_index_coding61.2-

Who wins by task?

TaskStepFun: Step 3.7 FlashQwen: Qwen3.6 Flash
SQL Generation 163 133
Code Review 156 132
Code Completion 130 131
Code Refactoring 151 136
Bug Fixing 171 136
Unit Test Generation 146 124
Code Documentation 136 131
Regex Writing 135 119
CI/CD Pipelines 137 120
Frontend Component Design 142 122
Data Analysis 166 124
CSV / Spreadsheet Cleanup 143 133
ETL Scripting 144 128
JSON Extraction 143 131
Bulk Data Labeling 133 129
OCR / Document Parsing 139 131
Table Extraction from PDFs 139 131
Long-Document Summarization 148 137
Short-Form Summarization 131 123
Blog Post Writing 135 121

Scores reflect capability match + benchmark data + pricing for each task. Methodology →

Related comparisons

Qwen: Qwen3.7 Plus vs StepFun: Step 3.7 Flash Qwen: Qwen3.7 Plus vs Qwen: Qwen3.6 Flash MiniMax: MiniMax M3 vs StepFun: Step 3.7 Flash MiniMax: MiniMax M3 vs Qwen: Qwen3.6 Flash StepFun: Step 3.7 Flash vs xAI: Grok Build 0.1 StepFun: Step 3.7 Flash vs Google: Gemini 3.5 Flash StepFun: Step 3.7 Flash vs Google: Gemini 3.1 Flash Lite StepFun: Step 3.7 Flash vs xAI: Grok 4.3