head-to-head

StepFun: Step 3.7 Flash vs Qwen: Qwen3.5-35B-A3B

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-17.

StepFun: Step 3.7 Flash Qwen: Qwen3.5-35B-A3B
Vendorstepfunqwen
Quality Score100100
Benchmark Score67.255.8
Input Price$0.20/M$0.14/M
Output Price$1.15/M$1.00/M
Context Window256,000262,144
Max Output256,00081,920
Tool Calling
Structured Output
Reasoning Mode
Vision
Audio--
Benchmark Scores
ai_index49.148.3
ai_index_agentic98.272.8
ai_index_coding61.249.9

Who wins by task?

TaskStepFun: Step 3.7 FlashQwen: Qwen3.5-35B-A3B
SQL Generation 161 155
Code Review 152 148
Code Completion 129 130
Code Refactoring 147 145
Bug Fixing 167 160
Unit Test Generation 143 139
Code Documentation 134 133
Regex Writing 133 131
CI/CD Pipelines 135 132
Frontend Component Design 140 137
Data Analysis 163 155
CSV / Spreadsheet Cleanup 142 140
ETL Scripting 141 138
JSON Extraction 143 141
Bulk Data Labeling 133 132
OCR / Document Parsing 138 137
Table Extraction from PDFs 138 137
Long-Document Summarization 145 143
Short-Form Summarization 130 129
Blog Post Writing 133 131

Scores reflect capability match + benchmark data + pricing for each task. Methodology →

Related comparisons

MoonshotAI: Kimi K2.7 Code vs StepFun: Step 3.7 Flash MoonshotAI: Kimi K2.7 Code vs Qwen: Qwen3.5-35B-A3B Qwen: Qwen3.7 Plus vs StepFun: Step 3.7 Flash Qwen: Qwen3.7 Plus vs Qwen: Qwen3.5-35B-A3B MiniMax: MiniMax M3 vs StepFun: Step 3.7 Flash MiniMax: MiniMax M3 vs Qwen: Qwen3.5-35B-A3B StepFun: Step 3.7 Flash vs xAI: Grok Build 0.1 StepFun: Step 3.7 Flash vs Google: Gemini 3.5 Flash