head-to-head

StepFun: Step 3.7 Flash vs Google: Gemma 4 26B A4B

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-12.

StepFun: Step 3.7 Flash Google: Gemma 4 26B A4B
Vendorstepfungoogle
Quality Score100100
Benchmark Score74.454.6
Input Price$0.20/M$0.06/M
Output Price$1.15/M$0.33/M
Context Window256,000262,144
Max Output256,000-
Tool Calling
Structured Output
Reasoning Mode
Vision
Audio--
Benchmark Scores
ai_index70.351.5
ai_index_agentic98.253.0
ai_index_coding61.237.0
eqbench-70.0

Who wins by task?

TaskStepFun: Step 3.7 FlashGoogle: Gemma 4 26B A4B
SQL Generation 163 156
Code Review 156 154
Code Completion 130 132
Code Refactoring 151 152
Bug Fixing 171 164
Unit Test Generation 146 141
Code Documentation 136 139
Regex Writing 135 132
CI/CD Pipelines 137 135
Frontend Component Design 142 138
Data Analysis 166 153
CSV / Spreadsheet Cleanup 143 141
ETL Scripting 144 143
JSON Extraction 143 139
Bulk Data Labeling 133 132
OCR / Document Parsing 139 137
Table Extraction from PDFs 139 137
Long-Document Summarization 148 151
Short-Form Summarization 131 130
Blog Post Writing 135 134

Scores reflect capability match + benchmark data + pricing for each task. Methodology →

Related comparisons

Qwen: Qwen3.7 Plus vs StepFun: Step 3.7 Flash Qwen: Qwen3.7 Plus vs Google: Gemma 4 26B A4B MiniMax: MiniMax M3 vs StepFun: Step 3.7 Flash MiniMax: MiniMax M3 vs Google: Gemma 4 26B A4B StepFun: Step 3.7 Flash vs xAI: Grok Build 0.1 StepFun: Step 3.7 Flash vs Google: Gemini 3.5 Flash StepFun: Step 3.7 Flash vs Google: Gemini 3.1 Flash Lite StepFun: Step 3.7 Flash vs xAI: Grok 4.3