head-to-head

StepFun: Step 3.7 Flash vs Google: Gemma 4 31B (free)

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-06-12.

StepFun: Step 3.7 Flash Google: Gemma 4 31B (free)
Vendorstepfungoogle
Quality Score100100
Benchmark Score74.4-
Input Price$0.20/MFree
Output Price$1.15/MFree
Context Window256,000262,144
Max Output256,00032,768
Tool Calling
Structured Output
Reasoning Mode
Vision
Audio--
Benchmark Scores
ai_index70.3-
ai_index_agentic98.2-
ai_index_coding61.2-

Who wins by task?

TaskStepFun: Step 3.7 FlashGoogle: Gemma 4 31B (free)
SQL Generation 163 131
Code Review 156 126
Code Completion 130 129
Code Refactoring 151 127
Bug Fixing 171 130
Unit Test Generation 146 121
Code Documentation 136 126
Regex Writing 135 120
CI/CD Pipelines 137 117
Frontend Component Design 142 122
Data Analysis 166 124
CSV / Spreadsheet Cleanup 143 128
ETL Scripting 144 122
JSON Extraction 143 132
Bulk Data Labeling 133 130
OCR / Document Parsing 139 128
Table Extraction from PDFs 139 128
Long-Document Summarization 148 129
Short-Form Summarization 131 124
Blog Post Writing 135 119

Scores reflect capability match + benchmark data + pricing for each task. Methodology →

Related comparisons

Qwen: Qwen3.7 Plus vs StepFun: Step 3.7 Flash Qwen: Qwen3.7 Plus vs Google: Gemma 4 31B (free) MiniMax: MiniMax M3 vs StepFun: Step 3.7 Flash MiniMax: MiniMax M3 vs Google: Gemma 4 31B (free) StepFun: Step 3.7 Flash vs xAI: Grok Build 0.1 StepFun: Step 3.7 Flash vs Google: Gemini 3.5 Flash StepFun: Step 3.7 Flash vs Google: Gemini 3.1 Flash Lite StepFun: Step 3.7 Flash vs xAI: Grok 4.3