head-to-head

Anthropic: Claude Sonnet 5 vs StepFun: Step 3.7 Flash

Side-by-side comparison of specs, pricing, benchmark scores, and task rankings. Updated 2026-07-01.

Anthropic: Claude Sonnet 5 StepFun: Step 3.7 Flash
Vendoranthropicstepfun
Quality Score100100
Benchmark Score-49.5
Input Price$2.00/M$0.20/M
Output Price$10.00/M$1.15/M
Context Window1,000,000256,000
Max Output128,000256,000
Tool Calling
Structured Output
Reasoning Mode
Vision
Audio--
Benchmark Scores
ai_index-49.1
ai_index_agentic-35.5
ai_index_coding-61.6

Who wins by task?

TaskAnthropic: Claude Sonnet 5StepFun: Step 3.7 Flash
SQL Generation 132 152
Code Review 132 145
Code Completion 117 129
Code Refactoring 136 143
Bug Fixing 136 154
Unit Test Generation 124 138
Code Documentation 129 132
Regex Writing 117 129
CI/CD Pipelines 120 131
Frontend Component Design 122 135
Data Analysis 124 149
CSV / Spreadsheet Cleanup 132 140
ETL Scripting 128 137
JSON Extraction 121 142
Bulk Data Labeling 118 133
OCR / Document Parsing 131 137
Table Extraction from PDFs 131 137
Long-Document Summarization 136 141
Short-Form Summarization 113 128
Blog Post Writing 120 129

Scores reflect capability match + benchmark data + pricing for each task. Methodology →

Related comparisons

Anthropic: Claude Sonnet 5 vs MoonshotAI: Kimi K2.7 Code Anthropic: Claude Sonnet 5 vs Qwen: Qwen3.7 Plus Anthropic: Claude Sonnet 5 vs MiniMax: MiniMax M3 Anthropic: Claude Sonnet 5 vs xAI: Grok Build 0.1 Anthropic: Claude Sonnet 5 vs Google: Gemini 3.5 Flash Anthropic: Claude Sonnet 5 vs Google: Gemini 3.1 Flash Lite Anthropic: Claude Sonnet 5 vs xAI: Grok 4.3 Anthropic: Claude Sonnet 5 vs Mistral: Mistral Medium 3.5