benchmarks
Top Models by Benchmark Score (2026)
Ranked by blended benchmark data from Aider Polyglot and Artificial Analysis Intelligence Index.
| # | Model | Blended |
|---|---|---|
| 1 | Anthropic: Claude Opus 4.7 | 86.5 |
| 2 | Google: Gemini 2.5 Pro | 83.1 |
| 3 | DeepSeek: DeepSeek V3 | 81.7 |
| 4 | Anthropic: Claude Sonnet 4.6 | 80.0 |
| 5 | xAI: Grok 4 | 79.6 |