Back to GalleryBack
Model Comparison
Model Comparison
Compare performance, benchmarks, and characteristics
claude-sonnet-4
anthropic
Context200K tokens
Input Price$3 / 1M tokens
Output Price$15 / 1M tokens
Loading comparison...
Compare performance, benchmarks, and characteristics
Loading comparison...
| Metric | ||
|---|---|---|
Pricing | ||
| Input Price | $3 / 1M tokens | $3 / 1M tokens |
| Output Price | $15 / 1M tokens | $12 / 1M tokens |
Capabilities | ||
| Context Window | 200K tokens | 128K tokens |
| Capabilities | tools | |
| Input type | text, image | text |
Category Scores | ||
| Overall Average | 64.9 | 57.2 |
| Maths | 74.3 | N/A |
| Finance | 65.4 | N/A |
| Science | 72.7 | 59.5 |
| Writing | 63.5 | 56.1 |
| Academia | 67.1 | 49.7 |
| Marketing | 66.4 | 63.7 |
| Programming | 45.1 | N/A |
Benchmark Tests | ||
| HLE | 9.6 | 4.9 |
| AIME | 77.3 | 60.3 |
| GPQA | 77.7 | 60.1 |
| MMLU | _ | 85.2 |
| SciCode | 40.0 | 32.3 |
| MATH-500 | 99.1 | 94.4 |
| MMLU-Pro | 84.2 | 74.2 |
| HumanEval | _ | 92.4 |
| LiveCodeBench | 65.5 | 57.6 |
| AA Math Index | 74.3 | _ |
| AA Coding Index | 45.1 | _ |
| AAII | 56.5 | 39.2 |