Model Comparison
Compare performance, benchmarks, and characteristics
claude-sonnet-4
anthropic
Context: 200K tokens
Input Price: $3 / 1M tokens
Output Price: $15 / 1M tokens
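To show how the per-token prices listed for claude-sonnet-4 translate into a bill, here is a minimal sketch. The function name `estimate_cost_usd` and the token counts are hypothetical, chosen only for illustration; the default prices are the $3 and $15 per 1M tokens shown in the listing.

```python
def estimate_cost_usd(input_tokens, output_tokens,
                      input_price_per_m=3.0, output_price_per_m=15.0):
    """Estimate request cost in USD from token counts.

    Prices are per 1M tokens; the defaults are the listed
    claude-sonnet-4 input and output prices.
    """
    input_cost = input_tokens / 1_000_000 * input_price_per_m
    output_cost = output_tokens / 1_000_000 * output_price_per_m
    return input_cost + output_cost


# Hypothetical request: 10,000 input tokens and 2,000 output tokens.
cost = estimate_cost_usd(10_000, 2_000)
print(f"${cost:.4f}")
```

Passing other prices as the last two arguments gives the same estimate for a different model.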
| Metric | | claude-sonnet-4 |
|---|---|---|
| Pricing | | |
| Input Price | $1 / 1M tokens | $3 / 1M tokens |
| Output Price | $3 / 1M tokens | $15 / 1M tokens |
| Capabilities | | |
| Context Window | 131,072 tokens | 200K tokens |
| Capabilities | tools | tools |
| Input type | text | text, image |
| Category Scores | | |
| Overall Average | 56.7 | 64.9 |
| Academia | 63.3 | 67.1 |
| Finance | 53.8 | 65.4 |
| Marketing | 57.6 | 66.4 |
| Maths | 57.3 | 74.3 |
| Programming | 38.1 | 45.1 |
| Science | 70.8 | 72.7 |
| Writing | 56.1 | 63.5 |
| Benchmark Tests | | |
| AIME | _ | 77.3 |
| AA Coding Index | 38.1 | 45.1 |
| AAII | 50.4 | 56.5 |
| AA Math Index | 57.3 | 74.3 |
| GPQA | 76.3 | 77.7 |
| HLE | 6.3 | 9.6 |
| HumanEval | 94.5 | _ |
| LiveCodeBench | 61.0 | 65.5 |
| MATH-500 | _ | 99.1 |
| MMLU | 90.2 | _ |
| MMLU-Pro | 82.2 | 84.2 |
| SciCode | 30.7 | 40.0 |
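The Overall Average row appears to be the unweighted mean of the seven category scores (Academia through Writing). The page does not state this; it is an inference from the numbers. A minimal check, with the hypothetical keys `left` and `right` standing for the two value columns of the table:

```python
from statistics import mean

# Category scores copied from the table, in row order:
# Academia, Finance, Marketing, Maths, Programming, Science, Writing.
category_scores = {
    "left": [63.3, 53.8, 57.6, 57.3, 38.1, 70.8, 56.1],
    "right": [67.1, 65.4, 66.4, 74.3, 45.1, 72.7, 63.5],
}

# Unweighted mean per column, rounded to one decimal like the table.
overall = {col: round(mean(scores), 1) for col, scores in category_scores.items()}
print(overall)
```

Both results match the listed Overall Average values (56.7 and 64.9), which is consistent with equal weighting across categories.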