Back to GalleryBack
Model Comparison
Model Comparison
Compare performance, benchmarks, and characteristics
claude-sonnet-4
anthropic
Context200K tokens
Input Price$3 / 1M tokens
Output Price$15 / 1M tokens
Loading comparison...
Compare performance, benchmarks, and characteristics
Loading comparison...
Metric | ||
---|---|---|
Pricing | ||
Input Price | $15 / 1M tokens | $3 / 1M tokens |
Output Price | $60 / 1M tokens | $15 / 1M tokens |
Capabilities | ||
Context Window | 200K tokens | 200K tokens |
Capabilities | tools | tools |
Input type | text, image | text, image |
Category Scores | ||
Overall Average | 64.8 | 53.5 |
Academia | 61.8 | 59.9 |
Finance | 66.0 | 55.7 |
Marketing | 57.1 | 30.9 |
Maths | 84.7 | 67.0 |
Programming | 51.9 | 41.1 |
Science | 76.4 | 75.4 |
Vision | 77.6 | 74.4 |
Writing | 43.0 | 23.7 |
Benchmark Tests | ||
AIME | 72.3 | 40.7 |
AA Coding Index | 51.9 | 41.1 |
AAII | 47.2 | 44.4 |
AA Math Index | 84.7 | 67.0 |
GPQA | 76.4 | 75.4 |
HLE | 7.7 | 4.0 |
HumanEval | 88.1 | _ |
LiveCodeBench | 67.9 | 44.9 |
MATH-500 | 97.0 | 93.4 |
MMLU | 91.8 | _ |
MMLU-Pro | 84.1 | 83.7 |
MMMU | 77.6 | 74.4 |
SciCode | 35.8 | 37.3 |