Model Comparison
Compare performance, benchmarks, and characteristics
Metric | ||
---|---|---|
Pricing | ||
Input Price | $2 / 1M tokens | $3 / 1M tokens |
Output Price | $8 / 1M tokens | $15 / 1M tokens |
Capabilities | ||
Context Window | 200K tokens | 256K tokens |
Capabilities | tools | tools |
Input type | text, image | text |
Category Scores | ||
Overall Average | 74.7 | 74.3 |
Academia | 74.3 | 76.4 |
Finance | 76.9 | 79.0 |
Marketing | 74.2 | 69.1 |
Maths | 88.3 | 92.7 |
Programming | 52.2 | 55.1 |
Science | 77.5 | 80.1 |
Vision | 82.9 | N/A |
Writing | 71.0 | 67.6 |
Benchmark Tests | ||
AIME | 90.3 | 94.3 |
AA Coding Index | 52.2 | 55.1 |
AAII | 65.5 | 65.3 |
AA Math Index | 88.3 | 92.7 |
GPQA | 83.0 | 87.6 |
HLE | 20.0 | 23.9 |
LiveCodeBench | 80.8 | 81.9 |
MATH-500 | 99.2 | 99.0 |
MMLU-Pro | 85.3 | 86.6 |
MMMU | 82.9 | N/A |
SciCode | 41.0 | 45.7 |
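
As a rough illustration of how the pricing rows translate into a per-request cost, and of how the Overall Average row appears to relate to the category scores, here is a minimal Python sketch. The labels `first` and `second` are placeholders for the two unnamed table columns, the token counts in the usage note are made up, and the averaging rule (an unweighted mean that skips the N/A entry) is an inference from the numbers rather than something the page states.

```python
# Prices in USD per 1M tokens, copied from the Pricing rows above.
# "first" and "second" are placeholder labels for the two table columns.
PRICES = {
    "first": {"input": 2.0, "output": 8.0},
    "second": {"input": 3.0, "output": 15.0},
}


def request_cost(model, input_tokens, output_tokens):
    """Return the USD cost of one request at the listed per-million-token rates."""
    p = PRICES[model]
    return (input_tokens * p["input"] + output_tokens * p["output"]) / 1_000_000


# Category scores in table order (Academia, Finance, Marketing, Maths,
# Programming, Science, Vision, Writing); None marks the N/A Vision entry.
CATEGORY_SCORES = {
    "first": [74.3, 76.9, 74.2, 88.3, 52.2, 77.5, 82.9, 71.0],
    "second": [76.4, 79.0, 69.1, 92.7, 55.1, 80.1, None, 67.6],
}


def mean_available(scores):
    """Unweighted mean over the scores that are present, rounded to one decimal.

    Assumption: this is how the Overall Average row is derived; the page
    does not document the rule.
    """
    present = [s for s in scores if s is not None]
    return round(sum(present) / len(present), 1)
```

For a hypothetical request with 10,000 input tokens and 2,000 output tokens, `request_cost("first", 10_000, 2_000)` comes to about $0.036 and `request_cost("second", 10_000, 2_000)` to about $0.06. Under the stated assumption, `mean_available` gives 74.7 and 74.3 for the two columns, matching the Overall Average row.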