Model Comparison
Compare performance, benchmarks, and characteristics
| Metric | ||
|---|---|---|
| **Pricing** | | |
| Input Price | $2 / 1M tokens | $2.5 / 1M tokens |
| Output Price | $8 / 1M tokens | $10 / 1M tokens |
| **Capabilities** | | |
| Context Window | 1,047,576 tokens | 128K tokens |
| Capabilities | tools | tools |
| Input type | text, image | text, image |
| **Category Scores** | | |
| Overall Average | 52.2 | 30.6 |
| Academia | 54.9 | 40.7 |
| Finance | 39.0 | 16.5 |
| Marketing | 59.0 | 34.4 |
| Maths | 34.7 | 6.0 |
| Programming | 32.2 | 24.0 |
| Science | 66.1 | 56.6 |
| Vision | 74.8 | N/A |
| Writing | 56.6 | 35.8 |
| **Benchmark Tests** | | |
| AIME | 43.7 | 15.0 |
| AA Coding Index | 32.2 | 24.0 |
| AAII | 43.4 | 27.0 |
| AA Math Index | 34.7 | 6.0 |
| GPQA | 66.5 | 54.3 |
| HLE | 4.6 | 3.3 |
| LiveCodeBench | 45.7 | 30.9 |
| MATH-500 | 91.3 | 75.9 |
| MMLU-Pro | 80.6 | 74.8 |
| MMMU | 74.8 | N/A |
| SciCode | 38.1 | 33.3 |
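
The per-token prices in the Pricing rows can be turned into a rough per-request cost. The sketch below is only an illustration: the names `model_a` and `model_b` are placeholders (the table leaves its two model columns unlabeled), and `estimate_cost` is a hypothetical helper, not part of any provider's API. It assumes billing is strictly linear in token count, with no caching discounts or minimum charges.

```python
from decimal import Decimal

# Prices in USD per 1M tokens, copied from the Pricing rows of the table.
# "model_a" and "model_b" are placeholder names (assumption): the source
# table does not label its two model columns.
PRICES_PER_MILLION = {
    "model_a": {"input": Decimal("2"), "output": Decimal("8")},
    "model_b": {"input": Decimal("2.5"), "output": Decimal("10")},
}


def estimate_cost(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated USD cost of one request, assuming linear pricing."""
    price = PRICES_PER_MILLION[model]
    total = price["input"] * input_tokens + price["output"] * output_tokens
    return total / Decimal(1_000_000)


if __name__ == "__main__":
    # Example workload: 100,000 input tokens and 20,000 output tokens.
    for name in PRICES_PER_MILLION:
        print(name, estimate_cost(name, 100_000, 20_000))
```

For that example workload the estimate comes to $0.36 for the first column and $0.45 for the second, so the first model is 20% cheaper at these list prices. `Decimal` is used instead of floats so that small currency amounts are not distorted by binary rounding.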