Model Comparison
Compare performance, benchmarks, and characteristics
| Metric | | |
|---|---|---|
| Pricing | | |
| Input Price | $2 / 1M tokens | $3 / 1M tokens | 
| Output Price | $8 / 1M tokens | $15 / 1M tokens | 
| Capabilities | | |
| Context Window | 200K tokens | 256K tokens | 
| Capabilities | tools | tools | 
| Input type | text, image | text | 
| Category Scores | | |
| Overall Average | 74.7 | 74.3 | 
| Academia | 74.3 | 76.4 | 
| Finance | 76.9 | 79.0 | 
| Marketing | 74.2 | 69.1 | 
| Maths | 88.3 | 92.7 | 
| Programming | 52.2 | 55.1 | 
| Science | 77.5 | 80.1 | 
| Vision | 82.9 | N/A | 
| Writing | 71.0 | 67.6 | 
| Benchmark Tests | | |
| AIME | 90.3 | 94.3 | 
| AA Coding Index | 52.2 | 55.1 | 
| AAII | 65.5 | 65.3 | 
| AA Math Index | 88.3 | 92.7 | 
| GPQA | 83.0 | 87.6 | 
| HLE | 20.0 | 23.9 | 
| LiveCodeBench | 80.8 | 81.9 | 
| MATH-500 | 99.2 | 99.0 | 
| MMLU-Pro | 85.3 | 86.6 | 
| MMMU | 82.9 | N/A |
| SciCode | 41.0 | 45.7 |
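
The Overall Average row appears to be the plain mean of the category scores that are available for each model, with N/A entries skipped. This is an inference from the numbers, not something the page states. The sketch below checks that reading and also shows how the per-token prices translate into a cost for one request. The labels `first` and `second` are placeholders for the two unnamed columns, and the token counts in the usage note are hypothetical.

```python
from statistics import mean

# Category scores copied from the table; None marks N/A.
# Tuple order is (first column, second column); the page does not name the models.
CATEGORY_SCORES = {
    "Academia": (74.3, 76.4),
    "Finance": (76.9, 79.0),
    "Marketing": (74.2, 69.1),
    "Maths": (88.3, 92.7),
    "Programming": (52.2, 55.1),
    "Science": (77.5, 80.1),
    "Vision": (82.9, None),
    "Writing": (71.0, 67.6),
}

# USD per 1M tokens as (input, output), from the Pricing rows.
# "first" and "second" are placeholder labels, not real model names.
PRICES = {"first": (2.0, 8.0), "second": (3.0, 15.0)}


def overall_average(column: int) -> float:
    """Mean of the available category scores for one column, rounded to 1 decimal."""
    scores = [row[column] for row in CATEGORY_SCORES.values() if row[column] is not None]
    return round(mean(scores), 1)


def request_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of one request at the listed per-1M-token prices."""
    input_price, output_price = PRICES[model]
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    print(overall_average(0), overall_average(1))
    print(request_cost("first", 10_000, 2_000), request_cost("second", 10_000, 2_000))
```

Under this reading, the first column averages eight categories and the second averages seven, because it has no Vision score. For a hypothetical request with 10,000 input tokens and 2,000 output tokens, the listed prices work out to roughly $0.036 for the first column and $0.06 for the second.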