Model Comparison
Compare performance, benchmarks, and characteristics
| Metric | | |
|---|---|---|
| Pricing | | |
| Input Price | $15 / 1M tokens | $3 / 1M tokens | 
| Output Price | $60 / 1M tokens | $15 / 1M tokens | 
| Capabilities | | |
| Context Window | 200K tokens | 256K tokens | 
| Capabilities | tools | tools | 
| Input type | text, image | text | 
| Category Scores | | |
| Overall Average | 65.1 | 74.3 | 
| Academia | 61.8 | 76.4 | 
| Marketing | 73.9 | 69.1 | 
| Programming | 38.6 | 55.1 | 
| Science | 72.3 | 80.1 | 
| Vision | 77.6 | N/A | 
| Writing | 66.5 | 67.6 | 
| Finance | N/A | 79.0 | 
| Maths | N/A | 92.7 | 
| Benchmark Tests | | |
| AIME | 72.3 | 94.3 | 
| AA Coding Index | 38.6 | 55.1 | 
| AAII | 47.2 | 65.3 | 
| AA Math Index | N/A | 92.7 | 
| GPQA | 76.4 | 87.6 | 
| HLE | 7.7 | 23.9 | 
| HumanEval | 88.1 | N/A | 
| LiveCodeBench | 67.9 | 81.9 | 
| MATH-500 | 97.0 | 99.0 | 
| MMLU | 91.8 | N/A | 
| MMLU-Pro | 84.1 | 86.6 | 
| MMMU | 77.6 | N/A | 
| SciCode | 35.8 | 45.7 |
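
The figures in the table can be cross-checked with a little arithmetic. The sketch below is in Python and rests on two assumptions. First, the page does not say how Overall Average is computed; the numbers are consistent with a plain mean over the category scores that are not N/A, so that rule is inferred, not documented. Second, the table header carries no model names, so the keys `first` and `second` are placeholders for the two data columns, and the function names are hypothetical.

```python
from statistics import mean

# Category scores copied from the table; None stands for an N/A cell.
# "first" and "second" are placeholder keys for the two unnamed columns.
CATEGORY_SCORES = {
    "first": {
        "Academia": 61.8, "Marketing": 73.9, "Programming": 38.6,
        "Science": 72.3, "Vision": 77.6, "Writing": 66.5,
        "Finance": None, "Maths": None,
    },
    "second": {
        "Academia": 76.4, "Marketing": 69.1, "Programming": 55.1,
        "Science": 80.1, "Vision": None, "Writing": 67.6,
        "Finance": 79.0, "Maths": 92.7,
    },
}

# (input price, output price) in USD per 1M tokens, from the Pricing rows.
PRICES = {"first": (15.0, 60.0), "second": (3.0, 15.0)}


def overall_average(scores):
    """Mean of the available category scores, rounded to one decimal.

    Assumption: N/A categories are skipped, not counted as zero.
    """
    available = [value for value in scores.values() if value is not None]
    return round(mean(available), 1)


def request_cost(input_tokens, output_tokens, input_price, output_price):
    """Cost in USD of one request, given per-1M-token prices."""
    return (input_tokens * input_price + output_tokens * output_price) / 1_000_000


if __name__ == "__main__":
    for column, scores in CATEGORY_SCORES.items():
        print(column, "overall average:", overall_average(scores))
    for column, (input_price, output_price) in PRICES.items():
        cost = request_cost(10_000, 2_000, input_price, output_price)
        print(column, f"cost for 10K input + 2K output tokens: ${cost:.2f}")
```

Under the skip-N/A assumption the computed averages match the Overall Average row (65.1 and 74.3). Because the two columns are averaged over different sets of categories (one lacks Finance and Maths, the other lacks Vision), the overall figures are not a like-for-like comparison.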