Back to GalleryBack
Model Comparison
Model Comparison
Compare performance, benchmarks, and characteristics
Loading comparison...
Compare performance, benchmarks, and characteristics
Loading comparison...
| Metric | ||
|---|---|---|
Pricing | ||
| Input Price | $2 / 1M tokens | $2 / 1M tokens |
| Output Price | $10 / 1M tokens | $8 / 1M tokens |
Capabilities | ||
| Context Window | 131072 tokens | 1047576 tokens |
| Capabilities | tools | tools |
| Input type | text | text, image |
Category Scores | ||
| Overall Average | 66.3 | 52.2 |
| Science | 64.7 | 66.1 |
| Vision | 66.1 | 74.8 |
| Writing | 68.2 | 56.6 |
| Academia | N/A | 54.9 |
| Finance | N/A | 39.0 |
| Marketing | N/A | 59.0 |
| Maths | N/A | 34.7 |
| Programming | N/A | 32.2 |
Benchmark Tests | ||
| AIME | _ | 43.7 |
| AA Coding Index | _ | 32.2 |
| AAII | _ | 43.4 |
| AA Math Index | _ | 34.7 |
| GPQA | 56.0 | 66.5 |
| HLE | _ | 4.6 |
| HumanEval | 88.4 | _ |
| LiveCodeBench | _ | 45.7 |
| MATH-500 | _ | 91.3 |
| MMLU | 87.5 | 90.2 |
| MMLU-Pro | 75.5 | 80.6 |
| MMMU | 66.1 | 74.8 |
| SciCode | _ | 38.1 |