Back to GalleryBack
Model Comparison
Model Comparison
Compare performance, benchmarks, and characteristics
Loading comparison...
Compare performance, benchmarks, and characteristics
Loading comparison...
| Metric | ||
|---|---|---|
Pricing | ||
| Input Price | $3 / 1M tokens | $3 / 1M tokens |
| Output Price | $12 / 1M tokens | $15 / 1M tokens |
Capabilities | ||
| Context Window | 128K tokens | 256K tokens |
| Capabilities | tools | |
| Input type | text | text |
Category Scores | ||
| Overall Average | 57.2 | 74.3 |
| Academia | 49.7 | 76.4 |
| Marketing | 63.7 | 69.1 |
| Science | 59.5 | 80.1 |
| Writing | 56.1 | 67.6 |
| Finance | N/A | 79.0 |
| Maths | N/A | 92.7 |
| Programming | N/A | 55.1 |
Benchmark Tests | ||
| AIME | 60.3 | 94.3 |
| AA Coding Index | _ | 55.1 |
| AAII | 39.2 | 65.3 |
| AA Math Index | _ | 92.7 |
| GPQA | 60.1 | 87.6 |
| HLE | 4.9 | 23.9 |
| HumanEval | 92.4 | _ |
| LiveCodeBench | 57.6 | 81.9 |
| MATH-500 | 94.4 | 99.0 |
| MMLU | 85.2 | _ |
| MMLU-Pro | 74.2 | 86.6 |
| SciCode | 32.3 | 45.7 |