gemini-1.5-flash-8b by gemini - AI Model Details, Pricing, and Performance Metrics

google
gemini-1.5-flash-8b
google

gemini-1.5-flash-8b

completions
bygemini

lightweight model, smaller and faster, lower price + higher rate limits + Lower latency on small prompts (compared to 1.5 Flash)

Released
Mar 15, 2024
Knowledge
Oct 1, 2024
License
Proprietary
Context
1M
Input
$0.04 / 1M tokens
Output
$0.15 / 1M tokens
Capabilities: tools
Accepts: text, image, audio, video
Returns: text

Access gemini-1.5-flash-8b through LangDB AI Gateway

Recommended

Integrate with google's gemini-1.5-flash-8b and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API
Cost Optimization
Enterprise Security
Get Started Now

Free tier available • No credit card required

Instant Setup
99.9% Uptime
10,000+Monthly Requests
Request Volume
Daily API requests
78
Performance (TPS)
Tokens per second
4138.84 tokens/s

Category Scores

Benchmark Tests

View Other Benchmarks
AIME
3.3
Mathematics
AA Coding Index
22.3
Programming
AAII
16.3
General
GPQA
37.1
STEM (Physics, Chemistry, Biology)
HLE
4.5
General Knowledge
LiveCodeBench
21.7
Programming
MATH-500
68.9
Mathematics
MMLU-Pro
57.8
General Knowledge
MMMU
53.7
General Knowledge
SciCode
22.9
Scientific

Code Examples

Integration samples and API usage