gemini-2.5-flash-lite

completions

byopenrouter

gemini-2.5-flash-lite

completions

Published by: googleProvider:

openrouter

Gemini 2.5 Flash-Lite is a lightweight reasoning model in the Gemini 2.5 family, optimized for ultra-low latency and cost efficiency. It offers improved throughput, faster token generation, and better performance across common benchmarks compared to earlier Flash models. By default, "thinking" (i.e. multi-pass reasoning) is disabled to prioritize speed, but developers can enable it via the [Reasoning API parameter](https://openrouter.ai/docs/use-cases/reasoning-tokens) to selectively trade off cost for intelligence.

Released

Jun 17, 2025

Knowledge

Jan 1, 2025

License

CC-BY-4.0

Context

1048576

Input

$0.1 / 1M tokens

Output

$0.4 / 1M tokens

Capabilities: tools

Accepts: text, image

Returns: text

Released Jun 17, 2025Knowledge Cutoff: Jan 1, 2025License: CC-BY-4.0

Context: 1048576 Input: $0.1 / 1M tokensOutput: $0.4 / 1M tokensCapabilities: toolsAccepts: text, imageReturns: text

Access gemini-2.5-flash-lite through LangDB AI Gateway

Recommended

Integrate with google's gemini-2.5-flash-lite and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API

Cost Optimization

Enterprise Security

Get Started Now

Free tier available • No credit card required

Instant Setup

99.9% Uptime

10,000+Monthly Requests

Benchmark Results for gemini-2.5-flash-lite

Overall Performance: 40.70249999999999 average score across all categories

Detailed Benchmark Scores:

Benchmark	Score	Percentile	Domain
HLE	3.70	Top 88%	General Knowledge
AIME	50.00	Top 32%	Mathematics
GPQA	64.60	Top 60%	STEM (Physics, Chemistry, Biology)
MMMU	72.90	Top 54%	General Knowledge
SciCode	17.70	Top 86%	Scientific
MATH-500	92.60	Top 40%	Mathematics
MMLU-Pro	72.40	Top 67%	General Knowledge
LiveCodeBench	40.00	Top 55%	Programming
AA Math Index	35.30	Top 64%	Mathematics
AA Coding Index	7.40	Top 92%	Programming
AAII	12.70	Top 80%	General

GPQA Score: 64.60 - Graduate-level reasoning benchmark

Model Comparison:

Provider: openrouter

Model Type: completions

Context Size: 1048576 tokens

Comparing against 348 models in the database

Category Scores

Benchmark Tests

View Other Benchmarks

HLE

3.7

General Knowledge

AIME

50.0

Mathematics

GPQA

64.6

STEM (Physics, Chemistry, Biology)

MMMU

72.9

General Knowledge

SciCode

17.7

Scientific

MATH-500

92.6

Mathematics

MMLU-Pro

72.4

General Knowledge

LiveCodeBench

40.0

Programming

AA Math Index

35.3

Mathematics

AA Coding Index

7.4

Programming

AAII

12.7

General

Metric	HLE	AIME	GPQA	MMMU	SciCode	MATH-500	MMLU-Pro	LiveCodeBench	AA Math Index	AA Coding Index	AAII
Score	3.7	50.0	64.6	72.9	17.7	92.6	72.4	40.0	35.3	7.4	12.7

Compare with Similar Models

claude-opus-4.5

claude-opus-4.6

gemini-3-flash-preview

claude-sonnet-4.5

gemini-3-pro-preview

claude-sonnet-4

Code Examples

Integration samples and API usage

Code Samples for gemini-2.5-flash-lite

Python SDK Example:

from openai import OpenAI

client = OpenAI(
    base_url="https://api.langdb.ai/projects/<your_project_id>",
    api_key="<your_api_key>"
)

response = client.chat.completions.create(
    model="gemini-2.5-flash-lite",
    messages=[
        {"role": "user", "content": "Hello, how are you?"}
    ]
)

print(response.choices[0].message.content)

TypeScript SDK Example:

import OpenAI from 'openai';

const client = new OpenAI({
    baseURL: "https://api.langdb.ai/projects/<your_project_id>",
    apiKey: "<your_api_key>"
});

const response = await client.chat.completions.create({
    model: "gemini-2.5-flash-lite",
    messages: [
        { role: "user", content: "Hello, how are you?" }
    ]
});

console.log(response.choices[0].message.content);

cURL Example:

curl -X POST "https://api.langdb.ai/projects/<your_project_id>/v1/chat/completions" \
  -H "Authorization: Bearer <your_api_key>" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gemini-2.5-flash-lite",
    "messages": [
        {"role": "user", "content": "Hello, how are you?"}
    ]
  }'

Model: gemini-2.5-flash-lite

Provider: openrouter

API Endpoint: $https://api.langdb.ai

Create API Key

Related Models

Similar models from openrouter

gemini-2.5-flash-lite

gemini-2.5-flash-lite

Access gemini-2.5-flash-lite through LangDB AI Gateway

Category Scores

Benchmark Tests

Compare with Similar Models

Code Examples

Related Models

aion-1.0

aion-1.0-mini

aion-2.0

aion-rp-llama-3.1-8b

coder-large

cogito-v2.1-671b

gemini-2.5-flash-lite by openrouter - AI Model Details, Pricing, and Performance Metrics

gemini-2.5-flash-lite

gemini-2.5-flash-lite

Access gemini-2.5-flash-lite through LangDB AI Gateway

Category Scores

Benchmark Tests

Compare with Similar Models

Code Examples

Related Models

aion-1.0

aion-1.0-mini

aion-2.0

aion-rp-llama-3.1-8b

coder-large

cogito-v2.1-671b