gpt-oss-20b by deepinfra - AI Model Details, Pricing, and Performance Metrics
gpt-oss-20b
Type: completions
gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 license. It uses a Mixture-of-Experts (MoE) architecture with 3.6B active parameters per forward pass, optimized for lower-latency inference and deployability on consumer or single-GPU hardware. The model is trained in OpenAI’s Harmony response format and supports reasoning level configuration, fine-tuning, and agentic capabilities including function calling, tool use, and structured outputs.
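As a rough illustration of the reasoning-level configuration mentioned above, the sketch below sends a chat completion request through an OpenAI-compatible endpoint with the reasoning level set in the system prompt. The base URL and model identifier are placeholders, and the system-prompt convention follows OpenAI's published gpt-oss guidance; check your provider's documentation for the exact mechanism it exposes.

```python
# Minimal sketch: requesting a higher reasoning level from gpt-oss-20b through an
# OpenAI-compatible chat completions endpoint. The base URL is a placeholder, and the
# "Reasoning: high" system-prompt convention follows OpenAI's gpt-oss guidance;
# your provider may expose a dedicated parameter instead.
from openai import OpenAI

client = OpenAI(
    base_url="https://example-provider.com/v1",  # placeholder endpoint
    api_key="YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="openai/gpt-oss-20b",  # model identifier may differ per provider
    messages=[
        {"role": "system", "content": "Reasoning: high"},
        {"role": "user", "content": "Explain why a 21B MoE model can run on a single GPU."},
    ],
)
print(response.choices[0].message.content)
```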
| Provider | Input | Output |
|---|---|---|
| | $0.04 / 1M tokens | $0.16 / 1M tokens |
| | $0.07 / 1M tokens | $0.30 / 1M tokens |
| Provider | Context | Input Price | Output Price | Input Formats | Output Formats | License |
|---|---|---|---|---|---|---|
| | 131,072 tokens | $0.04 / 1M tokens | $0.16 / 1M tokens | text | text | Apache-2.0 |
| | 131,072 tokens | $0.07 / 1M tokens | $0.30 / 1M tokens | text | text | Apache-2.0 |
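For a quick sense of what these rates mean in practice, the short sketch below estimates the cost of a single request at the cheaper listed tier ($0.04 per 1M input tokens, $0.16 per 1M output tokens); the token counts are made-up example values.

```python
# Back-of-the-envelope cost estimate using the cheaper listed rates
# ($0.04 per 1M input tokens, $0.16 per 1M output tokens).
INPUT_RATE = 0.04 / 1_000_000   # USD per input token
OUTPUT_RATE = 0.16 / 1_000_000  # USD per output token

def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Return the estimated request cost in USD."""
    return input_tokens * INPUT_RATE + output_tokens * OUTPUT_RATE

# Example: a 2,000-token prompt with a 500-token completion.
print(f"${estimate_cost(2_000, 500):.6f}")  # -> $0.000160
```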
Access gpt-oss-20b through LangDB AI Gateway
Integrate with OpenAI's gpt-oss-20b and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.
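Assuming the gateway exposes an OpenAI-compatible endpoint, a minimal integration could look like the sketch below, which also reads back the token usage that usage and cost monitoring are built on. The base URL and model identifier are placeholders; take the real values from the gateway's documentation.

```python
# Hedged sketch: routing a request through an OpenAI-compatible gateway (e.g. LangDB
# AI Gateway) and reading the token usage returned with the response.
# The base URL and model identifier are placeholders.
from openai import OpenAI

client = OpenAI(
    base_url="https://your-langdb-gateway.example/v1",  # placeholder gateway endpoint
    api_key="YOUR_GATEWAY_API_KEY",
)

resp = client.chat.completions.create(
    model="openai/gpt-oss-20b",  # identifier may differ behind the gateway
    messages=[{"role": "user", "content": "Give me three test ideas for a rate limiter."}],
)

usage = resp.usage  # prompt_tokens / completion_tokens / total_tokens
print(resp.choices[0].message.content)
print(f"prompt={usage.prompt_tokens}, completion={usage.completion_tokens}, total={usage.total_tokens}")
```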
Benchmark Tests
| Benchmark | Score |
|---|---|
| AA Coding Index | 53.7 |
| AAII | 44.8 |
| AA Math Index | 61.7 |
| GPQA | 71.5 |
| HLE | 8.5 |
| LiveCodeBench | 72.1 |
| MMLU-Pro | 73.6 |
| SciCode | 35.4 |
Compare with Similar Models
Code Examples
Integration samples and API usage
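No provider-specific sample is reproduced here, but since the model description above lists function calling among the model's capabilities, the following is a hedged sketch of a tool-calling request against an OpenAI-compatible endpoint serving gpt-oss-20b. The endpoint URL, model identifier, and the get_weather tool are illustrative assumptions, not values confirmed by any provider.

```python
# Illustrative tool-calling request against an OpenAI-compatible endpoint serving
# gpt-oss-20b. The endpoint, model ID, and the get_weather tool are hypothetical.
import json
from openai import OpenAI

client = OpenAI(
    base_url="https://api.example-provider.com/v1/openai",  # placeholder endpoint
    api_key="YOUR_API_KEY",
)

tools = [{
    "type": "function",
    "function": {
        "name": "get_weather",  # hypothetical tool
        "description": "Get the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}]

resp = client.chat.completions.create(
    model="openai/gpt-oss-20b",
    messages=[{"role": "user", "content": "What's the weather in Lisbon?"}],
    tools=tools,
)

msg = resp.choices[0].message
if msg.tool_calls:
    # The model chose to call a tool; print the requested function and its arguments.
    call = msg.tool_calls[0]
    print(call.function.name, json.loads(call.function.arguments))
else:
    # No tool call was made; fall back to the plain text reply.
    print(msg.content)
```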
Related Models
Similar models from deepinfra