gpt-oss-20b by deepinfra - AI Model Details, Pricing, and Performance Metrics

openai
gpt-oss-20b
openai

gpt-oss-20b

completions
On:deepinfrafireworksai

gpt-oss-20b is an open-weight 21B parameter model released by OpenAI under the Apache 2.0 license. It uses a Mixture-of-Experts (MoE) architecture with 3.6B active parameters per forward pass, optimized for lower-latency inference and deployability on consumer or single-GPU hardware. The model is trained in OpenAI’s Harmony response format and supports reasoning level configuration, fine-tuning, and agentic capabilities including function calling, tool use, and structured outputs.

ProviderInputOutput
deepinfra
deepinfra
$0.04 / 1M tokens$0.16 / 1M tokens
fireworksai
fireworksai
$0.07 / 1M tokens$0.3 / 1M tokens
Released
Aug 5, 2025
Knowledge
Feb 6, 2025
Context
131072
Input
$0.04 / 1M tokens
Output
$0.16 / 1M tokens
Capabilities: tools
Accepts: text
Returns: text

Access gpt-oss-20b through LangDB AI Gateway

Recommended

Integrate with openai's gpt-oss-20b and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API
Cost Optimization
Enterprise Security
Get Started Now

Free tier available • No credit card required

Instant Setup
99.9% Uptime
10,000+Monthly Requests
Available from 2 providers
Provider:

Category Scores

Benchmark Tests

View Other Benchmarks
AA Coding Index
53.7
Programming
AAII
44.8
General
AA Math Index
61.7
Mathematics
GPQA
71.5
STEM (Physics, Chemistry, Biology)
HLE
8.5
General Knowledge
LiveCodeBench
72.1
Programming
MMLU-Pro
73.6
General Knowledge
SciCode
35.4
Scientific

Code Examples

Integration samples and API usage