anthropic

claude-sonnet-4

completions

Our high-performance model with exceptional reasoning and efficiency

Input:$3 / 1M tokens$0.3 / 1M tokenscached read$3.75 / 1M tokenscached write
Output:$15 / 1M tokens
Context:200K tokens
tools
text
image
text
Category Rankings
programming#18
vision#9

Access claude-sonnet-4 through LangDB AI Gateway

Recommended

Integrate with anthropic's claude-sonnet-4 and 250+ other models through a unified API. Monitor usage, control costs, and enhance security.

Unified API
Cost Optimization
Enterprise Security
Get Started Now

Free tier available • No credit card required

Instant Setup
99.9% Uptime
10,000+Monthly Requests
Code Example
Configuration
Base URL
API Keys
Headers
Project ID in header
X-Run-Id
X-Thread-Id
Model Parameters
6 available
include_reasoning
max_tokens
stop
temperature
012
tool_choice
tools
Additional Configuration
Tools
Guards
User:
Id:
Name:
Tags:
Publicly Shared Threads5
  • anthropic
    Detailed 3-step weighing strategy to find the single lighter counterfeit coin among 12 identical coins using balance scales with three weighings.
    12 coin problem
    counterfeit coin detection
    balance scale strategy
    3 weighings solution
  • anthropic
    A complete Python Flask To-Do list API with CRUD endpoints, in-memory storage, input validation, error handling, and example usage instructions.
    python flask todo api
    restful todo list api
    in-memory todo api
    flask api error handling
  • anthropic
    Created a responsive nav bar with a hamburger menu that toggles links on screens under 600px using HTML, CSS, and vanilla JS.
    responsive navigation bar
    hamburger menu css
    mobile navigation toggle
    vanilla javascript navbar
  • anthropic
    Solved ages of siblings Emily and Derek using algebra: Emily is 16 and Derek is 8, verified with given age conditions.
    age algebra problem
    sibling age word problem
    current and past ages
    solving system of equations
  • anthropic
    Defines "ubiquitous" as an adjective meaning present everywhere, exemplified by a sentence about smartphones being found in nearly all public spaces.
    ubiquitous definition
    ubiquitous example sentence
    meaning of ubiquitous
    ubiquitous usage
Popular Models10
  • deepseek
    DeepSeek V3, a 685B-parameter, mixture-of-experts model, is the latest iteration of the flagship chat model family from the DeepSeek team. It succeeds the [DeepSeek V3](/deepseek/deepseek-chat-v3) model and performs really well on a variety of tasks.
    Input:$0.24 / 1M tokens
    Output:$0.82 / 1M tokens
    Context:163840 tokens
    tools
    text
    text
  • openai
    GPT-4o mini (o for omni) is a fast, affordable small model for focused tasks. It accepts both text and image inputs, and produces text outputs (including Structured Outputs). It is ideal for fine-tuning, and model outputs from a larger model like GPT-4o can be distilled to GPT-4o-mini to produce similar results at lower cost and latency.The knowledge cutoff for GPT-4o-mini models is October, 2023.
    Input:$0.15 / 1M tokens
    Output:$0.6 / 1M tokens
    Context:128K tokens
    tools
    text
    image
    text
  • z-ai
    GLM-4.5 is our latest flagship foundation model, purpose-built for agent-based applications. It leverages a Mixture-of-Experts (MoE) architecture and supports a context length of up to 128k tokens. GLM-4.5 delivers significantly enhanced capabilities in reasoning, code generation, and agent alignment. It supports a hybrid inference mode with two options, a "thinking mode" designed for complex reasoning and tool use, and a "non-thinking mode" optimized for instant responses. Users can control the reasoning behaviour with the `reasoning` `enabled` boolean. [Learn more in our docs](https://openrouter.ai/docs/use-cases/reasoning-tokens#enable-reasoning-with-default-config)
    Input:$0.45 / 1M tokens
    Output:$1.63 / 1M tokens
    Context:98304 tokens
    tools
    text
    text
  • langdb
    Gemma 3 introduces multimodality, supporting vision-language input and text outputs. It handles context windows up to 128k tokens, understands over 140 languages, and offers improved math, reasoning, and chat capabilities, including structured outputs and function calling. Gemma 3 27B is Google's latest open source model, successor to [Gemma 2](google/gemma-2-27b-it)
    Input:$0.1 / 1M tokens
    Output:$0.29 / 1M tokens
    Context:96K tokens
    text
    image
    text
  • deepseek
    May 28th update to the [original DeepSeek R1](/deepseek/deepseek-r1) Performance on par with [OpenAI o1](/openai/o1), but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active in an inference pass. Fully open-source model.
    Input:$0.39 / 1M tokens
    Output:$1.63 / 1M tokens
    Context:163840 tokens
    text
    text
  • qwen
    Qwen3-235B-A22B-Instruct-2507 is a multilingual, instruction-tuned mixture-of-experts language model based on the Qwen3-235B architecture, with 22B active parameters per forward pass. It is optimized for general-purpose text generation, including instruction following, logical reasoning, math, code, and tool usage. The model supports a native 262K context length and does not implement "thinking mode" (<think> blocks). Compared to its base variant, this version delivers significant gains in knowledge coverage, long-context reasoning, coding benchmarks, and alignment with open-ended tasks. It is particularly strong on multilingual understanding, math reasoning (e.g., AIME, HMMT), and alignment evaluations like Arena-Hard and WritingBench.
    Input:$0.13 / 1M tokens
    Output:$0.66 / 1M tokens
    Context:262144 tokens
    text
    text
  • anthropic
    Our high-performance model with exceptional reasoning and efficiency
    Input:$3 / 1M tokens
    Output:$15 / 1M tokens
    Context:200K tokens
    tools
    text
    image
    text
  • anthropic
    claude-opus-4
    anthropic
    Our most capable and intelligent model yet. Claude Opus 4 sets new standards in complex reasoning and advanced coding
    Input:$15 / 1M tokens
    Output:$75 / 1M tokens
    Context:200K tokens
    tools
    text
    image
    text
  • gemini
    Gemini 2.5 Pro is our most advanced reasoning Gemini model, capable of solving complex problems.
    Input:$1.25 / 1M tokens
    Output:$10 / 1M tokens
    Context:1M tokens
    tools
    text
    image
    audio
    video
    text
  • openai
    gpt-4.1
    openai
    GPT-4.1 is OpenAI's flagship model for complex tasks. It is well suited for problem solving across domains.
    Input:$2 / 1M tokens
    Output:$8 / 1M tokens
    Context:1047576 tokens
    tools
    text
    image
    text