Smart Routing

    We Pick the Best Model for You

    GateRouter's smart routing analyzes your task and automatically selects the optimal model — no manual selection needed.

    GPT-5.4
    OpenAI

    Frontier reasoning, coding, and agents. 1M+ context window.

    Best For: Complex Tasks | Code | Agents
    $2.50/1M in | $20.00/1M out
    GPT-5.4 Pro
    OpenAI

    Maximum reasoning depth for high-stakes professional work.

    Best For: Finance | Legal | Research
    $30.00/1M in | $180.00/1M out
    GPT-5.3 Codex
    OpenAI

    Purpose-built for code. Multi-file editing and dev workflows.

    Best For: Code | Refactoring | Multi-file
    $3.00/1M in | $15.00/1M out
    GPT-5.2
    OpenAI

    Previous flagship. Strong reasoning at a lower price.

    Best For: Reasoning | Code | Budget Pick
    $1.75/1M in | $14.00/1M out
    GPT-5
    OpenAI

    Reliable all-rounder for everyday tasks.

    Best For: Everyday Tasks | Code | General
    $1.25/1M in | $10.00/1M out
    GPT-5 Mini
    OpenAI

    Good reasoning at one-fifth the cost of GPT-5.

    Best For: Medium Tasks | Cost Saver | Versatile
    $0.25/1M in | $2.00/1M out
    GPT-5 Nano
    OpenAI

    Cheapest GPT. Built for high-volume simple jobs.

    Best For: Classification | Extraction | Bulk Jobs
    $0.05/1M in | $0.40/1M out
    GPT-4.1
    OpenAI

    Stable code generation with predictable output.

    Best For: Production Code | Stable Output | Reliable
    $2.00/1M in | $8.00/1M out
    GPT-4.1 Nano
    OpenAI

    Fastest GPT. Sub-second responses.

    Best For: Fastest | Tagging | Data Extraction
    $0.10/1M in | $0.40/1M out
    Claude Opus 4.6
    Anthropic

    Deepest reasoning in the Claude family. 200K context.

    Best For: Long Reports | Nuanced Writing | Precision
    $5.00/1M in | $25.00/1M out
    Claude Sonnet 4.6
    Anthropic

    Best balance of quality and cost from Anthropic.

    Best For: Writing | Code Review | Analysis
    $3.00/1M in | $15.00/1M out
    Claude Sonnet 4.5
    Anthropic

    Previous-gen Sonnet. Still strong for long-form content.

    Best For: Long Content | Creative Writing | Fallback
    $3.00/1M in | $15.00/1M out
    Claude Haiku 4.5
    Anthropic

    Fast and affordable. Handles simple tasks well.

    Best For: Quick Q&A | Classification | Cheap
    $1.00/1M in | $5.00/1M out
    Gemini 3.1 Pro
    Google

    Google's latest. Strong reasoning with native image and video.

    Best For: Images | Video | Multimodal
    $2.00/1M in | $12.00/1M out
    Gemini 2.5 Pro
    Google

    Massive context window. Great for video understanding.

    Best For: Video | Large Context | Multimodal
    $1.25/1M in | $2.50/1M out
    Gemini 2.0 Flash
    Google

    Extremely affordable. Built for high throughput.

    Best For: Batch Jobs | High Volume | Cheapest
    $0.15/1M in | $0.60/1M out
    DeepSeek V3.2
    DeepSeek

    Best value for code and math. Fraction of flagship cost.

    Best For: Code | Math | Best Value
    $0.28/1M in | $0.42/1M out
    DeepSeek V3.1
    DeepSeek

    Budget-friendly general reasoning.

    Best For: Budget Reasoning | Code | Everyday
    $0.30/1M in | $1.00/1M out
    Grok 4
    xAI

    Real-time knowledge access. Strong reasoning.

    Best For: Live Data | Current Events | Analysis
    $3.00/1M in | $15.00/1M out
    Grok 4.1 Fast
    xAI

    Speed-optimized with real-time data access.

    Best For: Quick Lookups | Live Info | Fast
    $0.20/1M in | $0.50/1M out
    Kimi K2.5
    Moonshot

    Top Chinese-English bilingual. Strong long context.

    Best For: Chinese | Bilingual | Long Context
    $0.50/1M in | $2.60/1M out
    Qwen3 235B
    Alibaba

    Strong at code, math, and multilingual tasks.

    Best For: Code | Math | Multilingual
    $0.22/1M in | $0.88/1M out
    GLM-5
    Zhipu AI

    Competitive general-purpose Chinese model.

    Best For: Chinese | General | Balanced
    $1.20/1M in | $3.50/1M out
    GLM-4.7 Flash
    Zhipu AI

    Ultra-fast for Chinese. Very cheap.

    Best For: Chinese | Fast | Cheap
    $0.13/1M in | $0.50/1M out
    Seed 1.6
    ByteDance

    Good general performance at low cost.

    Best For: Budget | General | Versatile
    $0.25/1M in | $2.00/1M out
    MiniMax M2.5
    MiniMax

    Built for conversation and bilingual chat.

    Best For: Chat | Role-play | Bilingual
    $0.30/1M in | $1.20/1M out
    Why separate Input and Output pricing?

    Why separate Input and Output pricing? Input is the prompt you send, Output is the model's response — generation requires more compute than reading, so Output typically costs more.