LLM Gateway
    • Docs
    • Pricing
    • Pricing
    • Docs
    • Models
    1.1k
    Log InGet Started

    Models

    Comprehensive list of all supported models and their providers

    Compare

    Use Case

    Capabilities

    Provider

    Input Price ($/M tokens)

    Output Price ($/M tokens)

    Context Size (tokens)

    227
    Models
    30
    Providers
    107
    Vision Models
    143
    Tool-enabled
    3
    Free Models
    Features
    OpenAI
    gpt-4.1-mini
    $0.40$1.60$0.10
    Azure
    gpt-4.1-mini
    $0.40$1.60$0.10
    OpenAI
    gpt-4.1
    $2.00$8.00$0.50
    Azure
    gpt-4.1
    $2.00$8.00$0.50
    Nebius AI
    llama-3.1-nemotron-ultra-253b
    $0.60$1.80—
    AWS Bedrock
    llama-4-maverick-17b-instruct
    $0.24$0.97—
    NovitaAI
    llama-4-maverick-17b-instruct
    $0.27$0.85—
    AWS Bedrock
    llama-4-scout-17b-instruct
    $0.17$0.66—
    NovitaAI
    llama-4-scout-17b-instruct
    $0.18$0.59—
    Together AI
    llama-4-scout
    $0.18$0.59—
    NovitaAI
    llama-3-8b-instruct
    $0.04$0.04—
    Alibaba Cloud
    qwen-omni-turbo
    $0.20$0.16
    -20% off
    $0.80$0.64
    -20% off
    —
    Google AI Studio
    gemini-2.5-pro
    $1.25$10.00$0.13
    Google Vertex AI
    gemini-2.5-pro
    $1.25$10.00$0.13
    Nebius AI
    gemma-3-27b
    $0.27$0.27—
    Google AI Studio
    gemma-3-1b-it
    $0.07$0.30—
    Google AI Studio
    gemma-3-12b-it
    $0.07$0.30—
    Google AI Studio
    gemma-3-4b-it
    $0.07$0.30—
    Perplexity
    sonar-pro
    $3.00$15.00—
    Perplexity
    sonar-reasoning-pro
    $2.00$8.00—
    Alibaba Cloud
    qwq-plus
    $0.80$0.64
    -20% off
    $2.40$1.92
    -20% off
    —
    Alibaba Cloud(singapore)
    qwq-plus
    $0.80$0.64
    -20% off
    $2.40$1.92
    -20% off
    —
    Alibaba Cloud(cn-beijing)
    qwq-plus
    $0.23$0.18
    -20% off
    $0.57$0.46
    -20% off
    —
    Z AI
    cogview-4
    $0.010/req——
    Nebius AI
    qwen-qwq-32b
    $0.15$0.45—
    Anthropic
    claude-3-7-sonnet
    $3.00$15.00$0.30
    Alibaba Cloud
    qwen2-5-vl-32b-instruct
    $1.40$1.12
    -20% off
    $4.20$3.36
    -20% off
    —
    Anthropic
    claude-3-7-sonnet-20250219
    $3.00$15.00$0.30
    xAI
    grok-3
    $3.00$15.00—
    Alibaba Cloud
    qwen-vl-plus
    $0.21$0.17
    -20% off
    $0.64$0.51
    -20% off
    —
    Alibaba Cloud
    qwen-vl-max
    $0.80$0.64
    -20% off
    $3.20$2.56
    -20% off
    —
    Alibaba Cloud
    qwen-turbo
    $0.05$0.04
    -20% off
    $0.20$0.16
    -20% off
    —
    Alibaba Cloud(singapore)
    qwen-turbo
    $0.05$0.04
    -20% off
    $0.20$0.16
    -20% off
    —
    Alibaba Cloud(cn-beijing)
    qwen-turbo
    $0.04$0.04
    -20% off
    $0.09$0.07
    -20% off
    —
    NovitaAI
    qwen3-coder-480b-a35b-instruct
    $0.30$1.30—
    CanopyWave
    qwen3-coder-480b-a35b-instruct
    $0.30$0.21
    -30% off
    $1.30$0.91
    -30% off
    —
    Nebius AI
    qwen3-coder-480b-a35b-instruct
    $0.40$1.80—
    Nebius AI
    qwen2-5-vl-72b-instruct
    $0.13$0.40—
    Alibaba Cloud
    qwen-plus
    $0.40$0.32
    -20% off
    $1.20$0.96
    -20% off
    $0.08$0.06
    -20% off
    Alibaba Cloud(singapore)
    qwen-plus
    $0.40$0.32
    -20% off
    $1.20$0.96
    -20% off
    $0.08$0.06
    -20% off
    Alibaba Cloud(us-virginia)
    qwen-plus
    $0.12$0.09
    -20% off
    $0.29$0.23
    -20% off
    $0.02$0.02
    -20% off
    Alibaba Cloud(cn-beijing)
    qwen-plus
    $0.12$0.09
    -20% off
    $0.29$0.23
    -20% off
    $0.02$0.02
    -20% off
    Alibaba Cloud
    qwen-max-latest
    $1.60$1.28
    -20% off
    $6.40$5.12
    -20% off
    —
    Alibaba Cloud(singapore)
    qwen-max-latest
    $1.60$1.28
    -20% off
    $6.40$5.12
    -20% off
    —
    Alibaba Cloud(cn-beijing)
    qwen-max-latest
    $0.34$0.28
    -20% off
    $1.38$1.10
    -20% off
    —
    Groq
    deepseek-r1-distill-llama-70b
    $0.75$0.99—
    MiniMax
    minimax-text-01
    $0.20$1.10—
    Z AI
    glm-image
    $0.015/req——
    Perplexity
    sonar
    $1.00$1.00—
    Nebius AI
    deepseek-v3
    $0.50$1.50—
    Page 7 of 9

    Newsletter

    Stay ahead of the curve

    Join developers who get weekly insights on LLM routing, new model launches, and cost optimization — straight to their inbox.

    • New models & providers as they drop
    • Tips to cut latency & costs
    • Early access to beta features

    No spam. Unsubscribe anytime.

    LLM Gateway

    Product

    • Features
    • Models
    • Providers
    • Chat Playground
    • Changelog
    • DevPass
    • Compare Models
    • Enterprise

    Resources

    • Templates
    • Agents
    • MCP Server
    • Blog
    • Documentation
    • Integrations
    • Guides
    • Brand Assets
    • Token Cost Calculator
    • Referral Program
    • GitHub
    • Contact Us

    Community

    • Twitter
    • Discord

    Compare

    • OpenRouter
    • LiteLLM

    Models

    • Text Generation
    • Text to Image
    • Image to Image
    • Vision
    • Reasoning
    • Tool Calling
    • Web Search
    • Discounted

    Providers

    • OpenAI
    • Anthropic
    • Google AI Studio
    • Glacier
    • Google Vertex AI
    • Quartz
    • Obsidian
    • Avalanche
    • Groq
    • Cerebras
    • xAI
    • DeepSeek
    • Alibaba Cloud
    • NovitaAI
    • AWS Bedrock
    • Azure
    • Z AI
    • Moonshot AI
    • Perplexity
    • Nebius AI
    • Mistral AI
    • CanopyWave
    • Inference.net
    • Together AI
    • Custom
    • NanoGPT
    • ByteDance
    • MiniMax
    • EmberCloud

    © 2026 LLM Gateway. All rights reserved.

    All systems operationalPrivacy PolicyTerms of Use