Choose from the best AI models available through our unified API

DeepSeek R1
by DeepSeekDeepSeek

128K context| $0.45 input| $1.78 output|3.94s latency|99.99% uptime

DeepSeek R1 is here: Performance on par with OpenAI o1, but open-sourced and with fully open reasoning tokens. It's 671B parameters in size, with 37B active in an inference pass.

DeepSeek V3
by DeepSeekDeepSeek

128K context| $0.22 input| $0.88 output|2.37s latency|99.99% uptime

An open‑source Mixture‑of‑Experts chat model with a massive 128 K token context window, optimized for coding, math, reasoning, and long‑form workflows at low cost.

Grok 3 mini
by xAIxAI

131K context| $0.21 input| $0.35 output|2.02s latency|99.99% uptime

Streamlined for quick, casual chat and basic reasoning with the signature X-style tone.

Grok 3
by xAIxAI

131K context| $2.1 input| $10.5 output|1.83s latency|99.98% uptime

A general-purpose chat assistant optimized for informal dialogue and developer-friendly interactions.

Gemini 2.0 Flash
by GoogleGoogle

1M context| $0.07 input| $0.28 output|1.28s latency|99.98% uptime

A high-throughput, multimodal model with a massive 1 M‑token context window, optimized for real-time, large‑scale applications.

Gemini 2.5 Pro
by GoogleGoogle

1M context| $1.2 input| $8 output|1.51s latency|100.00% uptime

Expert-level multimodal LLM with advanced “Deep Think” reasoning—designed for complex coding, math, and scientific workflows.

Gemini 2.5 Flash
by GoogleGoogle

1M context| $0.21 input| $1.75 output|2.70s latency|99.97% uptime

Cost-effective multimodal workhorse with adaptive reasoning, high throughput, and built-in chain-of-thought.

Claude Opus 4 (2025-05-14)
by AnthropicAnthropic

200K context| $11.25 input| $56.25 output|2.74s latency|99.99% uptime

Anthropic’s flagship—the most powerful model yet for sustained coding, deep reasoning, and agentic workflows.

Claude 3.7 Sonnet (2025-02-19)
by AnthropicAnthropic

200K context| $2.1 input| $10.5 output|1.44s latency|99.97% uptime

Earlier hybrid model with instant or extended reasoning, suitable for general-purpose tasks and coding.

Claude 3.7 Sonnet
by AnthropicAnthropic

200K context| $2.55 input| $12.75 output|0.84s latency|99.99% uptime

Earlier hybrid model with instant or extended reasoning, suitable for general-purpose tasks and coding.