Explore APIAny models across chat, image, video, and audio — capabilities, request types, pricing, and parameters for every model.

Google Gemini 3.5 Flash — fast, cost-efficient multimodal model with strong reasoning and a large context window.
Input: ≈ $0.75-$1.5
Output: ≈ $5-$10
Cache: ≈ $0.075-$0.15

MiniMax M3 — a capable large language model for long-context reasoning, tool use, and multilingual chat.
Input: ≈ $1.2-$2.4
Output: ≈ $5-$10
Cache: ≈ $1.5-$3

DeepSeek V4 — strong reasoning and coding performance at a highly competitive price.
Input: ≈ $0.175-$0.35
Output: ≈ $0.375-$0.75
Cache: ≈ $0.01-$0.02

OpenAI GPT-5.5 — flagship model for advanced reasoning, coding, and complex multi-step instructions.
Input: ≈ $1-$2
Output: ≈ $6-$12
Cache: ≈ $0.1-$0.2

Google Gemini 2.5 Flash Lite — ultra-low-cost, low-latency model for high-volume everyday tasks.
Input: ≈ $0.1-$0.2
Output: ≈ $0.15-$0.3
Cache: ≈ $0.01-$0.02

Google Gemini 3.1 Flash Lite — fast, economical model with improved reasoning over the 2.5 generation.
Input: ≈ $0.125-$0.25
Output: ≈ $0.3-$0.6
Cache: ≈ $0.0125-$0.025

Google Gemini 3.1 Pro — top-tier multimodal model for complex reasoning, long context, and demanding production workloads.
Input: ≈ $1-$2
Output: ≈ $6-$12
Cache: ≈ $0.1-$0.2

OpenAI GPT-4o mini — fast, affordable multimodal model for everyday chat and lightweight tasks.
Input: ≈ $0.25-$0.5
Output: ≈ $0.375-$0.75
Cache: ≈ $0.0125-$0.025

OpenAI GPT-5.4 — high-capability model for advanced reasoning, coding, and agentic workflows.
Input: ≈ $0.625-$1.25
Output: ≈ $3.75-$7.5
Cache: ≈ $0.0625-$0.125

Anthropic Claude Opus 4.8 — frontier model for the most demanding reasoning, coding, and long-form work.
Input: ≈ $0.5-$1
Output: ≈ $2.5-$5
Cache: ≈ $0.05-$0.1

Anthropic Claude Sonnet 4.6 — a balanced frontier model delivering strong reasoning at production speed and cost.
Input: ≈ $0.375-$0.75
Output: ≈ $1.88-$3.75
Cache: ≈ $0.025-$0.05

Zhipu GLM-5.1 — a capable bilingual (Chinese/English) model for reasoning, coding, and agent applications.
Input: ≈ $0.75-$1.5
Output: ≈ $3.13-$6.25
Cache: ≈ $0.25-$0.5
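
To compare models on a concrete workload, the listed ranges can be turned into a rough low/high cost estimate. The sketch below is illustrative only and rests on two assumptions that the listing does not confirm: that prices are in USD per 1M tokens, and that cached tokens are billed at the Cache rate instead of the Input rate. The names `PriceRange`, `ModelPricing`, and `estimate_cost` are hypothetical helpers, not part of any APIAny SDK.

```python
from dataclasses import dataclass

# ASSUMPTION: the listing does not state a billing unit;
# USD per 1M tokens is assumed here.
TOKENS_PER_PRICE_UNIT = 1_000_000


@dataclass(frozen=True)
class PriceRange:
    low: float
    high: float


@dataclass(frozen=True)
class ModelPricing:
    input: PriceRange
    output: PriceRange
    cache: PriceRange


# Ranges copied from the DeepSeek V4 entry in the listing.
DEEPSEEK_V4 = ModelPricing(
    input=PriceRange(0.175, 0.35),
    output=PriceRange(0.375, 0.75),
    cache=PriceRange(0.01, 0.02),
)


def estimate_cost(pricing, input_tokens, output_tokens, cached_tokens=0):
    """Return a (low, high) cost estimate in USD.

    ASSUMPTION: cached_tokens are billed at the cache rate and are
    not also counted in input_tokens.
    """
    def total(bound):
        # bound is "low" or "high", selecting one end of each range.
        return (
            input_tokens * getattr(pricing.input, bound)
            + output_tokens * getattr(pricing.output, bound)
            + cached_tokens * getattr(pricing.cache, bound)
        ) / TOKENS_PER_PRICE_UNIT

    return total("low"), total("high")


if __name__ == "__main__":
    low, high = estimate_cost(
        DEEPSEEK_V4, input_tokens=2_000_000, output_tokens=1_000_000
    )
    print(f"Estimated cost: ${low:.3f} to ${high:.3f}")
```

Under these assumptions, 2M input tokens and 1M output tokens on DeepSeek V4 come to roughly $0.725 at the low end and $1.45 at the high end. If the actual billing unit differs, only `TOKENS_PER_PRICE_UNIT` needs to change.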