LLM API Pricing Comparison 2026
Compare input and output token pricing across 92 large language models from OpenAI, Anthropic, Google, Meta, and more.
Data verified Apr 20, 2026
| Model | Provider | Input ($/M tokens) | Output ($/M tokens) | Arena Elo |
|---|---|---|---|---|
| Claude Opus 4 | Anthropic | $5.00 | $25.00 | 1503 |
| Gemini 2.5 Pro | Google | $1.25 | $10.00 | 1430 |
| o3 | OpenAI | $2.00 | $8.00 | 1340 |
| DeepSeek R1 | DeepSeek | $0.500 | $2.15 | 1310 |
| o1 | OpenAI | $15.00 | $60.00 | 1310 |
| Qwen 3 235B MoE | Alibaba | $0.455 | $1.82 | 1310 |
| Gemini Experimental 1206 | Google | $0.00 | $0.00 | 1300 |
| GPT-4.5 | OpenAI | $75.00 | $150.00 | 1290 |
| DeepSeek R1 (Groq) | Groq | $0.750 | $0.990 | 1290 |
| Llama 4 Maverick | Meta | $0.150 | $0.600 | 1290 |
| DeepSeek R1 (Together) | Together AI | $3.00 | $7.00 | 1290 |
| Grok 3 | xAI | $3.00 | $15.00 | 1285 |
| Claude Sonnet 4 | Anthropic | $3.00 | $15.00 | 1280 |
| DeepSeek V3 | DeepSeek | $0.259 | $0.420 | 1280 |
| Gemini 2.0 Flash Thinking | Google | $0.00 | $0.00 | 1280 |
| o3-mini | OpenAI | $1.10 | $4.40 | 1280 |
| Claude 3.5 Sonnet | Anthropic | $3.00 | $15.00 | 1270 |
| Gemini 2.5 Flash | Google | $0.300 | $2.50 | 1270 |
| o1-mini | OpenAI | $1.10 | $4.40 | 1270 |
| ChatGPT-4o Latest | OpenAI | $5.00 | $15.00 | 1265 |
| Gemini 2.0 Flash | Google | $0.100 | $0.400 | 1260 |
| GPT-4o | OpenAI | $2.50 | $10.00 | 1260 |
| o4-mini | OpenAI | $1.10 | $4.40 | 1260 |
| Qwen 2.5 Max | Alibaba | $0.160 | $0.640 | 1260 |
| QwQ 32B | Alibaba | $0.150 | $0.580 | 1260 |
| GPT-4o (Aug 2024) | OpenAI | $2.50 | $10.00 | 1255 |
| DeepSeek R1 Distill Llama 70B | DeepSeek | $0.700 | $0.800 | 1250 |
| Llama 4 Scout | Meta | $0.080 | $0.300 | 1250 |
| Mistral Large | Mistral AI | $0.500 | $1.50 | 1245 |
| Command A | Cohere | $2.50 | $10.00 | 1240 |
| DeepSeek R1 Distill Qwen 32B | DeepSeek | $0.290 | $0.290 | 1240 |
| Llama 3.1 405B (Fireworks) | Fireworks AI | $3.00 | $3.00 | 1240 |
| GPT-4 Turbo | OpenAI | $10.00 | $30.00 | 1240 |
| Grok 2 | xAI | $2.00 | $10.00 | 1240 |
| Llama 3.1 405B | Meta | $3.00 | $3.00 | 1240 |
| Sonar Reasoning | Perplexity | $2.00 | $8.00 | 1240 |
| Llama 3.1 405B (Together) | Together AI | $3.50 | $3.50 | 1240 |
| Gemini 1.5 Pro | Google | $1.25 | $5.00 | 1230 |
| Grok 2 Vision | xAI | $2.00 | $10.00 | 1230 |
| Pixtral Large | Mistral AI | $2.00 | $6.00 | 1230 |
| Qwen 2.5 72B | Alibaba | $0.120 | $0.390 | 1230 |
| Qwen 2.5 72B (Together) | Together AI | $1.20 | $1.20 | 1230 |
| Amazon Nova Pro | Amazon | $0.800 | $3.20 | 1220 |
| Claude 3.5 Haiku | Anthropic | $0.800 | $4.00 | 1220 |
| Claude Haiku 4 | Anthropic | $1.00 | $5.00 | 1220 |
| Llama 3.3 70B (Fireworks) | Fireworks AI | $0.900 | $0.900 | 1220 |
| GPT-4o Mini | OpenAI | $0.150 | $0.600 | 1220 |
| Llama 3.3 70B (Groq) | Groq | $0.590 | $0.790 | 1220 |
| Llama 3.3 70B | Meta | $0.120 | $0.380 | 1220 |
| Mistral Medium 3 | Mistral AI | $0.400 | $2.00 | 1220 |
| Llama 3.3 70B (Together) | Together AI | $0.880 | $0.880 | 1220 |
| Llama 3.2 90B Vision | Meta | $0.900 | $0.900 | 1210 |
| Command R+ | Cohere | $2.50 | $10.00 | 1200 |
| DeepSeek V2.5 | DeepSeek | $0.140 | $0.280 | 1200 |
| Mixtral 8x22B (Fireworks) | Fireworks AI | $0.900 | $0.900 | 1200 |
| Gemini 2.0 Flash Lite | Google | $0.075 | $0.300 | 1200 |
| GPT-4.1 | OpenAI | $2.00 | $8.00 | 1200 |
| Sonar Pro | Perplexity | $3.00 | $15.00 | 1200 |
| WizardLM-2 8x22B | Microsoft | $0.620 | $0.620 | 1200 |
| Llama 3.1 70B | Meta | $0.400 | $0.400 | 1195 |
| Phi-3.5 MoE | Microsoft | $0.170 | $0.680 | 1195 |
| Gemini 1.5 Flash | Google | $0.075 | $0.300 | 1190 |
| Gemma 2 27B | Google | $0.650 | $0.650 | 1190 |
| Mistral Small | Mistral AI | $0.150 | $0.600 | 1185 |
| Yi-Large | 01.AI | $3.00 | $3.00 | 1185 |
| GPT-4.1 Mini | OpenAI | $0.400 | $1.60 | 1180 |
| Grok 3-mini | xAI | $0.300 | $0.500 | 1175 |
| Amazon Nova Lite | Amazon | $0.060 | $0.240 | 1170 |
| Gemma 2 9B (Groq) | Groq | $0.200 | $0.200 | 1170 |
| Phi-3 Medium | Microsoft | $0.170 | $0.170 | 1170 |
| Yi-Lightning | 01.AI | $0.140 | $0.140 | 1165 |
| Gemma 2 9B | Google | $0.030 | $0.090 | 1160 |
| Mixtral 8x7B (Groq) | Groq | $0.240 | $0.240 | 1160 |
| Llama 3.2 11B Vision | Meta | $0.245 | $0.245 | 1160 |
| Phi-3.5 Mini | Microsoft | $0.130 | $0.520 | 1160 |
| Qwen 2.5 7B | Alibaba | $0.040 | $0.100 | 1160 |
| Sonar | Perplexity | $1.00 | $1.00 | 1160 |
| InternLM 2.5 20B | Shanghai AI Lab | $0.180 | $0.180 | 1155 |
| Gemini 1.5 Flash 8B | Google | $0.037 | $0.150 | 1150 |
| GPT-4.1 Nano | OpenAI | $0.100 | $0.400 | 1150 |
| Phi-4 | Microsoft | $0.065 | $0.140 | 1150 |
| Command R | Cohere | $0.150 | $0.600 | 1140 |
| Mistral Nemo 12B | Mistral AI | $0.020 | $0.040 | 1140 |
| Amazon Nova Micro | Amazon | $0.035 | $0.140 | 1130 |
| Command R7B | Cohere | $0.038 | $0.150 | 1120 |
| GPT-3.5 Turbo | OpenAI | $0.500 | $1.50 | 1120 |
| Llama 3.1 8B (Groq) | Groq | $0.050 | $0.080 | 1120 |
| Llama 3.1 8B | Meta | $0.020 | $0.050 | 1120 |
| Mistral 7B | Mistral AI | $0.110 | $0.190 | 1100 |
| Mistral 7B (Together) | Together AI | $0.200 | $0.200 | 1100 |
| Codestral 22B | Mistral AI | $0.300 | $0.900 | -- |
| Qwen 2.5 Coder 32B | Alibaba | $0.660 | $1.00 | -- |
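
The per-million-token prices in the table translate into a per-request cost with simple arithmetic. The following is a minimal sketch, not an official calculator: the function name `request_cost` and the token counts are hypothetical, and only the GPT-4o prices are taken from the table.

```python
def request_cost(input_tokens: int, output_tokens: int,
                 input_price_per_m: float, output_price_per_m: float) -> float:
    """Return the USD cost of one request, given prices per million tokens."""
    return (input_tokens * input_price_per_m
            + output_tokens * output_price_per_m) / 1_000_000


# GPT-4o row from the table: $2.50 input, $10.00 output per million tokens.
# The token counts below are made-up example values.
cost = request_cost(input_tokens=2_000, output_tokens=500,
                    input_price_per_m=2.50, output_price_per_m=10.00)
print(f"${cost:.4f}")  # prints $0.0100
```

Multiplying the result by an expected request volume gives a rough monthly estimate. Real bills may differ because of caching discounts, batch pricing, or tiered rates, none of which this sketch models.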
Frequently Asked Questions
- Which LLM API is the cheapest in 2026?
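- As of April 2026, GPT-4.1 Nano ($0.100 input / $0.400 output per million tokens) and Gemini 2.0 Flash Lite ($0.075 / $0.300) are among the lowest-priced models from the largest proprietary providers, while small open-weight models such as Mistral Nemo 12B ($0.020 / $0.040) and Llama 3.1 8B ($0.020 / $0.050) list even lower rates. Because input and output tokens are priced separately, the cheapest option depends on your specific usage pattern.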
- How often are LLM API prices updated?
- We verify pricing directly from provider documentation every week. Each model listing shows a 'Last verified' date so you can confirm the data is current.
- What is the difference between input and output token pricing?
- Input tokens are the tokens you send to the API (your prompt), while output tokens are the tokens the model generates in its response. Most providers charge different rates for each, with output tokens typically costing 2-5x more than input tokens.
- Do any LLM APIs offer free tiers?
- Several providers offer limited free tiers or trial credits. Google's Gemini API has a generous free tier with lower rate limits. OpenAI and Anthropic offer sign-up credits for new accounts. Check each provider's pricing page for current free tier details.
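
The point in the first answer, that the cheapest option depends on the mix of input and output tokens, can be illustrated with two rows from the table. This is a sketch under stated assumptions: the helper names `workload_cost` and `cheapest` and both workloads are hypothetical, and only the prices come from the table.

```python
# Prices in USD per million tokens (input, output), copied from the table.
PRICES = {
    "DeepSeek R1": (0.500, 2.15),
    "DeepSeek R1 (Groq)": (0.750, 0.990),
}


def workload_cost(model: str, input_tokens: int, output_tokens: int) -> float:
    """USD cost of a workload on one model at the listed prices."""
    in_price, out_price = PRICES[model]
    return (input_tokens * in_price + output_tokens * out_price) / 1_000_000


def cheapest(input_tokens: int, output_tokens: int) -> str:
    """Name of the model with the lowest cost for the given token mix."""
    return min(PRICES, key=lambda m: workload_cost(m, input_tokens, output_tokens))


# Hypothetical input-heavy workload (e.g. long documents, short answers).
print(cheapest(1_000_000, 100_000))   # prints DeepSeek R1
# Hypothetical output-heavy workload (e.g. short prompts, long generations).
print(cheapest(100_000, 1_000_000))   # prints DeepSeek R1 (Groq)
```

The same model is cheaper from one host for input-heavy traffic and from the other for output-heavy traffic, so it is worth estimating your own token mix before comparing rows.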