LLM API pricing in 2026 varies dramatically across 92 models from 18 providers. Here's a comprehensive breakdown:
01.AI:
Yi-Lightning: $0.140/M input, $0.140/M outputYi-Large: $3.00/M input, $3.00/M outputAlibaba:
Qwen 2.5 7B: $0.040/M input, $0.100/M outputQwen 2.5 72B: $0.120/M input, $0.390/M outputQwQ 32B: $0.150/M input, $0.580/M outputQwen 2.5 Max: $0.160/M input, $0.640/M outputQwen 3 235B MoE: $0.455/M input, $1.82/M outputQwen 2.5 Coder 32B: $0.660/M input, $1.00/M outputAmazon:
Amazon Nova Micro: $0.035/M input, $0.140/M outputAmazon Nova Lite: $0.060/M input, $0.240/M outputAmazon Nova Pro: $0.800/M input, $3.20/M outputAnthropic:
Claude 3.5 Haiku: $0.800/M input, $4.00/M outputClaude Haiku 4: $1.00/M input, $5.00/M outputClaude Sonnet 4: $3.00/M input, $15.00/M outputClaude 3.5 Sonnet: $3.00/M input, $15.00/M outputClaude Opus 4: $5.00/M input, $25.00/M outputCohere:
Command R7B: $0.038/M input, $0.150/M outputCommand R: $0.150/M input, $0.600/M outputCommand A: $2.50/M input, $10.00/M outputCommand R+: $2.50/M input, $10.00/M outputDeepSeek:
DeepSeek V2.5: $0.140/M input, $0.280/M outputDeepSeek V3: $0.259/M input, $0.420/M outputDeepSeek R1 Distill Qwen 32B: $0.290/M input, $0.290/M outputDeepSeek R1: $0.500/M input, $2.15/M outputDeepSeek R1 Distill Llama 70B: $0.700/M input, $0.800/M outputFireworks AI:
Llama 3.3 70B (Fireworks): $0.900/M input, $0.900/M outputMixtral 8x22B (Fireworks): $0.900/M input, $0.900/M outputLlama 3.1 405B (Fireworks): $3.00/M input, $3.00/M outputGoogle:
Gemini Experimental 1206: $0.00/M input, $0.00/M outputGemini 2.0 Flash Thinking: $0.00/M input, $0.00/M outputGemma 2 9B: $0.030/M input, $0.090/M outputGemini 1.5 Flash 8B: $0.037/M input, $0.150/M outputGemini 2.0 Flash Lite: $0.075/M input, $0.300/M outputGemini 1.5 Flash: $0.075/M input, $0.300/M outputGemini 2.0 Flash: $0.100/M input, $0.400/M outputGemini 2.5 Flash: $0.300/M input, $2.50/M outputGemma 2 27B: $0.650/M input, $0.650/M outputGemini 2.5 Pro: $1.25/M input, $10.00/M outputGemini 1.5 Pro: $1.25/M input, $5.00/M outputGroq:
Llama 3.1 8B (Groq): $0.050/M input, $0.080/M outputGemma 2 9B (Groq): $0.200/M input, $0.200/M outputMixtral 8x7B (Groq): $0.240/M input, $0.240/M outputLlama 3.3 70B (Groq): $0.590/M input, $0.790/M outputDeepSeek R1 (Groq): $0.750/M input, $0.990/M outputMeta:
Llama 3.1 8B: $0.020/M input, $0.050/M outputLlama 4 Scout: $0.080/M input, $0.300/M outputLlama 3.3 70B: $0.120/M input, $0.380/M outputLlama 4 Maverick: $0.150/M input, $0.600/M outputLlama 3.2 11B Vision: $0.245/M input, $0.245/M outputLlama 3.1 70B: $0.400/M input, $0.400/M outputLlama 3.2 90B Vision: $0.900/M input, $0.900/M outputLlama 3.1 405B: $3.00/M input, $3.00/M outputMicrosoft:
Phi-4: $0.065/M input, $0.140/M outputPhi-3.5 Mini: $0.130/M input, $0.520/M outputPhi-3.5 MoE: $0.170/M input, $0.680/M outputPhi-3 Medium: $0.170/M input, $0.170/M outputWizardLM-2 8x22B: $0.620/M input, $0.620/M outputMistral:
Mistral Small: $0.150/M input, $0.600/M outputMistral Large: $0.500/M input, $1.50/M outputMistral AI:
Mistral Nemo 12B: $0.020/M input, $0.040/M outputMistral 7B: $0.110/M input, $0.190/M outputCodestral 22B: $0.300/M input, $0.900/M outputMistral Medium 3: $0.400/M input, $2.00/M outputPixtral Large: $2.00/M input, $6.00/M outputOpenAI:
GPT-4 1.5-nano: $0.100/M input, $0.400/M outputGPT-4o Mini: $0.150/M input, $0.600/M outputGPT-4 1.5-mini: $0.400/M input, $1.60/M outputGPT-3.5 Turbo: $0.500/M input, $1.50/M outputo3-mini: $1.10/M input, $4.40/M outputo1-mini: $1.10/M input, $4.40/M outputo4-mini: $1.10/M input, $4.40/M outputo3: $2.00/M input, $8.00/M outputGPT-4 1: $2.00/M input, $8.00/M outputGPT-4o: $2.50/M input, $10.00/M outputGPT-4o (Aug 2024): $2.50/M input, $10.00/M outputChatGPT-4o Latest: $5.00/M input, $15.00/M outputGPT-4 Turbo: $10.00/M input, $30.00/M outputo1: $15.00/M input, $60.00/M outputGPT-4.5: $75.00/M input, $150.00/M outputPerplexity:
Sonar: $1.00/M input, $1.00/M outputSonar Reasoning: $2.00/M input, $8.00/M outputSonar Pro: $3.00/M input, $15.00/M outputShanghai AI Lab:
InternLM 2.5 20B: $0.180/M input, $0.180/M outputTogether AI:
Mistral 7B (Together): $0.200/M input, $0.200/M outputLlama 3.3 70B (Together): $0.880/M input, $0.880/M outputQwen 2.5 72B (Together): $1.20/M input, $1.20/M outputDeepSeek R1 (Together): $3.00/M input, $7.00/M outputLlama 3.1 405B (Together): $3.50/M input, $3.50/M outputxAI:
Grok 3-mini: $0.300/M input, $0.500/M outputGrok 2: $2.00/M input, $10.00/M outputGrok 2 Vision: $2.00/M input, $10.00/M outputGrok 3: $3.00/M input, $15.00/M outputThe cheapest model is Gemini Experimental 1206 at $0.00/M input tokens. The most expensive output pricing is GPT-4.5 at $150.00/M output tokens.
Use our interactive pricing table for sortable, filterable comparisons, or try the cost calculator to estimate your specific monthly spend.