How Much Does Claude API Cost?
Claude API pricing varies by model tier. Anthropic offers three model tiers with different performance and cost trade-offs:
Claude Opus 4: $5.00/M input tokens, $25.00/M output tokens. Context window: 200K. Cached input: $0.500/M.
Claude Sonnet 4: $3.00/M input tokens, $15.00/M output tokens. Context window: 200K. Cached input: $0.300/M.
Claude 3.5 Sonnet: $3.00/M input tokens, $15.00/M output tokens. Context window: 200K.
Claude Haiku 4: $1.00/M input tokens, $5.00/M output tokens. Context window: 200K. Cached input: $0.100/M.
Claude 3.5 Haiku: $0.800/M input tokens, $4.00/M output tokens. Context window: 200K.
For most production workloads, Claude Sonnet 4 offers the best balance of quality and cost. Use Haiku 4 for high-volume, low-complexity tasks like classification and extraction. Opus 4 is best for complex reasoning, coding, and analysis where quality is paramount.
Tip: Use prompt caching to reduce costs by up to 90% on repeated system prompts. Anthropic also offers batch API pricing at roughly 50% of standard rates for non-time-sensitive workloads.