Meta

Llama 3.1 405B

Complete specs, pricing, and benchmark data for Llama 3.1 405B by Meta. Last verified 2026-04-20.

JSON ModeFunctionsStreaming
Pricing

Input / 1M tokens

$3.00

Output / 1M tokens

$3.00

Context & Output

Context Window

128K

Max Output

4,096

TTFT

150ms

Speed

100 tok/s

Benchmarks

Arena ELO

1240

Coding ELO

1200

Reasoning ELO

1250

HumanEval

89.5

MMLU

85.9

MATH

112

GPQA

45

Try Llama 3.1 405B API

Start building with Llama 3.1 405B

Get API Access

Price History (Input $/M tokens)

Strengths
  • +Largest open model
  • +Excellent reasoning
  • +High quality outputs
Limitations
  • -Expensive for inference
  • -Slow token throughput

Best For

General PurposeReasoningLong Context

Compare Llama 3.1 405B with...

Official Pricing Page →
Your ad here