Meta

Llama 3.1 70B

Complete specs, pricing, and benchmark data for Llama 3.1 70B by Meta. Last verified 2026-04-20.

JSON ModeFunctionsStreaming
Pricing

Input / 1M tokens

$0.400

Output / 1M tokens

$0.400

Context & Output

Context Window

128K

Max Output

4,096

TTFT

150ms

Speed

100 tok/s

Benchmarks

Arena ELO

1195

Reasoning ELO

1195

HumanEval

85.9

MMLU

85.2

MATH

76

GPQA

40

Try Llama 3.1 70B API

Start building with Llama 3.1 70B

Get API Access

Price History (Input $/M tokens)

Strengths
  • +Good balance of speed and quality
  • +Large context window
  • +Strong instruction following
Limitations
  • -Weaker than 405B on complex reasoning

Best For

CodingGeneral Purpose

Compare Llama 3.1 70B with...

Official Pricing Page →
Your ad here