Meta

Llama 3.1 8B

Complete specs, pricing, and benchmark data for Llama 3.1 8B by Meta. Last verified 2026-04-20.

JSON Mode, Functions, Streaming

Pricing

Input / 1M tokens: $0.020
Output / 1M tokens: $0.050
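
As a rough illustration of how the per-token prices above translate into a per-request cost, here is a minimal Python sketch. The function name `estimate_cost` and the example token counts are hypothetical. The two price constants are the only values taken from this page, and the calculation assumes simple linear billing with no minimums, caching discounts, or rounding.

```python
# Listed prices from this page, in USD per 1M tokens.
INPUT_PRICE_PER_M = 0.020
OUTPUT_PRICE_PER_M = 0.050


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request, assuming linear per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


# Hypothetical request: 10,000 prompt tokens and 1,000 completion tokens.
cost = estimate_cost(10_000, 1_000)
print(f"Estimated cost: ${cost:.6f}")
```

At these rates, output tokens cost 2.5 times as much as input tokens, so long completions affect the bill more than long prompts do.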

Context & Output

Context Window: 128K tokens
Max Output: 4,096 tokens
TTFT: 150 ms
Speed: 100 tok/s
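
The latency figures above can be combined into a back-of-the-envelope estimate of response time: time to first token plus generation time at the listed speed. The sketch below assumes a constant decoding rate and ignores network overhead, queuing, and prompt length effects, so real latency will differ. The function name `estimate_latency` is hypothetical.

```python
# Figures listed on this page.
TTFT_SECONDS = 0.150        # time to first token
TOKENS_PER_SECOND = 100.0   # listed generation speed
MAX_OUTPUT_TOKENS = 4096    # listed maximum output length


def estimate_latency(output_tokens: int) -> float:
    """Estimate total response time in seconds, assuming a constant decode rate."""
    if not 0 <= output_tokens <= MAX_OUTPUT_TOKENS:
        raise ValueError(f"output_tokens must be between 0 and {MAX_OUTPUT_TOKENS}")
    return TTFT_SECONDS + output_tokens / TOKENS_PER_SECOND


# Hypothetical 500-token completion, then the maximum-length completion.
print(f"500 tokens: about {estimate_latency(500):.2f} s")
print(f"{MAX_OUTPUT_TOKENS} tokens: about {estimate_latency(MAX_OUTPUT_TOKENS):.2f} s")
```

Under this simple model, a maximum-length response takes roughly 41 seconds, which is worth keeping in mind when setting client timeouts.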

Benchmarks

Arena ELO: 1120
Reasoning ELO: 1120
HumanEval: 72
MMLU: 79
MATH: 40
GPQA: 40

Strengths
  • Very cheap: $0.020 input and $0.050 output per 1M tokens
  • Fast: 100 tok/s with a 150 ms time to first token
  • Good for lightweight tasks
Limitations
  • Weaker on complex tasks
  • Limited reasoning (MATH 40, GPQA 40)

Best For

Cost Effective, Fast Inference
