Fireworks AI

Llama 3.3 70B (Fireworks)

Complete specs, pricing, and benchmark data for Llama 3.3 70B (Fireworks) by Fireworks AI. Last verified 2026-04-20.

JSON ModeFunctionsStreaming
Pricing

Input / 1M tokens

$0.900

Output / 1M tokens

$0.900

Context & Output

Context Window

131.072K

Max Output

4,096

TTFT

150ms

Speed

100 tok/s

Benchmarks

Arena ELO

1220

Coding ELO

1180

Reasoning ELO

1220

HumanEval

88

MMLU

86.2

MATH

96

GPQA

40

Price History (Input $/M tokens)

Strengths
  • +Good speed/quality balance
  • +Competitive pricing
  • +Large context
Limitations
  • -Standard performance

Best For

CodingGeneral Purpose

Compare Llama 3.3 70B (Fireworks) with...

Official Pricing Page →
Your ad here