
Llama 3.3 70B (Groq)

Specs, pricing, and benchmark data for Llama 3.3 70B as served by Groq. Last verified 2026-04-20.

JSON Mode · Functions · Streaming

Pricing

Input / 1M tokens

$0.590

Output / 1M tokens

$0.790
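
As a rough illustration of how the per-million-token prices above turn into a per-request cost, here is a minimal sketch. The function name and the token counts are hypothetical; only the two prices come from this page, and actual billing may round or meter differently.

```python
# Prices as listed on this page (USD per 1M tokens).
INPUT_USD_PER_MTOK = 0.590
OUTPUT_USD_PER_MTOK = 0.790


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request from its token counts.

    Hypothetical helper: assumes simple linear pricing with no
    minimums, caching discounts, or rounding.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_USD_PER_MTOK
             + output_tokens * OUTPUT_USD_PER_MTOK)
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 10,000-token prompt with a 1,000-token reply.
    print(f"${estimate_cost_usd(10_000, 1_000):.5f}")
```

Because output tokens cost more than input tokens here, long generations dominate the bill faster than long prompts do.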

Context & Output

Context Window

128K tokens

Max Output

4,096 tokens

TTFT

150ms

Speed

100 tok/s
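
The TTFT and speed figures above can be combined into a back-of-the-envelope latency estimate: time to first token plus output tokens divided by throughput. The sketch below assumes the listed values hold steady for a whole response, which real traffic will not guarantee; the function name is hypothetical.

```python
# Figures as listed on this page.
TTFT_SECONDS = 0.150        # time to first token
TOKENS_PER_SECOND = 100.0   # steady-state generation speed
MAX_OUTPUT_TOKENS = 4096    # maximum output length


def estimate_latency_seconds(output_tokens: int) -> float:
    """Estimate wall-clock time for a full response.

    Hypothetical helper: assumes constant throughput after the
    first token and ignores network and queueing delays.
    """
    if not 0 <= output_tokens <= MAX_OUTPUT_TOKENS:
        raise ValueError(f"output_tokens must be in 0..{MAX_OUTPUT_TOKENS}")
    return TTFT_SECONDS + output_tokens / TOKENS_PER_SECOND


if __name__ == "__main__":
    # Example: a 500-token reply, then the longest possible reply.
    print(f"{estimate_latency_seconds(500):.2f} s")
    print(f"{estimate_latency_seconds(MAX_OUTPUT_TOKENS):.2f} s")
```

Under these assumptions generation time, not TTFT, dominates for anything beyond a few dozen tokens.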

Benchmarks

Arena ELO

1220

Coding ELO

1180

Reasoning ELO

1220

HumanEval

88

MMLU

86.2

MATH

96

GPQA

40

[Chart: Price History (Input $/M tokens)]

Strengths
  • Very fast inference
  • High token throughput
  • Runs on Groq's LPU hardware
Limitations
  • Context limited compared with self-hosted vLLM deployments
  • Fewer context window size options

Best For

Fast Inference · General Purpose · Coding
