Google

Gemini 1.5 Flash 8B

Complete specs, pricing, and benchmark data for Gemini 1.5 Flash 8B by Google. Last verified 2026-04-20.

MultimodalJSON ModeFunctionsStreaming
Pricing

Input / 1M tokens

$0.037

Output / 1M tokens

$0.150

Context & Output

Context Window

1M

Max Output

8,192

TTFT

150ms

Speed

100 tok/s

Benchmarks

Arena ELO

1150

Reasoning ELO

1150

HumanEval

75

MMLU

78

MATH

40

GPQA

40

Try Gemini 1.5 Flash 8B API

Start building with Gemini 1.5 Flash 8B

Get API Access

Price History (Input $/M tokens)

Strengths
  • +Cheapest multimodal option
  • +Fast inference
  • +1M context
Limitations
  • -Limited reasoning
  • -Smaller model

Best For

Cost EffectiveFast InferenceMultimodal

Compare Gemini 1.5 Flash 8B with...

Official Pricing Page →
Your ad here