Microsoft

Phi-3.5 Mini

Complete specs, pricing, and benchmark data for Phi-3.5 Mini by Microsoft. Last verified 2026-04-20.

JSON ModeFunctionsStreaming
Pricing

Input / 1M tokens

$0.130

Output / 1M tokens

$0.520

Context & Output

Context Window

128K

Max Output

4,096

TTFT

150ms

Speed

100 tok/s

Benchmarks

Arena ELO

1160

Coding ELO

1165

Reasoning ELO

1160

HumanEval

79

MMLU

81

MATH

48

GPQA

40

Try Phi-3.5 Mini API

Start building with Phi-3.5 Mini

Get API Access

Price History (Input $/M tokens)

Strengths
  • +Very cheap
  • +Fast
  • +Large context
Limitations
  • -Limited reasoning
  • -Smaller model

Best For

Cost EffectiveFast Inference

Compare Phi-3.5 Mini with...

Official Pricing Page →
Your ad here