Google's fastest and cheapest model. Highest throughput of any major LLM API.
Avg TTFT: 437 ms
Avg Latency: 711 ms
P95 Latency: 1431 ms
Throughput: 124.4 tok/s
ITL: 103.5 ms
Error Rate: 0%
Pings (24h): 287
Uptime: 100%
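The page does not say how these figures are measured. As a rough sketch only, the snippet below shows one common way such metrics could be derived from the timestamps of a single streamed response, assuming TTFT means time to first token and ITL means the mean gap between consecutive tokens. The `StreamTiming` structure, the function names, the throughput definition (tokens divided by total request time), and the nearest-rank percentile are all assumptions made for illustration, not the dashboard's actual method.

```python
import math
from dataclasses import dataclass
from typing import List


@dataclass
class StreamTiming:
    """Timestamps in seconds for one streamed request (hypothetical structure)."""
    request_sent: float
    token_times: List[float]  # arrival time of each output token, in order; non-empty


def ttft_ms(t: StreamTiming) -> float:
    # Time to first token: delay between sending the request and the first token.
    return (t.token_times[0] - t.request_sent) * 1000.0


def latency_ms(t: StreamTiming) -> float:
    # Total latency: delay between sending the request and the last token.
    return (t.token_times[-1] - t.request_sent) * 1000.0


def itl_ms(t: StreamTiming) -> float:
    # Inter-token latency: mean gap between consecutive tokens (0 if only one token).
    gaps = [b - a for a, b in zip(t.token_times, t.token_times[1:])]
    return sum(gaps) / len(gaps) * 1000.0 if gaps else 0.0


def throughput_tok_s(t: StreamTiming) -> float:
    # Assumed definition: output tokens divided by total request time.
    return len(t.token_times) / (t.token_times[-1] - t.request_sent)


def percentile(values: List[float], p: float) -> float:
    # Nearest-rank percentile, e.g. p=95 for a P95 over many pings.
    ordered = sorted(values)
    rank = max(1, math.ceil(p * len(ordered) / 100))
    return ordered[rank - 1]
```

Under these assumptions, averaging `ttft_ms` and `latency_ms` over every ping in the window would give the "Avg" figures, and `percentile(latencies, 95)` the P95. A dashboard that counts tokens per streamed chunk, or measures throughput only after the first token, would report different numbers from the same timestamps.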
See live performance charts
Interactive charts with all 14 models, updated every 5 minutes.