Google's fastest and cheapest model. Highest throughput of any major LLM API.
Avg TTFT
281ms
Avg Latency
652ms
P95 Latency
840ms
Throughput
112.1tok/s
ITL
144.7ms
Error Rate
0%
Pings (24h)
144
Uptime
100%
See live performance charts
Interactive charts with all 15 models, updated every 5 minutes.
View Dashboard