Compact version of GPT-4.1. Optimized for speed and cost.
Avg TTFT
670ms
Avg Latency
1165ms
P95 Latency
1943ms
Throughput
31.6tok/s
ITL
14.1ms
Error Rate
0%
Pings (24h)
287
Uptime
100%
See live performance charts
Interactive charts with all 14 models, updated every 5 minutes.
View Dashboard