Monitor LLM APIs in real-time

ms

median TTFT

models operational

Direct response models — stream tokens as they generate

Model Comparison

Loading...

Time to First Token

Loading...

Time to Last Token

Loading...

Inter-Token Latency

Loading...

Throughput

Loading...

Error Rate

Loading...

P95 Latency

Loading...

Get instant alerts when models go down

Real-time notifications when Claude, GPT-4, or Gemini degrade, plus weekly performance digests.

Join engineers monitoring LLM performance

All Models

Model Provider Type Latency TTFT TTLT ITL Tok/s P95 Error % Uptime