Google's current-generation fast model. Strong multimodal performance with grounding and high throughput.
Performance
Time to first token
—ms
—vs prior 24h
Total response time
—ms
—vs prior 24h
Throughput
—tok/s
—vs prior 24h
Inter-token latency
—ms
—vs prior 24h