Google AI Blog
ยท
Apr 2, 2026 4:00 PM
New ways to balance cost and reliability in the Gemini API
Google is introducing two new inference tiers to the Gemini API, Flex and Priority,
to balance cost and latency.
Read at Google AI Blog
to balance cost and latency.