Back to News
Release2026-04-02
New ways to balance cost and reliability in the Gemini API
Source: Google DeepMind
Google is introducing two new inference tiers to the Gemini API, Flex and Priority, to balance cost and latency.
googlegemini
Google is introducing two new inference tiers to the Gemini API, Flex and Priority, to balance cost and latency.