Google is now processing over 3.2 quadrillion tokens per month, which represents a sevenfold increase compared to the previous year. This surge in processing capability follows the launch of Gemma 4 and updates to Gemini tools in April 2026, which aim to enhance reasoning, agents, and developer applications. Additionally, Google introduced expanded AI infrastructure tools at Cloud Next 2026 to support the growing demands imposed by large-scale model operations and the evolving landscape of AI.
Google: Google is a leading technology company focused on search, cloud computing, and artificial intelligence, with its Gemini models serving as a core part of its AI offerings. In the past month, the company rolled out significant AI infrastructure expansions at Google Cloud Next 2026, including new capabilities for the agentic era such as advanced chips and the Gemini Enterprise Agent Platform. This scaling directly supports the surge in AI-driven token processing highlighted in the news.
Model Releases: April 2026 saw the launch of Gemma 4 and enhancements to Gemini tools for reasoning, agents, and developer applications.
Infrastructure Scaling: Google introduced expanded AI infrastructure tools and platforms at Cloud Next 2026 to handle the demands of the agentic era and large-scale model operations.
