This week, Google announced updates to its filmmaker-focused platform, Google Flow, which has expanded to over 100 countries and enabled millions of creatives to produce over 3.5 billion images and videos since its launch at last year’s #GoogleIO. The latest enhancements include a new frontier model, Gemini Omni, that allows users to generate videos from a combination of text, audio, and image inputs, showcasing Google Flow’s capabilities in multimodal AI. Additionally, the introduction of the Google Flow Agent supports users in planning and executing complex tasks, further integrating video generation with music capabilities through advancements shared with Google Flow Music.

Google: Google develops and deploys advanced AI technologies across creative and productivity tools. It recently rolled out significant enhancements to its Google Flow ecosystem at Google I/O 2026, including new agent capabilities and the Gemini Omni model for multimodal content generation. These updates strengthen Google’s role in empowering filmmakers, artists, and creators with integrated AI workflows.
Gemini Omni: Gemini Omni is Google’s frontier multimodal AI model that processes and generates content from text, audio, video, and image inputs. It emphasizes natural editing, world understanding, and step-by-step reasoning for media creation. The model was highlighted in this week’s updates to Google Flow and Google Flow Music to advance intuitive creative tools.
Google Flow: Google Flow serves as an AI-powered creative studio designed for video and image generation and editing. It supports collaborative tools and now incorporates a new agent for planning complex tasks alongside the Gemini Omni model for flexible input combinations. The platform’s recent upgrades focus on giving users greater control and customization in their creative processes.
Google Flow Music: Google Flow Music extends Google’s creative platform to music composition, production, and songwriting with integrated AI assistance. It leverages advanced models to help artists refine ideas and outputs. Recent announcements tied its capabilities more closely to Gemini Omni for seamless audio-visual creative experiences.

`json
{
“Agentic Tools”: “The new Google Flow Agent assists with planning and reasoning through complex tasks while remaining under full user control.”,
“Multimodal AI”: “Gemini Omni enables the generation and editing of videos from combined text, audio, image, and video prompts using natural language.”,
“Creative Integration”: “Updates enhance collaboration between video generation in Google Flow and music capabilities in Google Flow Music.”
}
`