Fuel your creativity with new generative media models and tools

Google introduced Veo 3, Imagen 4, Flow, and expanded music generation tools for creators and enterprises.

Google announced new generative media models and tools at I/O 2025, led by Veo 3 for video, Imagen 4 for images, and Flow for AI filmmaking. Veo 3 adds audio generation, while Imagen 4 improves detail and typography and supports a range of aspect ratios and output at up to 2K resolution. Google also expanded access to Lyria 2 and Lyria RealTime, continued SynthID watermarking, and launched SynthID Detector.

This Google DeepMind article centers on the release of a set of generative media models and tools aimed at creators. The core updates are the video model Veo 3, the image model Imagen 4, the AI filmmaking tool Flow, and an expanded rollout of the music model Lyria 2. Veo 3 is the most eye-catching update: Google says it not only improves on Veo 2's video quality but, for the first time, can generate audio along with the video, such as traffic sounds on a city street, birdsong in a park, and even dialogue between characters. Google also emphasizes the model's understanding of text and image prompts, its handling of real-world physics, and its lip-sync capabilities. In the U.S., Veo 3 was available on launch day to Ultra subscribers in the Gemini app and in Flow, and to enterprise users via Vertex AI.

Summaries are AI-generated; the original article is authoritative.