Google recently unveiled a brand-new "anything-to-anything" multimodal AI model — Gemini Omni — whose powerful cross-modal generation and transformation…
In the latest issue of Latent Space AINews, the major announcements from Google I/O 2026 were covered in depth. Google demonstrated its formidable R&D and…
Google DeepMind recently shared the behind-the-scenes production process of the science fiction short film ANCESTRA, created in collaboration with…
Alibaba's open-source Wan2.1 is a text-to-video model that has been attracting considerable attention. To help developers and creators get the most out of this…
This official Hugging Face blog post takes an in-depth look at the current state of open-source video generation models within the Diffusers ecosystem. As…
In the history of AI development, the open-sourcing of Stable Diffusion in 2022 is regarded as a pivotal turning point in the field of image generation — it…