HiDream-O1-Image-1.5, a Chinese text-to-image model, has topped domestic leaderboards and placed second globally in the latest benchmark standings. The model reportedly outperforms image-generation offerings from Google and NVIDIA, a notable milestone for Chinese generative image research.
Latent Space’s roundup frames image composition as a major barrier now being tackled by layout-aware image models. Reve 2.0 emphasizes precise generation and editing with layouts, while Ideogram 4.0 uses bounding boxes tied to region descriptions. The issue also covers MAI-Thinking-1, Gemma 4 12B, open audio models, agent execution layers, and model-routing cost debates.
Black Forest Labs, a new AI team founded by the original creators of Stable Diffusion (including core developer Robin Rombach), has officially…
In late 2022, while continuous-space diffusion models represented by Stable Diffusion were stealing the spotlight, diffusion models operating in discrete space…
This classic blog post from Hugging Face, "The Annotated Diffusion Model," is an essential guide for learning about generative AI image synthesis. Modeled…