HiDream-O1-Image-1.5, a Chinese text-to-image model, has reached the top of domestic leaderboards and secured second place globally in the latest benchmark standings. The model reportedly outperforms image-generation offerings from Google and NVIDIA. The result marks a significant milestone for Chinese generative image research on the world stage.
Apple announced improvements to Image Playground at WWDC 2026, positioning the iPhone’s built-in AI image generator as a more capable tool. The update emphasizes natural-language photo transformations, multi-person image use, flexible output dimensions, and integrations across lock screens, iMessage backgrounds, and contact posters. TechCrunch has not tested it yet, but the presentation suggests Apple Intelligence apps may become more practical.
Latent Space’s roundup frames image composition as a major barrier now being tackled by layout-aware image models. Reve 2.0 emphasizes precise generation and editing with layouts, while Ideogram 4.0 uses bounding boxes tied to region descriptions. The issue also covers MAI-Thinking-1, Gemma 4 12B, open audio models, agent execution layers, and model-routing cost debates.
Vercel recently announced in its product update Changelog that the Vercel AI Gateway has officially added support for the "GPT Image 2" model. Vercel AI…
Vercel recently updated its AI Gateway service to officially add support for "image-only models." Previously, Vercel AI Gateway primarily provided API…
Google announced new generative media models and tools at I/O 2025, led by Veo 3 for video, Imagen 4 for images, and Flow for AI filmmaking. Veo 3 adds audio generation, while Imagen 4 improves detail, typography, aspect ratios, and up to 2K output. Google also expanded Lyria 2 and Lyria RealTime access, while continuing SynthID watermarking and launching SynthID Detector.
In late January 2025, Hugging Face officially launched a brand-new initiative: the first edition of the "AI Tools for Art Newsletter." This newsletter was…
Black Forest Labs — a new AI team founded by the original creators of Stable Diffusion (including core developer Robin Rombach and others) — has officially…
This official Hugging Face blog post provides a detailed guide on how to deploy and run ComfyUI workflows on Hugging Face Spaces for free using Gradio…
In late 2022, while continuous-space diffusion models represented by Stable Diffusion were stealing the spotlight, diffusion models operating in discrete space…
This hands-on tutorial from Replicate walks developers step by step through building a "bot artist" deployed on Discord. Users simply type a text prompt into a…
This classic blog post from Hugging Face, "The Annotated Diffusion Model," is an essential guide for learning about generative AI image synthesis. Modeled…