HiDream-O1-Image-1.5, a Chinese text-to-image model, has reached the top of domestic leaderboards and secured second place globally in the latest benchmark standings. The model reportedly outperforms image-generation offerings from Google and NVIDIA. The result marks a significant milestone for Chinese generative image research on the world stage.
Photoroom, the well-known AI image editing tool, recently published Part 3 of its technical blog series on Hugging Face about its in-house image generation…
Photoroom, the well-known image editing platform, recently published a series of technical blog posts about their in-house text-to-image model, PRX. In Part 2…
The Hugging Face official blog has announced that the popular diffusion model library `diffusers` now officially supports FLUX-2, the next-generation…
Google DeepMind has unveiled a new model called "Nano Banana Pro," which is also the Pro-tier image model of the Gemini 3 generation (Gemini 3 Pro Image…
### Introduction: An Important Piece of the Open-Source Image Generation Puzzle As text-to-image (T2I) technology advances rapidly, ensuring that AI-generated…
Stability AI officially launched the Stable Diffusion 3.5 (SD3.5) model series in late October 2024, and Hugging Face's Diffusers team simultaneously announced…
Black Forest Labs — a new AI team founded by the original creators of Stable Diffusion (including core developer Robin Rombach and others) — has officially…
This issue of Replicate Intelligence summarizes three major core updates from the recent open-source AI landscape: 1. **Google Gemma 2 officially launches**…
Hugging Face's official blog announced that its diffusers library now officially supports Stable Diffusion 3 (SD3), the latest release from Stability AI. SD3…
The Hugging Face official blog formally introduced a brand-new open-source text-to-image model called "aMUSEd." This model is based on a reproduction and…
Hugging Face published a blog post introducing how to use the DDPO (Denoising Diffusion Policy Optimization) algorithm within the TRL (Transformer…
Hugging Face, in collaboration with the research community, has introduced a new text-to-image diffusion model called "Würstchen." The model's standout feature…
On the occasion of the first anniversary of Stable Diffusion and Replicate's launch of Stable Diffusion XL (SDXL) fine-tuning services, this article provides…
The Hugging Face Ethics and Society team has published the fourth edition of its newsletter, this time focusing on the problem of "bias" in text-to-image (T2I)…
### Core Background and Challenges DeepFloyd IF is an advanced text-to-image model released by DeepFloyd, a research lab under Stability AI. Unlike the…
In late 2022, while continuous-space diffusion models represented by Stable Diffusion were stealing the spotlight, diffusion models operating in discrete space…
In October 2022, Stability AI officially released "Japanese Stable Diffusion," a model specifically designed for the Japanese market and culture, hosted on the…
This blog post from Replicate provides a clear and accessible introduction to running text-to-image models using Replicate's cloud API service. It serves as an…