Latest in AI

Showing:diffusion-modelsResearchersClear ×

🔥 Trending today

anthropic7 export-controls4 model-access3 spacex3 amazon3 national-security2 open-source2 governance2 ai-policy2 ai-regulation2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Google DeepMind Releases DiffusionGemma: Open Source Model with 4x Local AI Execution Speed Improvement
Ars Technica AI3 days agoRelease
Google DeepMind has released DiffusionGemma, an open-source model that brings diffusion-based generation to text tasks. Unlike autoregressive LLMs that generate one token at a time, diffusion models can produce outputs in parallel, dramatically cutting latency. The result is reportedly a 4x speed improvement for local AI inference, making on-device deployment significantly more practical.
NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI
NVIDIA Blog4 days agoRelease
Google DeepMind released DiffusionGemma, an experimental open model built for fast text generation. NVIDIA says it optimized the model for GeForce RTX GPUs, RTX PRO platforms, and DGX Spark systems. Instead of generating text one word at a time, DiffusionGemma produces multiple words in parallel to reduce latency for single-user workloads.
DiffusionGemma: 4x Faster Text Generation
r/LocalLLaMA top day4 days agoRelease
Google has announced DiffusionGemma, a text-generation model that applies diffusion-based techniques to the Gemma architecture, claiming speeds four times faster than standard autoregressive generation. Unlike conventional language models that predict tokens one at a time, diffusion-based methods generate text through iterative denoising, enabling parallel output. The release, published on Google's official blog, drew immediate attention from the local-LLM community for its potential inference-efficiency gains.
邁向光速文本生成：NVIDIA Nemotron-Labs 推出擴散語言模型 (Diffusion Language Models)★ 75
Hugging Face Blog22 days agoRelease
Traditional large language models (such as GPT, Claude, and others) all use an "autoregressive" mechanism — that is, they must predict the next token based on…
Hugging Face 推出 Modular Diffusers：為擴散模型管線打造的可組合模組化積木★ 78
Hugging Face Blog101 days agoRelease
The Hugging Face official blog has announced the launch of "Modular Diffusers" — a major architectural overhaul of its widely popular `diffusers` library. In…
PRX 第三部分：在 24 小時內訓練一個 Text-to-Image 圖像生成模型！★ 75
Hugging Face Blog103 days agoTutorial
Photoroom, the well-known AI image editing tool, recently published Part 3 of its technical blog series on Hugging Face about its in-house image generation…
Text-to-Image 模型訓練設計：來自 Photoroom 消融實驗的實戰啟示★ 75
Hugging Face Blog131 days agoTutorial
Photoroom, the well-known image editing platform, recently published a series of technical blog posts about their in-house text-to-image model, PRX. In Part 2…
Hugging Face Diffusers 量化後端深度探索：在消費級 GPU 高效運行大型擴散模型★ 80
Hugging Face Blog389 days agoTutorial
As diffusion models (such as Flux.1 and Stable Diffusion 3) continue to grow in parameter count — often reaching tens of billions or even hundreds of billions…
Hugging Face 推出 Remote VAE 功能：優化 Inference Endpoints 的圖像解碼與 VRAM 佔用★ 75
Hugging Face Blog475 days agoRelease
In the generative AI domain, latent diffusion models (such as Stable Diffusion, FLUX.1, etc.) operate in two main stages: first, denoising and generation take…
使用 Quanto 與 Diffusers 打造記憶體高效的 Diffusion Transformers (DiT)★ 80
Hugging Face Blog684 days agoRelease
### Background and Challenges As generative AI technology evolves, image and video generation models are increasingly transitioning from traditional UNet…
SegMoE：Segmind 推出擴散模型混合專家（Mixture of Diffusion Experts）框架★ 75
Hugging Face Blog862 days agoRelease
In the large language model (LLM) space, the Mixture of Experts (MoE) architecture (as seen in models like Mixtral 8x7B) has proven capable of dramatically…
使用 TRL 透過 DDPO 微調 Stable Diffusion 模型★ 75
Hugging Face Blog989 days agoRelease
Hugging Face published a blog post introducing how to use the DDPO (Denoising Diffusion Policy Optimization) algorithm within the TRL (Transformer…
介紹 Würstchen：超快速且高效的圖像生成擴散模型★ 75
Hugging Face Blog1,005 days agoRelease
Hugging Face, in collaboration with the research community, has introduced a new text-to-image diffusion model called "Würstchen." The model's standout feature…
🤗 Diffusers 迎來一週年！回顧開源擴散模型的黃金發展史
Hugging Face Blog1,060 days agoRelease
An official Hugging Face blog post celebrates the one-year anniversary of its core open-source library, `diffusers`. Since its release in July 2022, Diffusers…
深入探討文字生成影片 (Text-to-Video) 模型：原理、開源現況與 Diffusers 實作
Hugging Face Blog1,133 days agoTutorial
This Hugging Face blog post takes an in-depth look at the development of text-to-video (T2V) technology and the principles behind it. In mid-2023, as…
VQ-Diffusion：基於離散擴散模型的文本到圖像生成技術
Hugging Face Blog1,292 days agoRelease
In late 2022, while continuous-space diffusion models represented by Stable Diffusion were stealing the spotlight, diffusion models operating in discrete space…
Hugging Face 舉辦「擴散模型直播活動」：探索 AI 圖像生成的幕後技術
Hugging Face Blog1,297 days agoTutorial
This blog post is an event announcement published by Hugging Face in November 2022, announcing the "Diffusion Models Live Event." In the second half of 2022…
詳解擴散模型：The Annotated Diffusion Model 程式碼與原理實戰指南★ 85
Hugging Face Blog1,468 days agoTutorial
This classic blog post from Hugging Face, "The Annotated Diffusion Model," is an essential guide for learning about generative AI image synthesis. Modeled…

Latest in AI

Google DeepMind Releases DiffusionGemma: Open Source Model with 4x Local AI Execution Speed Improvement

NVIDIA Accelerates Google DeepMind’s DiffusionGemma for Local AI

DiffusionGemma: 4x Faster Text Generation

邁向光速文本生成：NVIDIA Nemotron-Labs 推出擴散語言模型 (Diffusion Language Models)★ 75

Hugging Face 推出 Modular Diffusers：為擴散模型管線打造的可組合模組化積木★ 78

PRX 第三部分：在 24 小時內訓練一個 Text-to-Image 圖像生成模型！★ 75

Text-to-Image 模型訓練設計：來自 Photoroom 消融實驗的實戰啟示★ 75

Hugging Face Diffusers 量化後端深度探索：在消費級 GPU 高效運行大型擴散模型★ 80

Hugging Face 推出 Remote VAE 功能：優化 Inference Endpoints 的圖像解碼與 VRAM 佔用★ 75

使用 Quanto 與 Diffusers 打造記憶體高效的 Diffusion Transformers (DiT)★ 80

SegMoE：Segmind 推出擴散模型混合專家（Mixture of Diffusion Experts）框架★ 75

使用 TRL 透過 DDPO 微調 Stable Diffusion 模型★ 75

介紹 Würstchen：超快速且高效的圖像生成擴散模型★ 75

🤗 Diffusers 迎來一週年！回顧開源擴散模型的黃金發展史

深入探討文字生成影片 (Text-to-Video) 模型：原理、開源現況與 Diffusers 實作

VQ-Diffusion：基於離散擴散模型的文本到圖像生成技術

Hugging Face 舉辦「擴散模型直播活動」：探索 AI 圖像生成的幕後技術

詳解擴散模型：The Annotated Diffusion Model 程式碼與原理實戰指南★ 85