The official Replicate blog announced the launch of a new feature that significantly enhances developer experience (DX): quickly scaffolding an application…
This Hugging Face blog post introduces **LCM-LoRA (Latent Consistency Models LoRAs)**, a revolutionary technique that enables Stable Diffusion XL (SDXL) to…
This Hugging Face blog post takes an in-depth look at how to use LoRA (Low-Rank Adaptation) to fine-tune three models of different architectures and scales for…
In everyday development, tools like GitHub Copilot dramatically improve productivity, but for enterprises or individual developers, general-purpose models may…
This article introduces the integration between Hugging Face and the open-source data exploration tool Renumics Spotlight, aimed at addressing the pain point…
This technical guide from Replicate provides detailed instructions on how to locally deploy and run Latent Consistency Models (LCMs) on Macs equipped with…
Vercel has officially launched a new AI product called "v0," a generative UI system designed specifically for front-end development. The core philosophy of v0…
This technical blog post from Replicate introduces a new approach to improving the smoothness of AI-generated video: combining **AnimateDiff** with the…
With the widespread adoption of high-quality open-source image generation models like Stable Diffusion XL (SDXL), reducing inference latency and controlling…
Hugging Face published a blog post introducing how to use the DDPO (Denoising Diffusion Policy Optimization) algorithm within the TRL (Transformer…
This case study details how Rocket Money (formerly TrueBill), a popular personal finance app, partnered with Hugging Face to address pain points in deploying…
This technical blog post from Hugging Face takes an in-depth look at 3D Gaussian Splatting (3DGS), a revolutionary technology that has taken the worlds of 3D…
Hugging Face has officially launched the "Object Detection Leaderboard," a brand-new evaluation platform designed for the computer vision field. With the rapid…
AudioLDM 2 is an advanced open-source text-to-audio and text-to-music generation model. However, under its default settings, the model's inference speed is…
Vercel hosted the Demo Day for its inaugural "Vercel AI Accelerator," an accelerator program designed to drive AI application development. The program…
On the occasion of the first anniversary of Stable Diffusion and Replicate's launch of Stable Diffusion XL (SDXL) fine-tuning services, this article provides…
Bark is an innovative text-to-audio model developed by the team at Suno. It can generate not only high-quality, multilingual speech, but also background music…
DeepFloyd IF is an advanced text-to-image model developed by DeepFloyd, a research group backed by Stability AI. Unlike the more common Stable Diffusion, it…
This article provides a detailed walkthrough of how to quickly deploy Meta's open-source MusicGen music generation model using Hugging Face Inference…
This blog post, co-authored by Hugging Face and Zama — a cryptography company specializing in Fully Homomorphic Encryption (FHE) — explores how to address a…
Hugging Face Hub, the world's largest open-source AI community platform, hosts hundreds of thousands of models, datasets, and demo applications (Spaces). For a…
Since the release of Stable Diffusion XL (SDXL), its exceptional image generation quality has attracted widespread attention. However, its massive 1.3 billion…
Hugging Face has announced the launch of a brand-new open-source library designed specifically for JavaScript and TypeScript developers: "Agents.js" (published…
In this blog post, Vercel explores the fundamental transformation of the modern web development process, and how it is shaping the profile of future…
This case study takes an in-depth look at how Writer, an enterprise-grade generative AI platform, leverages the Hugging Face open-source ecosystem and…
This technical blog post from Hugging Face details how to accelerate the vision-language model (VLM) "BridgeTower" on Intel's Habana Gaudi2 deep learning…
The Hugging Face Ethics and Society team has published the fourth edition of its newsletter, this time focusing on the problem of "bias" in text-to-image (T2I)…
Meta's MMS (Massively Multilingual Speech) project, released in 2023, extends speech technology to over 1,000 languages, covering automatic speech recognition…
In recent years, the academic community has engaged in heated debate over whether Transformers are suitable for time series forecasting — particularly after…
In the era of explosive AI application growth, frontend developers face many challenges when integrating large language models (LLMs) — for example, how to…