This technical blog post from Hugging Face provides a detailed walkthrough of how to use the `transformers` library to fine-tune Meta's open-source W2V2-BERT…
SD Turbo and SDXL Turbo are single-step/few-step text-to-image models from Stability AI, with their core innovation being Adversarial Diffusion Distillation…
Mixture of Experts (MoE) has become a core technology for improving the performance and efficiency of today's large language models (LLMs). Traditional "dense…
This Hugging Face blog post introduces **LCM-LoRA (Latent Consistency Models LoRAs)**, a revolutionary technique that enables Stable Diffusion XL (SDXL) to…
This Hugging Face blog post takes an in-depth look at how to use LoRA (Low-Rank Adaptation) to fine-tune three models of different architectures and scales for…
This article introduces the integration between Hugging Face and the open-source data exploration tool Renumics Spotlight, aimed at addressing the pain point…
This technical guide from Replicate provides detailed instructions on how to locally deploy and run Latent Consistency Models (LCMs) on Macs equipped with…
With the widespread adoption of high-quality open-source image generation models like Stable Diffusion XL (SDXL), reducing inference latency and controlling…
Hugging Face published a blog post introducing how to use the DDPO (Denoising Diffusion Policy Optimization) algorithm within the TRL (Transformer…
This case study details how Rocket Money (formerly TrueBill), a popular personal finance app, partnered with Hugging Face to address pain points in deploying…
Hugging Face has officially launched the "Object Detection Leaderboard," a brand-new evaluation platform designed for the computer vision field. With the rapid…
This technical blog post from Hugging Face takes an in-depth look at 3D Gaussian Splatting (3DGS), a revolutionary technology that has taken the worlds of 3D…
AudioLDM 2 is an advanced open-source text-to-audio and text-to-music generation model. However, under its default settings, the model's inference speed is…
On the occasion of the first anniversary of Stable Diffusion and Replicate's launch of Stable Diffusion XL (SDXL) fine-tuning services, this article provides…
Bark is an innovative text-to-audio model developed by the team at Suno. It can generate not only high-quality, multilingual speech, but also background music…
This article provides a detailed walkthrough of how to quickly deploy Meta's open-source MusicGen music generation model using Hugging Face Inference…
Hugging Face Hub, the world's largest open-source AI community platform, hosts hundreds of thousands of models, datasets, and demo applications (Spaces). For a…
This blog post, co-authored by Hugging Face and Zama — a cryptography company specializing in Fully Homomorphic Encryption (FHE) — explores how to address a…
Since the release of Stable Diffusion XL (SDXL), its exceptional image generation quality has attracted widespread attention. However, its massive 1.3 billion…
This case study takes an in-depth look at how Writer, an enterprise-grade generative AI platform, leverages the Hugging Face open-source ecosystem and…
This technical blog post from Hugging Face details how to accelerate the vision-language model (VLM) "BridgeTower" on Intel's Habana Gaudi2 deep learning…
The Hugging Face Ethics and Society team has published the fourth edition of its newsletter, this time focusing on the problem of "bias" in text-to-image (T2I)…
Meta's MMS (Massively Multilingual Speech) project, released in 2023, extends speech technology to over 1,000 languages, covering automatic speech recognition…
In recent years, the academic community has engaged in heated debate over whether Transformers are suitable for time series forecasting — particularly after…
Hugging Face has announced a deep integration with fastText — the classic open-source natural language processing (NLP) library originally from Meta —…
In the current boom of generative AI, image generation models like Stable Diffusion have become widely popular thanks to their remarkable capabilities…
Hugging Face has announced official support for RWKV (Receptive Weighted Key Value) models in its `transformers` library. RWKV is an innovative architecture…
This Hugging Face blog post takes an in-depth look at the development of text-to-video (T2V) technology and the principles behind it. In mid-2023, as…
### Core Background and Challenges DeepFloyd IF is an advanced text-to-image model released by DeepFloyd, a research lab under Stability AI. Unlike the…
The spring of 2023 was a golden era for open-source large language model (LLM) development. In April 2023, Replicate — the well-known AI model hosting platform…