As large language models (LLMs) and generative AI exploded in popularity, demand for computing power surged dramatically, leaving Nvidia GPUs (such as the…
The release of ChatGPT in late 2022 triggered an explosion in generative AI, and a key technique behind it is Reinforcement Learning from Human…
### The Rise of Protein Deep Learning

Following the success of AlphaFold 2, artificial intelligence has achieved a revolutionary breakthrough in structural…
In late 2022, Apple and Hugging Face jointly announced that Stable Diffusion had officially gained support for Apple Silicon's Core ML framework. This update…
As the world's largest open-source AI model hub, Hugging Face not only provides model hosting but has also built a complete inference ecosystem. This article…
As privacy awareness grows and regulations tighten, performing inference with cloud AI models while protecting user privacy has become a significant challenge…
In the field of natural language generation (NLG), enabling language models to produce coherent and natural long-form text has long been a major challenge…
OpenAI's Whisper is a powerful automatic speech recognition (ASR) model. While its zero-shot capabilities are impressive, there remains significant room for…
As Transformer models become increasingly prevalent in natural language processing (NLP) and computer vision (CV), efficiently deploying these large models in…
As large language models (LLMs) become widely used across various domains, the issues of bias and toxicity in model outputs have received increasing attention…
Hugging Face Inference Endpoints is a fully managed service designed for developers and enterprises, built to solve the pain points of deploying machine…
Hugging Face officially announced support for Stable Diffusion models using the JAX and Flax frameworks in its renowned `diffusers` library. This update is a…
This technical blog post from Hugging Face documents in detail the practical process of optimizing inference for BLOOM, the open-source multilingual large…
Hugging Face officially announced the introduction of a DOI (Digital Object Identifier) mechanism for models and datasets hosted on the Hugging Face Hub. This…
In October 2022, Stability AI officially released "Japanese Stable Diffusion," a model specifically designed for the Japanese market and culture, hosted on the…
In late 2022, as massive language models like BLOOM and OPT emerged one after another, the AI community faced a core pain point: how to effectively and…
As the parameter counts of large language models (LLMs) grow exponentially, loading and running these models on limited hardware has become a major pain point…
SetFit (Sentence Transformer Fine-Tuning) is an efficient few-shot learning framework jointly developed by Hugging Face, Intel Labs, and UKP Lab. It is…
Hugging Face, the leading organization in the global open-source AI community, officially launched the first issue of its Ethics and Society Newsletter. The…
BLOOM is a massive open-source multilingual model with 176 billion parameters. Running BLOOM at FP16 precision requires at least 352 GB of video memory (VRAM)…
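The 352 GB figure can be checked with a back-of-the-envelope calculation: 176 billion parameters at 2 bytes each in FP16. The sketch below is only an illustration. The helper name `weight_memory_gb` is hypothetical, 1 GB is taken as 10^9 bytes, and the estimate covers the weights alone, not activations or other runtime overhead. The 1-byte-per-parameter line is added purely for comparison and is not a figure from the text.

```python
def weight_memory_gb(num_params: float, bytes_per_param: int) -> float:
    """Memory needed to hold the weights alone, in GB (1 GB = 1e9 bytes)."""
    return num_params * bytes_per_param / 1e9


BLOOM_PARAMS = 176e9  # 176 billion parameters, as stated in the text

# FP16 stores each parameter in 2 bytes.
fp16_gb = weight_memory_gb(BLOOM_PARAMS, 2)

# Illustrative comparison only: 1 byte per parameter (8-bit storage).
int8_gb = weight_memory_gb(BLOOM_PARAMS, 1)

print(f"FP16 weights:  {fp16_gb:.0f} GB")
print(f"8-bit weights: {int8_gb:.0f} GB")
```

Because the estimate ignores everything except the weights, it gives a lower bound, which matches the "at least" in the text.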
Hugging Face published a major update and progress report on their diffusion model library `diffusers`, two months after its launch. Since its release in July…
As language model scales continue to expand, the memory (VRAM) of a single GPU has long been unable to accommodate models with tens or hundreds of billions of…
Against the backdrop of rapid AI advancement, how to balance "openness" with "safety risks" has become a central question for the community. Traditional…
With the open-sourcing of Stable Diffusion, running powerful AI image generation models locally has become a real possibility. This guide published by…
This classic blog post published in August 2022 marks a major milestone in the open-source AI image generation space. Hugging Face announced a collaboration…
This technical tutorial from the official Hugging Face blog provides a detailed walkthrough of how to deploy the popular computer vision model ViT (Vision…
This in-depth technical blog post from Hugging Face focuses on how to efficiently deploy and fine-tune Vision Transformer (ViT) models on Graphcore's IPU…
This article introduces the deep integration between Hugging Face and the bitsandbytes library, aimed at solving the enormous memory challenges posed by…
Although Hugging Face's `transformers` library originally had PyTorch at its core (it was formerly known as `pytorch-transformers`), as the community grew, they recognized the…
This is a practical guide authored by Hugging Face, aimed at teaching developers how to train and fine-tune Sentence Transformers models to generate…