Meta officially released Llama 3, the next generation of its open-source large language models, on April 18, 2024. The initial release includes two parameter…
This case study details how biomedical AI startup Ryght leveraged Hugging Face's Expert Support service to overcome the many challenges of deploying generative…
Hugging Face has announced the launch of Idefics2, the next generation of its open-source Vision Language Model (VLM). With 8 billion (8B) parameters, this…
As the parameter counts of large language models (LLMs) have skyrocketed, the hardware requirements for training and fine-tuning these models have risen…
Hugging Face has announced a deep partnership with NVIDIA to directly integrate NVIDIA DGX Cloud services into the Hugging Face platform. This collaboration…
After Google released the Gemma family of open-source models (including 2B and 7B parameter versions), Hugging Face promptly published this practical…
Google has officially released a new family of open-source large language models called "Gemma" — a series of lightweight, state-of-the-art open-source models…
This technical blog post from Hugging Face provides a detailed walkthrough of how to use the `transformers` library to fine-tune Meta's open-source W2V2-BERT…
This technical blog post from Hugging Face takes an in-depth look at the latest techniques in "preference tuning," with a particular focus on **Direct…
Hugging Face's official blog announced a partnership with the Unsloth team to integrate Unsloth's efficient fine-tuning technology directly into Hugging Face's…
This Hugging Face blog post takes an in-depth look at how to use LoRA (Low-Rank Adaptation) to fine-tune three models of different architectures and scales for…
This technical blog post from Hugging Face takes an in-depth look at the critical "implementation details" that are routinely glossed over in academic papers…
AI cloud deployment and runtime platform Replicate has announced official support for fine-tuning Meta's open-source music generation model MusicGen. This new…
Hugging Face published a blog post introducing how to use the DDPO (Denoising Diffusion Policy Optimization) algorithm within the TRL (Transformer…
When fine-tuning massively large open-source models like Llama 2 70B — with its 70 billion parameters — developers frequently encounter a bottleneck that goes…
In today's software development workflows, AI coding assistants have become a critical tool for boosting developer productivity. However, for many enterprises…
AI cloud hosting platform Replicate has announced a major technical breakthrough for fine-tuned models: the "cold boot" time for fine-tuned models has been…
### Background and Enterprise Pain Points The widespread adoption of AI coding assistants like GitHub Copilot has significantly boosted developer productivity…
### Background and Pain Points Traditional RLHF (Reinforcement Learning from Human Feedback), while achieving enormous success with models like ChatGPT…
The well-known cloud AI hosting platform Replicate has announced official support for fine-tuning Meta's open-source large language model Llama 2. This service…
Meta and Microsoft jointly announced Llama 2, a new generation of open-source large language models. Compared to the original Llama, Llama 2 increases training…
This official Hugging Face blog post systematically maps out the complete ecosystem it has built around open-source large language models (LLMs). As…
In the era of booming generative AI, fine-tuning large image generation models like Stable Diffusion has generally been considered the exclusive domain of…
This technical blog post from Hugging Face details how to accelerate the vision-language model (VLM) "BridgeTower" on Intel's Habana Gaudi2 deep learning…
Meta's MMS (Massively Multilingual Speech) project, released in 2023, extends speech technology to over 1,000 languages, covering automatic speech recognition…
The Falcon series of large language models (including Falcon-40B and Falcon-7B), developed by Abu Dhabi's Technology Innovation Institute (TII), have…
This official Hugging Face blog post introduces a deep integration with the `bitsandbytes` library, formally adding 4-bit quantization support to…
This Hugging Face blog post provides an in-depth exploration of how to use InstructPix2Pix technology to apply instruction tuning to Stable Diffusion, enabling…
Hugging Face has announced the launch of StarChat Alpha, a conversational AI assistant designed specifically for programming. The model is based on StarCoder…
This technical guide from Hugging Face provides a detailed walkthrough of how to efficiently train language models by combining TensorFlow, the Hugging Face…