Replicate's official blog announced that the platform now officially supports fine-tuning for Stability AI's flagship open-source image generation model, SDXL…
The well-known cloud AI hosting platform Replicate has announced official support for fine-tuning Meta's open-source large language model Llama 2. This service…
Meta and Microsoft jointly announced Llama 2, a new generation of open-source large language models. Compared to the original Llama, Llama 2 increases training…
This official Hugging Face blog post systematically maps out the complete ecosystem it has built around open-source large language models (LLMs). As…
In the era of booming generative AI, fine-tuning large image generation models like Stable Diffusion has generally been considered the exclusive domain of…
This technical blog post from Hugging Face details how to accelerate the vision-language model (VLM) "BridgeTower" on Intel's Habana Gaudi2 deep learning…
Meta's MMS (Massively Multilingual Speech) project, released in 2023, extends speech technology to over 1,000 languages, covering automatic speech recognition…
The Falcon series of large language models (including Falcon-40B and Falcon-7B), developed by Abu Dhabi's Technology Innovation Institute (TII), have…
This official Hugging Face blog post introduces a deep integration with the `bitsandbytes` library, formally adding 4-bit quantization support to…
This Hugging Face blog post provides an in-depth exploration of how to use InstructPix2Pix technology to apply instruction tuning to Stable Diffusion, enabling…
Hugging Face has announced the launch of StarChat Alpha, a conversational AI assistant designed specifically for programming. The model is based on StarCoder…
This technical guide from Hugging Face provides a detailed walkthrough of how to efficiently train language models by combining TensorFlow, the Hugging Face…
This case study introduces a deep technical collaboration between Databricks and Hugging Face, aimed at addressing the efficiency and cost challenges…
With the explosion of foundation models and large language models (LLMs), enterprises are eager to incorporate these powerful technologies into real-world…
This classic blog post from Hugging Face provides an extremely valuable hands-on guide for the open-source community, detailing how to fine-tune the LLaMA…
ControlNet is a revolutionary technique that allows users to provide additional spatial conditioning — such as Canny edges, human pose skeletons, and depth…
After ChatGPT swept the globe in early 2023, the open-source community was desperately searching for self-controllable, low-cost alternatives. Meta's release…
This technical blog post from Replicate details how fine-tuning technology can be used to make the open-source large language model LLaMA mimic the distinctive…
In March 2023, Stanford University released the Alpaca model — a fine-tuned version of Meta's LLaMA-7B model trained on 52,000 instruction-following examples…
This technical blog post from Hugging Face introduces how to combine TRL (Transformer Reinforcement Learning) and PEFT (Parameter-Efficient Fine-Tuning)…
As the parameter scale of large language models (LLMs) continues to grow, full fine-tuning has become prohibitively expensive and impractical. To lower the…
The Replicate platform has officially launched support for LoRA (Low-Rank Adaptation) technology, bringing a major efficiency breakthrough to Stable Diffusion…
This classic blog post from Hugging Face provides a detailed guide on how to use LoRA (Low-Rank Adaptation) technology to efficiently fine-tune Stable…
As the scale of deep learning models (such as Transformers) continues to grow, training these models demands enormous computational resources and time. To help…
As large language models (LLMs) and generative AI exploded in popularity, demand for computing power surged dramatically, leaving Nvidia GPUs (such as the…
### The Rise of Protein Deep Learning Following the success of AlphaFold 2, artificial intelligence has achieved a revolutionary breakthrough in structural…
The Replicate platform has announced the launch of a DreamBooth API, a service designed to solve the cumbersome workflow that developers face when training and…
This is a classic technical guide written by Hugging Face, detailing how to use their open-source diffusers library to fine-tune Stable Diffusion models via…
OpenAI's Whisper is a powerful automatic speech recognition (ASR) model. While its zero-shot capabilities are impressive, there remains significant room for…
This classic technical blog post from Hugging Face systematically guides developers in understanding and mastering distributed training techniques within the…