AMD and Hugging Face have jointly announced the "AMD Pervasive AI Developer Contest," a global competition designed to inspire developers worldwide to build…
In the large language model (LLM) space, the Mixture of Experts (MoE) architecture (as seen in models like Mixtral 8x7B) has proven capable of dramatically…
Hugging Face has announced the launch of the new **NPHardEval** leaderboard — a benchmark specifically designed to evaluate the reasoning capabilities of large…
Hugging Face has partnered with AWS to officially bring its widely popular open-source LLM inference optimization framework, Text Generation Inference (TGI)…
This blog post from Hugging Face provides an in-depth exploration of how to implement "Constitutional AI (CAI)" using open-source large language models (Open…
Hugging Face has partnered with Patronus AI — a startup focused on LLM evaluation and defense — to officially launch the **Enterprise Scenarios Leaderboard**…
While large language models (LLMs) have demonstrated remarkable generative capabilities across many domains, "hallucination" — where a model confidently…
### Introduction: Capability Is Not Safety — A New Benchmark for LLM Safety Evaluation As large language models (LLMs) are adopted more deeply across…
Hugging Face and Google officially announced a landmark strategic partnership at the end of January 2024. The core objective of this collaboration is to deeply…
This official Hugging Face blog post provides a detailed guide on how to use open-source large language models (LLMs) as intelligent agents within LangChain…
This technical blog post from Hugging Face takes an in-depth look at the latest techniques in "preference tuning," with a particular focus on **Direct…
SD Turbo and SDXL Turbo are single-step/few-step text-to-image models from Stability AI, with their core innovation being Adversarial Diffusion Distillation…
In the open-source AI community, the Hugging Face Open LLM Leaderboard serves as an important benchmark for evaluating model capabilities. However, many…
Hugging Face's official blog announced a partnership with the Unsloth team to integrate Unsloth's efficient fine-tuning technology directly into Hugging Face's…
The Hugging Face official blog formally introduced a brand-new open-source text-to-image model called "aMUSEd." This model is based on a reproduction and…
The Hugging Face official blog introduces how to use "Speculative Decoding" to more than double the inference speed of OpenAI's Whisper speech-to-text model…
Looking back on 2023, the most notable trend in the AI landscape was the explosive growth of open-source large language models (Open LLMs). In this annual…
French AI startup Mistral AI officially released its highly anticipated open-source Mixture of Experts (MoE) model — Mixtral 8x7B. The model caused a sensation…
Mixture of Experts (MoE) has become a core technology for improving the performance and efficiency of today's large language models (LLMs). Traditional "dense…
### Background and Challenges Sentiment analysis is one of the most classic tasks in natural language processing (NLP). However, traditional sentence-level…
Hugging Face announced the launch of a new open-source library called "Optimum-NVIDIA," the result of a deep collaboration with NVIDIA, aimed at seamlessly…
AI infrastructure startup Replicate announced the successful completion of a $40 million Series B funding round. This round was led by prominent Silicon Valley…
In real-world generative AI applications, fine-tuning for specific tasks or clients is a common requirement. However, deploying a full base model for every…
Hugging Face's official blog announced a deep partnership with chip giant AMD, launching `optimum-amd`, an open-source library optimized specifically for AMD…
The Hugging Face Open LLM Leaderboard has long served as an important benchmark for the community to evaluate the capabilities of open-source models. However…
The Yi model series is a bilingual (Chinese and English) large language model trained from scratch by 01.AI, the AI startup founded by Kai-Fu Lee. Upon its…
This technical blog post from Replicate provides a detailed introduction to using the open-source BGE (BAAI General Embedding) model for efficient, low-cost…
Prodigy, the well-known machine learning data annotation tool from Explosion (the company behind the popular NLP library spaCy), has officially released a…
This technical guide from Replicate provides detailed instructions on how to locally deploy and run Latent Consistency Models (LCMs) on Macs equipped with…
As large language models (LLMs) and Retrieval-Augmented Generation (RAG) technology become increasingly widespread, embedding models have become an…