As large language models (LLMs) have been widely adopted across industries, ensuring AI systems remain safe and compliant while preventing harmful outputs has…
As the parameter counts of large language models (LLMs) have skyrocketed, the hardware requirements for training and fine-tuning these models have risen…
### Background: The Shortcomings of Static Safety Evaluations As large language models (LLMs) are widely adopted across industries, AI safety has become an…
This article takes an in-depth look at the critical role of "synthetic data" in the open-source ecosystem, and explains how it helps enterprises and developers…
Hugging Face has announced the launch of the new **NPHardEval** leaderboard — a benchmark specifically designed to evaluate the reasoning capabilities of large…
Hugging Face has partnered with AWS to officially bring its widely popular open-source LLM inference optimization framework, Text Generation Inference (TGI)…
Hugging Face has partnered with Patronus AI — a startup focused on LLM evaluation and defense — to officially launch the **Enterprise Scenarios Leaderboard**…
Meta's Code Llama 70B is a powerful open-source large language model designed specifically for code generation and understanding. However, running such a…
While large language models (LLMs) have demonstrated remarkable generative capabilities across many domains, "hallucination" — where a model confidently…
### Introduction: Capability Is Not Safety — A New Benchmark for LLM Safety Evaluation As large language models (LLMs) are adopted more deeply across…
This official Hugging Face blog post provides a detailed guide on how to use open-source large language models (LLMs) as intelligent agents within LangChain…
Hugging Face's official blog announced a partnership with the Unsloth team to integrate Unsloth's efficient fine-tuning technology directly into Hugging Face's…
Looking back on 2023, the most notable trend in the AI landscape was the explosive growth of open-source large language models (Open LLMs). In this annual…
Hugging Face announced the launch of a new open-source library called "Optimum-NVIDIA," the result of a deep collaboration with NVIDIA, aimed at seamlessly…
AI infrastructure startup Replicate announced the successful completion of a $40 million Series B funding round. This round was led by prominent Silicon Valley…
In real-world generative AI applications, fine-tuning for specific tasks or clients is a common requirement. However, deploying a full base model for every…
As large language models (LLMs) such as Llama 2 become more widely adopted, achieving efficient and cost-effective inference in production environments has…
This Hugging Face blog post takes an in-depth look at how to use LoRA (Low-Rank Adaptation) to fine-tune three models of different architectures and scales for…
In practical natural language processing (NLP) applications, converting unstructured text (such as emails or conversation logs) into structured data (such as…
As large language models (LLMs) shift toward conversational (Chat/Instruct) applications, correctly formatting and feeding a user's conversation history —…
"AI Comic Factory" is one of the most popular open-source AI applications on the Hugging Face platform. It allows users to simply enter a brief plot…
This official guide from Hugging Face is designed for readers without a technical background. It provides a detailed walkthrough of how to use Hugging Face's…
This Hugging Face blog post presents detailed performance benchmarks for deploying Meta's open-source large language models — Llama 2 (covering 7B, 13B, and…
The Hugging Face official blog has announced a new "Inference for PROs" upgraded service for PRO subscribers (at $9 per month). This service is designed to…
When fine-tuning massively large open-source models like Llama 2 70B — with its 70 billion parameters — developers frequently encounter a bottleneck that goes…
AI cloud hosting platform Replicate has announced a major technical breakthrough for fine-tuned models: the "cold boot" time for fine-tuned models has been…
Meta has officially launched Code Llama, a family of state-of-the-art open-source code generation models fine-tuned on Llama 2. Code Llama achieves leading…
This Hugging Face official blog post introduces a major update that integrates AutoGPTQ into the `transformers` and `optimum` libraries. GPTQ (Generalized…
Hugging Face has officially launched IDEFICS (Image-supervised Decoder-Encoder-Few-shot-In-Context-Shorthand), an open-source multimodal vision-language model…
This is a practical prompting guide for Llama 2, written by the Replicate team. Unlike OpenAI's GPT series models, the open-source model Llama 2 has extremely…