When developing applications based on large language models (LLMs) — such as AI agents, RAG systems, or automated workflows — one of the biggest challenges…
Hugging Face has announced the official launch of the "Open Medical-LLM Leaderboard" in collaboration with researchers from Open Life Science AI and the…
Meta officially released Llama 3, the next generation of its open-source large language models, on April 18, 2024. The initial release includes two parameter…
This case study details how biomedical AI startup Ryght leveraged Hugging Face's Expert Support service to overcome the many challenges of deploying generative…
As code large language models (Code LLMs) develop rapidly, fairly and accurately evaluating their capabilities has become a major challenge. Traditional…
This tutorial article details how to build an efficient natural language to SQL (Text2SQL) query system using tools from the Hugging Face ecosystem and a…
Hugging Face and internet infrastructure giant Cloudflare have announced a major partnership that officially brings serverless GPU inference services to…
As large language models (LLMs) have been widely adopted across industries, ensuring AI systems remain safe and compliant while preventing harmful outputs has…
As the parameter counts of large language models (LLMs) have skyrocketed, the hardware requirements for training and fine-tuning these models have risen…
### Background: The Shortcomings of Static Safety Evaluations As large language models (LLMs) are widely adopted across industries, AI safety has become an…
This article takes an in-depth look at the critical role of "synthetic data" in the open-source ecosystem, and explains how it helps enterprises and developers…
Hugging Face has announced the launch of the new **NPHardEval** leaderboard — a benchmark specifically designed to evaluate the reasoning capabilities of large…
Hugging Face has partnered with AWS to officially bring its widely popular open-source LLM inference optimization framework, Text Generation Inference (TGI)…
Hugging Face has partnered with Patronus AI — a startup focused on LLM evaluation and defense — to officially launch the **Enterprise Scenarios Leaderboard**…
While large language models (LLMs) have demonstrated remarkable generative capabilities across many domains, "hallucination" — where a model confidently…
### Introduction: Capability Is Not Safety — A New Benchmark for LLM Safety Evaluation As large language models (LLMs) are adopted more deeply across…
This official Hugging Face blog post provides a detailed guide on how to use open-source large language models (LLMs) as intelligent agents within LangChain…
Hugging Face's official blog announced a partnership with the Unsloth team to integrate Unsloth's efficient fine-tuning technology directly into Hugging Face's…
Looking back on 2023, the most notable trend in the AI landscape was the explosive growth of open-source large language models (Open LLMs). In this annual…
AI infrastructure startup Replicate announced the successful completion of a $40 million Series B funding round. This round was led by prominent Silicon Valley…
In real-world generative AI applications, fine-tuning for specific tasks or clients is a common requirement. However, deploying a full base model for every…
Hugging Face announced the launch of a new open-source library called "Optimum-NVIDIA," the result of a deep collaboration with NVIDIA, aimed at seamlessly…
As large language models (LLMs) such as Llama 2 become more widely adopted, achieving efficient and cost-effective inference in production environments has…
This Hugging Face blog post takes an in-depth look at how to use LoRA (Low-Rank Adaptation) to fine-tune three models of different architectures and scales for…
In practical natural language processing (NLP) applications, converting unstructured text (such as emails or conversation logs) into structured data (such as…
As large language models (LLMs) shift toward conversational (Chat/Instruct) applications, correctly formatting and feeding a user's conversation history —…
This Hugging Face blog post presents detailed performance benchmarks for deploying Meta's open-source large language models — Llama 2 (covering 7B, 13B, and…
The Hugging Face official blog has announced a new "Inference for PROs" upgraded service for PRO subscribers (at $9 per month). This service is designed to…
When fine-tuning massively large open-source models like Llama 2 70B — with its 70 billion parameters — developers frequently encounter a bottleneck that goes…
AI cloud hosting platform Replicate has announced a major technical breakthrough for fine-tuned models: the "cold boot" time for fine-tuned models has been…