This technical blog post from Hugging Face provides a detailed benchmark of running large language models (LLMs) on Google Cloud Platform's (GCP) new C4…
Frontend cloud platform Vercel has announced a new partnership with cloud computing giant AWS (Amazon Web Services), jointly committed to simplifying and…
This educational article from Hugging Face aims to guide readers — in the most intuitive, step-by-step way — to "reinvent" RoPE (Rotary Position Embedding)…
This case study provides a detailed account of how non-profit organization Digital Green, with support from Hugging Face's Expert Support team, optimized its…
Hugging Face has officially launched HUGS (Hugging Face Microservices), a brand-new microservices solution designed to address the pain points enterprises face…
AMD has officially launched its 5th-generation EPYC processor, codenamed "Turin," and Hugging Face has promptly published a blog post detailing the deep…
Hugging Face has officially introduced the "Community Tools" feature to its open-source chat platform, HuggingChat. This major update injects powerful Agent…
### Background and Pain Points In AI agent development, "tool use" (also known as function calling) is the core capability that allows large language models…
Hugging Face and NVIDIA announced a major partnership in late July 2024, officially launching a serverless inference service powered by NVIDIA NIM (NVIDIA…
Following Apple's major Core ML updates announced at WWDC 24, Hugging Face published a practical guide detailing how to convert the popular open-source large…
The Hugging Face official blog has introduced a major update to its open-source text generation inference engine, Text Generation Inference (TGI): the…
Hugging Face officially announced a deep integration with KerasHub — the new unified library for natural language processing (NLP) and computer vision (CV) in…
### Background and Challenges France's Banque des Territoires (part of the Caisse des Dépôts et Consignations — CDC Group) is committed to promoting local…
Hugging Face has announced official support for AWS Inferentia2 (Inf2) instances within its hosted Inference Endpoints service. This update gives developers…
Hugging Face and Dell Technologies have announced the launch of the "Dell Enterprise Hub," a new solution designed for enterprise on-premise AI deployment. As…
During Microsoft Build 2024, Hugging Face announced a further strategic collaboration with Microsoft, aimed at providing developers with a more seamless…
With the explosive growth of generative AI, demand for high-performance GPUs has reached an unprecedented level. To break hardware monopolies and reduce AI…
As enterprise demand for Retrieval-Augmented Generation (RAG) technology surges, how to maintain high performance while controlling hardware costs has become…
Hugging Face has announced a partnership with the independent AI performance analytics firm Artificial Analysis, officially integrating its "LLM Performance…
Vercel has announced the launch of Vercel AI SDK 3.1, a major architectural upgrade. Accompanying this release, Lars Grammel — the founder of the well-known…
When developing applications based on large language models (LLMs) — such as AI agents, RAG systems, or automated workflows — one of the biggest challenges…
This case study details how biomedical AI startup Ryght leveraged Hugging Face's Expert Support service to overcome the many challenges of deploying generative…
Hugging Face has announced the launch of Idefics2, the next generation of its open-source Vision Language Model (VLM). With 8 billion (8B) parameters, this…
Hugging Face and internet infrastructure giant Cloudflare have announced a major partnership that officially brings serverless GPU inference services to…
Hugging Face has officially released Cosmopedia, currently the largest and fully open-source synthetic dataset designed for the pre-training of large language…
### Background: The Shortcomings of Static Safety Evaluations As large language models (LLMs) are widely adopted across industries, AI safety has become an…
This article takes an in-depth look at the critical role of "synthetic data" in the open-source ecosystem, and explains how it helps enterprises and developers…
Hugging Face has announced the launch of the new **NPHardEval** leaderboard — a benchmark specifically designed to evaluate the reasoning capabilities of large…
Hugging Face has partnered with AWS to officially bring its widely popular open-source LLM inference optimization framework, Text Generation Inference (TGI)…
Hugging Face has partnered with Patronus AI — a startup focused on LLM evaluation and defense — to officially launch the **Enterprise Scenarios Leaderboard**…