During the inference process of large language models (LLMs), the self-attention mechanism needs to store the Key and Value vectors of historical tokens (i.e…
Hugging Face has announced the launch of the "Open Arabic LLM Leaderboard," an important initiative aimed at advancing Arabic natural language processing (NLP)…
Google has officially launched PaliGemma, a powerful yet lightweight open-source Vision-Language Model (VLM). The release of PaliGemma represents a significant…
Hugging Face and LangChain have jointly announced the launch of a new official partner package, `langchain-huggingface`. This collaboration aims to provide…
Hugging Face has officially launched Transformers Agents 2.0, a major refactoring and upgrade of its existing Agent framework, designed to provide developers…
As enterprise demand for Retrieval-Augmented Generation (RAG) technology surges, how to maintain high performance while controlling hardware costs has become…
Hugging Face has announced that its enterprise-focused collaboration platform, "Enterprise Hub," is now officially available on AWS Marketplace. This…
Hugging Face has officially launched the "Open Leaderboard for Hebrew LLMs," an open-source evaluation platform specifically designed for Hebrew large language…
Hugging Face has announced a partnership with the independent AI performance analytics firm Artificial Analysis, officially integrating its "LLM Performance…
This technical blog post from Hugging Face introduces how to build a powerful and efficient speech processing system using Hugging Face Inference Endpoints — a…
When developing applications based on large language models (LLMs) — such as AI agents, RAG systems, or automated workflows — one of the biggest challenges…
### Background and Challenges In the field of code generation, instruction tuning is the key to improving a model's practical utility and alignment with human…
Hugging Face has announced the launch of the new "Open Chain of Thought (CoT) Leaderboard," a public platform specifically designed to evaluate and compare the…
Snowflake recently launched a brand-new open-source large language model called "Snowflake Arctic" — a Mixture of Experts (MoE) model designed for…
In the field of artificial intelligence, developing a "Generalist Agent" — one capable of chatting, writing, controlling robots, and playing video games all at…
Hugging Face has announced the official launch of the "Open Medical-LLM Leaderboard" in collaboration with researchers from Open Life Science AI and the…
Meta officially released Llama 3, the next generation of its open-source large language models, on April 18, 2024. The initial release includes two parameter…
This article introduces how to run privacy-preserving inference based on Fully Homomorphic Encryption (FHE) on Hugging Face Endpoints. In traditional…
As code large language models (Code LLMs) develop rapidly, fairly and accurately evaluating their capabilities has become a major challenge. Traditional…
This case study details how biomedical AI startup Ryght leveraged Hugging Face's Expert Support service to overcome the many challenges of deploying generative…
Gradio, one of the most popular frameworks for rapid AI prototyping, has officially introduced its powerful "Reload Mode" (hot-reload functionality). In the…
Hugging Face has announced the launch of Idefics2, the next generation of its open-source Vision Language Model (VLM). With 8 billion (8B) parameters, this…
This technical blog post published by Hugging Face provides an accessible yet thorough breakdown of the core principles and applications of Vision Language…
Hugging Face and Google Cloud have announced a deep strategic partnership, officially integrating thousands of popular open-source large language models (LLMs)…
Google and Hugging Face have jointly announced the launch of CodeGemma, a family of lightweight open-source large language models (LLMs) designed specifically…
Hugging Face has officially published its core positions and commitments on Public Policy. As global debates over AI regulation intensify — from the EU's AI…
This tutorial article details how to build an efficient natural language to SQL (Text2SQL) query system using tools from the Hugging Face ecosystem and a…
Hugging Face, as the world's largest hosting platform for open-source AI models, datasets, and applications (Spaces), has become indispensable infrastructure…
SetFit (Sentence Transformer Fine-Tuning) is a few-shot text classification framework co-developed by Hugging Face, Intel Labs, and other organizations. Rather…
Hugging Face and internet infrastructure giant Cloudflare have announced a major partnership that officially brings serverless GPU inference services to…