Hugging Face continues to expand its "Inference Providers" program, aimed at enabling developers to run open-source models from Hugging Face Hub in the…
Hugging Face has officially released `LeRobotDataset:v3.0`, a critical technical upgrade to its open-source robot learning library `lerobot`, with the core…
With the explosive growth of generative AI — particularly in image and multimedia generation — how to identify AI-generated content and protect copyright has…
Vercel officially announced the launch of "402-mcp" in its Changelog — an important update that injects commercial capabilities into the Model Context Protocol…
Vercel has officially launched a new open protocol called "x402-mcp," designed to establish a standardized payment and billing mechanism for Model Context…
Writer, a leading provider of enterprise AI solutions, has officially announced the launch of its new "Palmyra-mini" model series on the Hugging Face platform…
### Background and the LLM Inference Bottleneck When running large language models (LLMs), autoregressive generation is inherently "memory-bandwidth-bound"…
Hugging Face and Together AI have announced a deep partnership, launching a new integration designed to streamline the fine-tuning workflow for open-source…
### Background and Core Concepts Traditional large language models (LLMs), when faced with complex mathematics, data analysis, or programming tasks, can…
In today's era dominated by generative AI and large language models (LLMs), bidirectional encoder models (such as BERT and RoBERTa) still play an indispensable…
When deploying modern AI models (such as LLaMA, Flux, or Stable Diffusion), `torch.compile` — introduced in PyTorch 2.0 — is a powerful performance…
Google has recently launched a new open-source text embedding model called "EmbeddingGemma" on the Hugging Face platform. This model is built on the…
NVIDIA has officially released a massive "Multi-Lingual Reasoning Dataset" containing 6 million samples on the Hugging Face platform. This significant…
As the use of AI in academic research becomes increasingly widespread, enabling large language models (LLMs) to access the latest scientific literature in real…
The AI-MO (AI Mathematical Olympiad) team at Hugging Face has officially released the "Kimina-Prover-RL" project. Following the previously well-received…
As generative AI advances rapidly, deploying massive models to resource-constrained edge devices — such as smartphones, smart hardware, and AI PCs — has become…
Arm and Hugging Face have announced a collaboration to launch "Neural Super Sampling (NSS)" technology and related models, officially bringing AI-driven image…
The Hugging Face team and community have collaborated to launch a new evaluation benchmark called "FilBench," aimed at answering a key question: do large…
Hugging Face has recently introduced a new benchmark called "TextQuests," designed to evaluate the performance of large language models (LLMs) in text-based…
Replicate has officially launched a remote MCP (Model Context Protocol) server. MCP is an open standard created by Anthropic that enables large language models…
As the parameter counts of generative AI and large language models (LLMs) push into the tens and hundreds of billions, the memory of a single GPU has long been…
Hugging Face has officially launched a new tool called "AI Sheets," an intuitive spreadsheet tool designed specifically for dataset processing. It aims to make…
Hugging Face's TRL (Transformer Reinforcement Learning) is a popular open-source library specifically designed for aligning language models (LLMs). In its…
The Hugging Face official blog has announced exciting news, formally welcoming OpenAI's newly launched open-source model family — "GPT OSS." This is undeniably…
As the Model Context Protocol (MCP) proposed by Anthropic gradually becomes the open standard for connecting large language models (LLMs) with external tools…
Hugging Face has officially launched a lightweight open-source experiment tracking library called **Trackio**, designed to offer machine learning developers…
As AI applications become more widespread, how to allow large language models (LLMs) to securely and efficiently access enterprise internal data or external…
This technical guide from Hugging Face takes an in-depth look at how to accelerate LoRA (Low-Rank Adaptation) inference for Flux.1, the powerful open-source…
As large multimodal models (LMMs) have achieved breakthroughs in image and short-video understanding, the industry has gradually shifted its attention to the…
Hugging Face and NVIDIA have announced a new collaboration to bring NVIDIA NIM (NVIDIA Inference Microservices) into the Hugging Face ecosystem, with the goal…