As large language models (LLMs) have developed, the industry has gradually come to recognize the limitations of the "single model does everything" approach…
Hugging Face has officially launched the Ettin Suite, a brand-new state-of-the-art (SoTA) open-source model family of "Paired Encoders and Decoders." In…
Hugging Face's AI-MO (AI Math Olympiad) team has officially published Kimina-Prover, a research paper demonstrating how "test-time reinforcement learning…
Hugging Face has officially announced the launch of its dedicated MCP (Model Context Protocol) server — a major step in ecosystem integration. The Model…
With the rise of Anthropic's Claude 3.5 Sonnet "Computer Use" and various GUI-oriented multimodal models, "desktop agents" have become one of the hottest areas…
In the fields of robot learning and embodied AI, enabling controllers based on deep learning or large language/vision models (VLAs) to run in real time has…
With Anthropic's introduction of the Model Context Protocol (MCP) open standard, the way AI agents connect to external tools and data sources has become…
As AMD Instinct MI300 series GPUs (such as the MI300X) gradually increase their market share in the AI compute market, how to perform low-level optimization…
Hugging Face recently announced a collaboration with Pollen Robotics to launch a new open-source robotics platform called "Reachy Mini," designed to provide…
Hugging Face has announced the release of a brand-new generation of lightweight open-source models — SmolLM3. As the latest member of the SmolLM family…
With the rapid development of vision-language models (VLMs) and multimodal AI, the amount of data required to train these models has grown explosively…
Hugging Face and the UAE's Technology Innovation Institute (TII, the organization behind the well-known open-source model Falcon) have jointly announced a new…
This technical blog post from Hugging Face provides a detailed guide on how to train and fine-tune "Sparse Embedding Models" using the Sentence Transformers…
Google's open-source model family welcomes a new member! The all-new Gemma 3n model series is now fully available within the Hugging Face ecosystem. Gemma 3n…
SGLang (Structured Generation Language) is a high-performance LLM inference and serving framework developed by the LMSYS team, renowned for its efficient…
FLUX.1-dev is a state-of-the-art open-source text-to-image model with 12 billion parameters (12B), developed by Black Forest Labs. However, due to its enormous…
Hugging Face announced a deep partnership with Groq, a chip company focused on ultra-fast AI inference, formally bringing Groq into the Hugging Face "Inference…
As the context windows of large language models (LLMs) continue to expand — from the early 4k and 8k, to the now-common 32k and even 128k or more — users have…
Hugging Face officially announced a partnership with Featherless AI, a serverless GPU inference platform, integrating it into the Hugging Face Inference…
The Hugging Face official blog published a "Get Started with Hugging Face Kernel Hub in 5 Minutes" tutorial, formally introducing this new platform to the…
Hugging Face has announced a new partnership with AI chip giant NVIDIA, launching "Training Cluster as a Service" (TCaaS). The introduction of this service…
In the inference process of large language models (LLMs) and vision-language models (VLMs), autoregressive decoding is a major performance bottleneck. Each…
As generative AI technology becomes more widespread, AI Sound Generation has become an indispensable part of modern multimedia creation, game development, and…
Hugging Face has recently taken an important step in the field of embodied AI, officially launching **SmolVLA** — a lightweight Vision-Language-Action (VLA)…
In the reinforcement learning from human feedback (RLHF) training process for large language models — whether PPO or the recently popular GRPO — there are…
In this Hugging Face blog post, the team takes a deep dive into the evolution of AI agent architectures — specifically how to combine "structured constraints"…
Since the explosive rise of DeepSeek-R1, GRPO (Group Relative Policy Optimization) has become the most widely discussed reinforcement learning (RL) technique…
Hugging Face recently published a highly practical technical tutorial demonstrating how to build a fully functional miniature AI agent in just around 70 lines…
As enterprises place ever-increasing demands on data privacy, security, and regulatory compliance, deploying AI models on-premises has become the preferred…
The Technology Innovation Institute (TII) of the UAE recently officially unveiled a brand-new open-source language model series on the Hugging Face blog —…