In the inference process of large language models (LLMs) and vision-language models (VLMs), autoregressive decoding is a major performance bottleneck. Each…
As generative AI technology becomes more widespread, AI Sound Generation has become an indispensable part of modern multimedia creation, game development, and…
Hugging Face has recently taken an important step in the field of embodied AI, officially launching **SmolVLA** — a lightweight Vision-Language-Action (VLA)…
In the reinforcement learning from human feedback (RLHF) training process for large language models — whether PPO or the recently popular GRPO — there are…
### FLUX.1 Kontext Sparks a New Wave of "In-Context Image Generation"

Since Black Forest Labs introduced FLUX.1, this open-source image generation model has…
Black Forest Labs (the development team behind the FLUX series of models) has launched a new image editing model called "FLUX.1 Kontext." This model is…
In this Hugging Face blog post, the team takes a deep dive into the evolution of AI agent architectures — specifically how to combine "structured constraints"…
Since the explosive rise of DeepSeek-R1, GRPO (Group Relative Policy Optimization) has become the most widely discussed reinforcement learning (RL) technique…
Hugging Face recently published a highly practical technical tutorial demonstrating how to build a fully functional miniature AI agent in roughly 70 lines…
As enterprises place ever-increasing demands on data privacy, security, and regulatory compliance, deploying AI models on-premises has become the preferred…
The Technology Innovation Institute (TII) of the UAE recently unveiled a new open-source language model series on the Hugging Face blog:…
The Technology Innovation Institute (TII) of Abu Dhabi has officially released a new language model series called "Falcon-Arabic" on the Hugging Face platform…
Hugging Face recently launched an open-source project called nanoVLM, positioned as "the simplest repository for training Vision Language Models (VLMs) in pure…
As diffusion models (such as FLUX.1 and Stable Diffusion 3) continue to grow in parameter count, often reaching tens of billions or even hundreds of billions…
### Product Background and Developer Pain Points

As generative AI applications become more widespread, developers building these apps typically need to connect…
Google DeepMind has officially released a preview of its new open model "Gemma 3n." This is a cutting-edge open model purpose-built for mobile devices and…
Microsoft and open-source AI community leader Hugging Face have announced a further expansion of their strategic partnership. At the heart of this…
The Technology Innovation Institute (TII) of the United Arab Emirates has officially released the "Falcon-Edge" model series on Hugging Face. This is a family…
Replicate, a managed AI inference platform, has announced a close partnership with Hugging Face, the hub of the open-source AI community, officially bringing…
Hugging Face's `transformers` library has become the cornerstone of the global open-source AI community and large language model (LLM) development. However, as…
Hugging Face and Kaggle, the data science community owned by Google, have announced a deep integration aimed at providing Kaggle users with a more…
Hugging Face recently announced a brand-new, ultra-fast optimized deployment solution for OpenAI's open-source speech recognition model Whisper on its hosted…
With the explosion of multimodal technology, Vision Language Models (VLMs) have evolved from laboratory research prototypes into core tools for enterprises and…
With the release of Qwen3, Hugging Face's official blog published an in-depth breakdown of its chat template. Chat templates are the critical bridge…
As large language models (LLMs) and vision language models (VLMs) continue to scale up, running these models on limited hardware resources — such as…
ServiceNow recently published a new open-source project called PipelineRL on the Hugging Face platform. As large language model (LLM) and AI agent systems move…
In this Hugging Face blog post, the team demonstrates how to implement a fully functional, lightweight AI agent (referred to as a "Tiny Agent") that supports…
### Background

With the proliferation of vision-language models (VLMs), using VLMs for document OCR (e.g., converting PDFs to Markdown) has become mainstream…
When deploying large language models (LLMs), maintaining low latency and high throughput under high concurrency (many simultaneous requests) is one of the greatest…
Hugging Face's official blog announced that Cohere, the well-known enterprise AI research and development company, has officially joined Hugging Face's…