With the rapid development of vision-language models (VLMs) and multimodal AI, the amount of data required to train these models has grown explosively…
Hugging Face has announced the release of a brand-new generation of lightweight open-source models — SmolLM3. As the latest member of the SmolLM family…
In this article exploring the relationship between AI and human cognition, University of Pennsylvania Wharton School professor Ethan Mollick provides a deep…
Hugging Face and the UAE's Technology Innovation Institute (TII, the organization behind the well-known open-source model Falcon) have jointly announced a new…
This technical blog post from Hugging Face provides a detailed guide on how to train and fine-tune "Sparse Embedding Models" using the Sentence Transformers…
NVIDIA has partnered with Hugging Face to officially bring its latest lightweight vision-language model (VLM) — the **NVIDIA Llama Nemotron Nano VLM** — to the…
Google's open-source model family welcomes a new member! The all-new Gemma 3n model series is now fully available within the Hugging Face ecosystem. Gemma 3n…
Google DeepMind has unveiled a groundbreaking new unified DNA sequence model called "AlphaGenome" — a major milestone in DeepMind's work in the life sciences…
Google DeepMind has released the "Gemini Robotics On-Device" model, a significant breakthrough that brings advanced Gemini AI capabilities directly to local…
University of Pennsylvania Wharton School professor Ethan Mollick recently published an extremely practical AI quick guide, "Using AI Right Now: A Quick…
SGLang (Structured Generation Language) is a high-performance LLM inference and serving framework developed by the LMSYS team, renowned for its efficient…
FLUX.1-dev is a state-of-the-art open-source text-to-image model with 12 billion parameters (12B), developed by Black Forest Labs. However, due to its enormous…
Google DeepMind today announced a major update to the Gemini 2.5 thinking models family, aimed at improving overall performance and accuracy while providing…
Google DeepMind today announced a major advancement for the Gemini 2.5 model family. First, the previously preview-stage Gemini 2.5 Flash and Gemini 2.5 Pro…
Hugging Face announced a deep partnership with Groq, a chip company focused on ultra-fast AI inference, formally bringing Groq into the Hugging Face "Inference…
Google DeepMind has announced the launch of a new platform called "Weather Lab," designed to showcase and provide access to its experimental AI technology for…
As the context windows of large language models (LLMs) continue to expand — from the early 4k and 8k, to the now-common 32k and even 128k or more — users have…
The Hugging Face official blog published a "Get Started with Hugging Face Kernel Hub in 5 Minutes" tutorial, formally introducing this new platform to the…
Hugging Face officially announced a partnership with Featherless AI, a serverless GPU inference platform, integrating it into the Hugging Face Inference…
As embodied AI develops rapidly, deploying powerful robotics foundation models onto specific hardware has become a key challenge. NVIDIA and Hugging Face have…
Hugging Face has announced a new partnership with AI chip giant NVIDIA, launching "Training Cluster as a Service" (TCaaS). The introduction of this service…
As large language models (LLMs) have evolved, AI applications have moved beyond simple "question-and-answer conversations" toward "AI Agents" capable of…
As artificial intelligence moves beyond simple "text-based conversation" into the era of Agents (intelligent agents) that actively execute tasks, enabling AI…
In the inference process of large language models (LLMs) and vision-language models (VLMs), autoregressive decoding is a major performance bottleneck. Each…
Google DeepMind has announced that its latest-generation model, Gemini 2.5, has achieved new breakthroughs in AI-driven audio dialog and audio generation. This…
As generative AI technology becomes more widespread, AI Sound Generation has become an indispensable part of modern multimedia creation, game development, and…
H (formerly Holistic AI), a highly regarded French AI startup, recently officially released a new family of vision-language models (VLMs) on the Hugging Face…
In the reinforcement learning from human feedback (RLHF) training process for large language models — whether PPO or the recently popular GRPO — there are…
Hugging Face has recently taken an important step in the field of embodied AI, officially launching **SmolVLA** — a lightweight Vision-Language-Action (VLA)…
In this Hugging Face blog post, the team takes a deep dive into the evolution of AI agent architectures — specifically how to combine "structured constraints"…