Google DeepMind today announced a major advancement for the Gemini 2.5 model family. First, the previously preview-stage Gemini 2.5 Flash and Gemini 2.5 Pro…
Hugging Face announced a deep partnership with Groq, a chip company focused on ultra-fast AI inference, formally bringing Groq into the Hugging Face "Inference…
Google DeepMind has announced the launch of a new platform called "Weather Lab," designed to showcase and provide access to its experimental AI technology for…
As the context windows of large language models (LLMs) continue to expand — from the early 4k and 8k, to the now-common 32k and even 128k or more — users have…
The Hugging Face official blog published a "Get Started with Hugging Face Kernel Hub in 5 Minutes" tutorial, formally introducing this new platform to the…
Hugging Face officially announced a partnership with Featherless AI, a serverless GPU inference platform, integrating it into the Hugging Face Inference…
As embodied AI develops rapidly, deploying powerful robotics foundation models onto specific hardware has become a key challenge. NVIDIA and Hugging Face have…
Hugging Face has announced a new partnership with AI chip giant NVIDIA, launching "Training Cluster as a Service" (TCaaS). The introduction of this service…
As large language models (LLMs) have evolved, AI applications have moved beyond simple "question-and-answer conversations" toward "AI Agents" capable of…
As artificial intelligence moves beyond simple "text-based conversation" into the era of Agents (intelligent agents) that actively execute tasks, enabling AI…
In the inference process of large language models (LLMs) and vision-language models (VLMs), autoregressive decoding is a major performance bottleneck. Each…
Google DeepMind has announced that its latest-generation model, Gemini 2.5, has achieved new breakthroughs in AI-driven audio dialog and audio generation. This…
As generative AI technology becomes more widespread, AI Sound Generation has become an indispensable part of modern multimedia creation, game development, and…
H (formerly Holistic AI), a highly regarded French AI startup, recently officially released a new family of vision-language models (VLMs) on the Hugging Face…
In the reinforcement learning from human feedback (RLHF) training process for large language models — whether PPO or the recently popular GRPO — there are…
Hugging Face has recently taken an important step in the field of embodied AI, officially launching **SmolVLA** — a lightweight Vision-Language-Action (VLA)…
In this Hugging Face blog post, the team takes a deep dive into the evolution of AI agent architectures — specifically how to combine "structured constraints"…
Since the explosive rise of DeepSeek-R1, GRPO (Group Relative Policy Optimization) has become the most widely discussed reinforcement learning (RL) technique…
As enterprises place ever-increasing demands on data privacy, security, and regulatory compliance, deploying AI models on-premises has become the preferred…
Hugging Face recently published a highly practical technical tutorial demonstrating how to build a fully functional miniature AI agent in just around 70 lines…
In this article, University of Pennsylvania Wharton School professor Ethan Mollick explores the common challenges enterprises face when adopting generative AI…
Replicate, the well-known AI model hosting and deployment platform, has announced a major update: it now officially supports OpenAI's latest-generation models…
The Technology Innovation Institute (TII) of the UAE recently officially unveiled a brand-new open-source language model series on the Hugging Face blog —…
The Technology Innovation Institute (TII) of Abu Dhabi has officially released a new language model series called "Falcon-Arabic" on the Hugging Face platform…
Hugging Face recently launched an open-source project called nanoVLM, positioned as "the simplest repository for training Vision Language Models (VLMs) in pure…
As diffusion models (such as Flux.1 and Stable Diffusion 3) continue to grow in parameter count — often reaching tens of billions or even hundreds of billions…
Google DeepMind has officially released a preview of its new open model "Gemma 3n." This is a cutting-edge open model purpose-built for mobile devices and…
Google DeepMind today announced important updates to its flagship model series, Gemini 2.5. The most noteworthy highlight of this update is a brand-new…
At Google I/O 2025, Google DeepMind announced the launch of the new "SynthID Detector" portal. This tool is designed to address the increasingly serious…
Google DeepMind recently published its latest vision for building a "Universal AI Assistant." In this blueprint, the core technical evolution lies in extending…