As large multimodal models (LMMs) have achieved breakthroughs in image and short-video understanding, the industry has gradually shifted its attention to the…
This technical guide from Hugging Face takes an in-depth look at how to accelerate LoRA (Low-Rank Adaptation) inference for Flux.1, the powerful open-source…
Hugging Face and NVIDIA have announced a new collaboration to bring NVIDIA NIM (NVIDIA Inference Microservices) into the Hugging Face ecosystem, with the goal…
Vercel announced in its product changelog that its AI Gateway service now officially supports "OpenAI-compatible API endpoints." This is a practical feature…
In the field of AI image generation, maintaining visual consistency for the same character across different scenes, actions, and expressions — known as…
As large language models (LLMs) have developed, the industry has gradually come to recognize the limitations of the "single model does everything" approach…
Hugging Face has officially launched the Ettin Suite, a brand-new state-of-the-art (SoTA) open-source model family of "Paired Encoders and Decoders." In…
Vercel has officially launched "AI Cloud," a unified platform designed specifically for AI workloads. This marks a major transformation for Vercel from a…
Hugging Face's AI-MO (AI Math Olympiad) team has officially published Kimina-Prover, a research paper demonstrating how "test-time reinforcement learning…
Hugging Face has officially announced the launch of its dedicated MCP (Model Context Protocol) server — a major step in ecosystem integration. The Model…
With the rise of Anthropic's Claude 3.5 Sonnet "Computer Use" and various GUI-oriented multimodal models, "desktop agents" have become one of the hottest areas…
In the fields of robot learning and embodied AI, enabling controllers based on deep learning or large language/vision models (VLAs) to run in real time has…
With Anthropic's introduction of the Model Context Protocol (MCP) open standard, the way AI agents connect to external tools and data sources has become…
As AMD Instinct MI300 series GPUs (such as the MI300X) gradually increase their market share in the AI compute market, how to perform low-level optimization…
Hugging Face recently announced a collaboration with Pollen Robotics to launch a new open-source robotics platform called "Reachy Mini," designed to provide…
Hugging Face has announced the release of a brand-new generation of lightweight open-source models — SmolLM3. As the latest member of the SmolLM family…
With the rapid development of vision-language models (VLMs) and multimodal AI, the amount of data required to train these models has grown explosively…
AI video generation technology has made breakthrough advances over the past year — from closed-source systems like Sora and Runway to a flourishing open-source…
Hugging Face and the UAE's Technology Innovation Institute (TII, the organization behind the well-known open-source model Falcon) have jointly announced a new…
Cloud AI deployment platform Replicate recently announced that the "FLUX.1 Kontext Hackathon," co-hosted with renowned open-source image generation model…
This technical blog post from Hugging Face provides a detailed guide on how to train and fine-tune "Sparse Embedding Models" using the Sentence Transformers…
Google's open-source model family welcomes a new member! The all-new Gemma 3n model series is now fully available within the Hugging Face ecosystem. Gemma 3n…
SGLang (Structured Generation Language) is a high-performance LLM inference and serving framework developed by the LMSYS team, renowned for its efficient…
FLUX.1-dev is a state-of-the-art open-source text-to-image model with 12 billion parameters (12B), developed by Black Forest Labs. However, due to its enormous…
Hugging Face announced a deep partnership with Groq, a chip company focused on ultra-fast AI inference, formally bringing Groq into the Hugging Face "Inference…
As the "Model Context Protocol" (MCP) proposed by Anthropic gradually becomes an open standard for connecting AI tools to external data sources, how to…
As the context windows of large language models (LLMs) continue to expand — from the early 4k and 8k, to the now-common 32k and even 128k or more — users have…
Hugging Face officially announced a partnership with Featherless AI, a serverless GPU inference platform, integrating it into the Hugging Face Inference…
The Hugging Face official blog published a "Get Started with Hugging Face Kernel Hub in 5 Minutes" tutorial, formally introducing this new platform to the…
Hugging Face has announced a new partnership with AI chip giant NVIDIA, launching "Training Cluster as a Service" (TCaaS). The introduction of this service…