With the proliferation of GPT-4o, Gemini Live, and various end-to-end voice models, Voice Agents have become an important frontier in AI applications. However…
When building Retrieval-Augmented Generation (RAG) systems, general-purpose embedding models (such as those from OpenAI or common open-source alternatives)…
Hugging Face has published its Spring 2026 "State of Open Source AI" report, offering a comprehensive review of the explosive growth and paradigm shifts that…
Hcompany has officially released a new model on Hugging Face called **Holotron-12B**, positioned as a "High Throughput Computer Use Agent." Although only the…
This article, from Nathan Lambert's well-known AI newsletter Interconnects, offers a deep examination of the critical turning point that open-source language…
This issue of Import AI (No. 449) dives deep into several core frontier topics in the current AI landscape, spanning technical breakthroughs and broad…
With the success of reasoning models such as DeepSeek-R1, reinforcement learning (RL/RLHF) has become a critical technique for improving the alignment and…
The official Hugging Face blog has announced the launch of "Storage Buckets" on the Hugging Face Hub. This is a cloud object storage service designed…
Hugging Face has officially released version 0.5.0 of its open-source robot learning library, LeRobot, under the theme "Scaling Every Dimension." Since its…
As large language models (LLMs) push the demand for long context toward the million-token scale, the VRAM of a single GPU can no longer accommodate the…
In this column published in Interconnects, author Nathan Lambert cites the latest observations from policy expert Dean Ball on the high-profile "Anthropic v…
As large language models (LLMs) continue to evolve, the traditional pure-Transformer architecture faces physical bottlenecks in computational efficiency and…
Hugging Face has entered into a deep collaboration with semiconductor giant NXP (NXP Semiconductors), aimed at solving the challenge of deploying advanced…
The Hugging Face official blog has announced the launch of "Modular Diffusers" — a major architectural overhaul of its widely popular `diffusers` library. In…
This is Issue #19 of the "Latest Open Artifacts" column by well-known AI industry analyst Nathan Lambert, opening with "Welcome to the year of the horse!" It…
Mixture of Experts (MoE) has become the mainstream architecture for current large language models (LLMs). This article takes an in-depth look at how MoE…
Anthropic recently published research on "distillation attacks," defining the practice of external developers using its API outputs to train other models as a…
Hugging Face's official blog has announced exciting news for the open-source AI community: Hugging Face has formed a deep partnership with Unsloth — the…
A historic milestone has arrived in the open-source AI world: GGML and llama.cpp — the open-source projects founded by Georgi Gerganov that laid the…
### The Pain Points of Enterprise AI Agents in Production: Why Do They Keep Failing? As large language models (LLMs) have rapidly advanced, enterprises have…
This article by Nathan Lambert takes a deep dive into the tangled competitive dynamics between open-source and closed-source AI models. Lambert argues that…
As the demand for computational efficiency in deep learning models continues to grow, writing custom CUDA kernels (GPU core programs) has become a key…
As AI Agent (intelligent agent) technology advances rapidly, evaluating how these agents perform in the real world has become one of the greatest challenges…
Hugging Face officially published Transformers.js v4 on NPM, marking a major milestone for running local AI models within the JavaScript ecosystem…
In today's era of rapid AI advancement, major model vendors and research institutions are releasing all manner of "leaderboards" to claim their models surpass…
This blog post from Hugging Face, written in February 2026 — marking the one-year anniversary of the "DeepSeek Moment" (when DeepSeek-R1 and V3 shook the…
Photoroom, the well-known image editing platform, recently published a series of technical blog posts about their in-house text-to-image model, PRX. In Part 2…
Hugging Face has officially launched a new tool called "Daggr," billed as a way to "chain apps programmatically, inspect visually." As large language model…
### Background and Challenge: Why Is CUDA Programming So Hard for AI? CUDA (Compute Unified Device Architecture) is a parallel computing platform and…
This blog post from Hugging Face reviews the full year of technical evolution since the "DeepSeek Moment" at the start of 2025 — the release of DeepSeek-V3 and…