Hugging Face's Open R1 project aims to fully open-source and replicate the training pipeline of DeepSeek-R1's reasoning model. In the latest fourth update…
Vercel has announced the release of AI SDK 4.2, the latest major update to the open-source AI framework designed for JavaScript and TypeScript developers. As…
Vercel has officially announced that three prominent AI infrastructure service providers — Groq, fal, and DeepInfra — have formally joined the Vercel…
Since its launch, Hugging Face's Open R1 project has been dedicated to replicating the reasoning capabilities of DeepSeek-R1 in a fully open-source manner. In…
As the hardware performance of mobile devices continues to improve, "edge inference" — running large language models (LLMs) directly on smartphones — has…
Hugging Face has officially published the second technical update (Update #2) for the Open R1 project, which aims to replicate DeepSeek-R1's reasoning model…
### Background and the Goals of the Open-R1 Project Since the release of DeepSeek-R1, its powerful reasoning capability and remarkably low training cost have…
### Project Background: Recreating the Open-Source Miracle of DeepSeek-R1 The emergence of DeepSeek-R1 sent shockwaves through the global AI community…
Hugging Face has officially launched the "Inference Providers" feature on the Hugging Face Hub — a major update designed to address the pain points developers…
On January 24, 2025, Hugging Face announced that smolagents — its open-source library designed for building lightweight, high-performance AI agents — now…
This technical blog post from Hugging Face provides a detailed benchmark of running large language models (LLMs) on Google Cloud Platform's (GCP) new C4…
Hugging Face launched a brand-new "Synthetic Data Generator" in December 2024 — a web-based, no-code tool designed to allow anyone to create high-quality AI…
Frontend cloud platform Vercel has announced a new partnership with cloud computing giant AWS (Amazon Web Services), jointly committed to simplifying and…
As large language models (LLMs) are increasingly applied in software development and logical reasoning, there is growing interest in whether models possess the…
### Background and Challenges: The Difficulty of Evaluating Non-English LLMs In the current landscape of large language model (LLM) development, evaluating…
This educational article from Hugging Face aims to guide readers — in the most intuitive, step-by-step way — to "reinvent" RoPE (Rotary Position Embedding)…
The slow autoregressive generation speed of large language models (LLMs) has long been a major bottleneck in real-world deployment. While "speculative…
This article from the Hugging Face blog introduces "The First Multilingual LLM Debate Competition." As large language models (LLMs) have rapidly advanced…
As large language models (LLMs) have rapidly advanced, traditional static benchmarks (such as MMLU) have increasingly faced saturation and gaming problems. As…
Vercel has officially released AI SDK 4.0, its flagship toolkit for building AI-powered applications. This update marks a maturation of the framework, with a…
This case study provides a detailed account of how non-profit organization Digital Green, with support from Hugging Face's Expert Support team, optimized its…
Hugging Face has officially launched HUGS (Hugging Face Microservices), a brand-new microservices solution designed to address the pain points enterprises face…
Hugging Face has officially launched Transformers.js v3, the most significant update to this web-based machine learning library since its release…
Meta's Llama 3.2 release includes lightweight 1B and 3B text models designed specifically for edge computing and mobile devices. These models have now been…
As generative AI applications become more widespread, one of the biggest challenges developers face is the "non-deterministic" output of large language models…
AMD has officially launched its 5th-generation EPYC processor, codenamed "Turin," and Hugging Face has promptly published a blog post detailing the deep…
Meta has officially introduced the Llama 3.2 family of open-source models, marking a significant architectural upgrade with two major breakthroughs: multimodal…
The deployment of large language models (LLMs) has long faced a dual bottleneck of VRAM capacity and memory bandwidth. Microsoft previously introduced the…
Hugging Face has officially introduced the "Community Tools" feature to its open-source chat platform, HuggingChat. This major update injects powerful Agent…
Meta's Llama 3.1 405B is one of the most powerful open-source large language models available today, but its massive parameter count (405 billion) poses…