The AI cloud hosting platform Replicate has announced a major fine-tuning speed optimization for FLUX.1, currently the most popular open-source image…
Hugging Face, the world's largest open-source AI platform, currently hosts over 1.2 million models, datasets, and Spaces applications. With the explosion of…
This educational article from Hugging Face aims to guide readers — in the most intuitive, step-by-step way — to "reinvent" RoPE (Rotary Position Embedding)…
This article from the Hugging Face blog introduces "The First Multilingual LLM Debate Competition." As large language models (LLMs) have rapidly advanced…
Hugging Face has officially launched the "Open Japanese LLM Leaderboard," a community-driven platform dedicated to evaluating the performance of…
The Hugging Face Hub currently hosts millions of AI models, datasets, and applications (Spaces), with total storage reaching the hundreds of petabytes. As the…
The slow autoregressive generation speed of large language models (LLMs) has long been a major bottleneck in real-world deployment. While "speculative…
As large language models (LLMs) have rapidly advanced, traditional static benchmarks (such as MMLU) have increasingly faced saturation and gaming problems. As…
Hugging Face's official blog has published an article inviting machine learning (ML) researchers and developers worldwide to share their…
Hugging Face and PyCharm — the renowned Python development tool from JetBrains — have announced a deep integration. This collaboration aims to streamline the…
The open-source data curation and annotation platform Argilla has officially released version 2.4, with the core of this update being deep integration with…
In the deployment and inference of large language models (LLMs), reducing generation latency has always been a critical challenge. The traditional approach of…
Cohere For AI (C4AI) has officially launched Aya Expanse, a family of open-weight models designed specifically for multilingual tasks. The family includes two…
On October 23, 2024, Google and Hugging Face jointly announced the open-sourcing of Google's "SynthID Text" technology and its integration into Hugging Face's…
CinePile is a multimodal question-answering dataset focused on movie and long-video understanding. In traditional dataset construction, researchers commonly…
Hugging Face has officially launched HUGS (Hugging Face Microservices), a brand-new microservices solution designed to address the pain points enterprises face…
As real-time voice interaction technologies like GPT-4o become more widespread, the open-source community is also actively developing speech-to-speech (S2S)…
In LLM application development, ensuring that a model's output fully conforms to a specific format, such as a JSON Schema, a regular expression, or…
Stability AI officially launched the Stable Diffusion 3.5 (SD3.5) model series in late October 2024, and Hugging Face's Diffusers team simultaneously announced…
As open-source AI models have flourished, Hugging Face has become the central platform through which developers worldwide access and share machine learning…
The Mathematical Flaw in Traditional Gradient Accumulation: Gradient accumulation is an extremely common technique in deep learning. When VRAM is limited…
AMD has officially launched its 5th-generation EPYC processor, codenamed "Turin," and Hugging Face has promptly published a blog post detailing the deep…
Hugging Face has published a technical blog post on "Dynamic Speculation," aimed at optimizing the inference speed of large language models (LLMs)…
Hugging Face has officially launched the "Open FinLLM Leaderboard" — a new platform dedicated to evaluating and tracking the performance of large language…
This article from the Hugging Face blog takes an in-depth look at how Chinese AI players have successfully expanded globally in recent years…
The Hugging Face team and its collaborators have jointly launched a new benchmark called "BenCzechMark," designed to evaluate the understanding and generation…
With the explosion of video generation and understanding models such as Sora and Gen-3, high-quality video training data has become a key battleground for…
This article provides a detailed look at how to use Hugging Face's `optimum-intel` library and Intel's OpenVINO GenAI toolkit to optimize and deploy generative…
The deployment of large language models (LLMs) has long faced a dual bottleneck of VRAM capacity and memory bandwidth. Microsoft previously introduced the…
Hugging Face has officially introduced the "Community Tools" feature to its open-source chat platform, HuggingChat. This major update injects powerful Agent…