As multimodal large language models (such as GPT-4o, Gemini, and various open-source audio models) continue to proliferate, AI's ability to process audio has…
Despite the recent dominance of generative decoder models (such as GPT and Llama), encoder-only models (such as BERT) remain indispensable behind the scenes…
### Background and Architectural Innovation
As large language models (LLMs) have advanced rapidly, the traditional Transformer architecture faces severe…
This technical blog post from Hugging Face provides a detailed benchmark of running large language models (LLMs) on Google Cloud Platform's (GCP) new C4…
The Technology Innovation Institute (TII) of Abu Dhabi has officially launched the new Falcon 3 open-source model family on Hugging Face. This marks a major…
In the history of AI development, the open-sourcing of Stable Diffusion in 2022 is regarded as a pivotal turning point in the field of image generation — it…
Hugging Face launched a brand-new "Synthetic Data Generator" in December 2024 — a web-based, no-code tool designed to allow anyone to create high-quality AI…
Hugging Face announced a major partnership with Amazon Web Services (AWS), formally integrating Hugging Face's model library into the Amazon Bedrock service…
### Introduction: An Important Piece of the Open-Source Image Generation Puzzle
As text-to-image (T2I) technology advances rapidly, ensuring that AI-generated…
As large language models (LLMs) are increasingly applied in software development and logical reasoning, there is growing interest in whether models possess the…
Google and Hugging Face have jointly announced the release of PaliGemma 2, a new-generation open-weight vision-language model (VLM). This model continues…
### Background and Challenges: The Difficulty of Evaluating Non-English LLMs
In the current landscape of large language model (LLM) development, evaluating…
This case study from Hugging Face details how quantitative asset management firm Capital Fund Management (CFM) has optimized its investment and research…
With the EU AI Act officially taking effect, AI developers worldwide are facing compliance pressure. To help the open-source software (OSS) community understand the…
The AI cloud hosting platform Replicate has announced a major fine-tuning speed optimization for FLUX.1, currently the most popular open-source image…
Hugging Face, the world's largest open-source AI platform, currently hosts over 1.2 million models, datasets, and Space applications. With the explosion of…
Hugging Face has officially launched a lightweight vision-language model (VLM) called **SmolVLM**, designed to bring powerful multimodal understanding…
This educational article from Hugging Face aims to guide readers — in the most intuitive, step-by-step way — to "reinvent" RoPE (Rotary Position Embedding)…
The Replicate platform has officially launched "FLUX.1 Tools," a new suite of control tools for the open-source image generation model FLUX.1, designed to…
This article from the Hugging Face blog introduces "The First Multilingual LLM Debate Competition." As large language models (LLMs) have rapidly advanced…
Hugging Face has officially launched the "Open Japanese LLM Leaderboard," a community-driven platform dedicated to evaluating the performance of…
The Hugging Face Hub currently hosts millions of AI models, datasets, and applications (Spaces), with total storage reaching the hundreds of petabytes. As the…
The slow autoregressive generation speed of large language models (LLMs) has long been a major bottleneck in real-world deployment. While "speculative…
As large language models (LLMs) have rapidly advanced, traditional static benchmarks (such as MMLU) have increasingly faced saturation and gaming problems. As…
Hugging Face's official blog has published an article inviting machine learning (ML) researchers and developers worldwide to share their…
Hugging Face and PyCharm — the renowned Python development tool from JetBrains — have announced a deep integration. This collaboration aims to streamline the…
The open-source data curation and annotation platform Argilla has officially released version 2.4, with the core of this update being deep integration with…
In the deployment and inference of large language models (LLMs), reducing generation latency has always been a critical challenge. The traditional approach of…
Cohere For AI (C4AI) has officially launched Aya Expanse, a family of open-weight models designed specifically for multilingual tasks. The family includes two…
CinePile is a multimodal question-answering dataset focused on movie and long-video understanding. In traditional dataset construction, researchers commonly…