The Hugging Face team and its collaborators have jointly launched a new benchmark called "BenCzechMark," designed to evaluate the understanding and generation…
In the field of 3D generative AI (encompassing models such as InstantMesh and Tripo3D), generated 3D models typically represent color using "vertex coloring."…
Meta has officially introduced the Llama 3.2 family of open-source models, marking a significant architectural upgrade with two major breakthroughs: multimodal…
With the explosion of video generation and understanding models such as Sora and Gen-3, high-quality video training data has become a key battleground for…
In today's rapidly evolving AI landscape, keeping up with the latest academic papers is a major challenge. Hugging Face's "Daily Papers" page was created…
This article provides a detailed look at how to use Hugging Face's `optimum-intel` library and Intel's OpenVINO GenAI toolkit to optimize and deploy generative…
The deployment of large language models (LLMs) has long faced a dual bottleneck of VRAM capacity and memory bandwidth. Microsoft previously introduced the…
Hugging Face has officially announced the launch of a brand-new "SQL Console" feature on the Datasets pages of the Hugging Face Hub. This feature is designed…
Hugging Face has officially introduced the "Community Tools" feature to its open-source chat platform, HuggingChat. This major update injects powerful Agent…
Hugging Face has officially released version 1.0.0 of its core open-source library, Accelerate. This is a milestone update, signifying that since the project's…
Hugging Face has officially announced a partnership with the well-known cybersecurity company Truffle Security, integrating the open-source credential scanning…
### Background and Challenges In robotics (such as Embodied AI), imitation learning and reinforcement learning require collecting large volumes of robot…
Hugging Face, as the world's largest open-source AI community, has developed many powerful tools beyond its well-known Model Hub that often go unnoticed by…
When fine-tuning or pre-training large language models (LLMs), the sequence lengths of input data are typically uneven. The traditional approach is to use…
Meta's Llama 3.1 405B is one of the most powerful open-source large language models available today, but its massive parameter count (405 billion) poses…
This edition of Replicate Intelligence #11 compiles major recent technical breakthroughs and application trends in the generative AI space, focusing primarily…
This Hugging Face blog post provides a detailed account of the team's attempt to reproduce and evaluate Google's proposed "Infini-Attention" mechanism — and…
GGML is a lightweight, zero-dependency C/C++ tensor library developed by Georgi Gerganov. It was originally designed to enable efficient local inference of the…
The Technology Innovation Institute (TII) of Abu Dhabi has officially released Falcon Mamba 7B, a significant milestone in the evolution of AI architectures…
### Background and Pain Points In AI agent development, "tool use" (also known as function calling) is the core capability that allows large language models…
AI community platform Hugging Face announced on August 8, 2024, the acquisition of XetHub, a startup specializing in solving large-scale data versioning…
As AI models have proliferated, AI supply chain security has become a top priority that developers and enterprises can no longer afford to overlook. Hugging…
### Solving Real-World Document AI Pain Points In the fields of Document AI and OCR (Optical Character Recognition), datasets used in academic research or…
This edition of Replicate Intelligence highlights three of the most noteworthy open-source and developer tool developments from early August 2024: 1. **The new…
Black Forest Labs — a new AI team founded by the original creators of Stable Diffusion (including core developer Robin Rombach and others) — has officially…
Google released a major update to the Gemma 2 family in late July 2024, comprising three core components: 1. **Gemma 2 2B**: A lightweight model with just 2.6B…
### Background and Challenges As generative AI technology evolves, image and video generation models are increasingly transitioning from traditional UNet…
Hugging Face and NVIDIA announced a major partnership in late July 2024, officially launching a serverless inference service powered by NVIDIA NIM (NVIDIA…
Replicate has published its eighth issue of technical intelligence (Replicate Intelligence #8), bringing three major updates for developers: 1. **Top…
### Background and Challenges Document Visual Question Answering (DocVQA) is an important application of multimodal AI, requiring models to simultaneously…