Replicate has published its technical newsletter, Replicate Intelligence #4, summarizing recent major developments in the AI field as well as the latest…
In the era of large language models (LLMs), the VRAM of a single GPU is often insufficient to hold models with tens of billions of parameters. To overcome this…
In recent years, methods such as Direct Preference Optimization (DPO) have become mainstream for large language model (LLM) alignment, as they eliminate the…
Hugging Face announced on its blog that the diffusers library now supports Stable Diffusion 3 (SD3), the latest release from Stability AI. SD3…
Stability AI officially launched its latest text-to-image model — Stable Diffusion 3 (SD3). This release brings major upgrades across several key areas…
Hugging Face has announced the launch of a new Hugging Face Embedding container (Deep Learning Container, DLC) designed specifically for Amazon SageMaker. This…
Hugging Face's official blog published an article titled "Making sense of this mess," announcing a comprehensive redesign of the official documentation for its…
Hugging Face has announced the launch of "NPC-Playground," a 3D interactive sandbox environment designed to showcase and test non-player characters (NPCs)…
Replicate's technical newsletter, Replicate Intelligence #2, takes a deep dive into three of the most hotly discussed trends in the open-source AI community…
This official Hugging Face blog post takes an in-depth look at how to benchmark Text Generation Inference (TGI), Hugging Face's open-source LLM inference and…
The official Hugging Face blog introduces a major update to the Sentence Transformers library (v3.0), centered on the launch of the new…
The Technology Innovation Institute (TII) of Abu Dhabi has officially released a new open-source model family on Hugging Face — Falcon 2 11B. This model, with…
AI model hosting platform Replicate published a security advisory on May 23, 2024, disclosing a "Shared Network Vulnerability" affecting its multi-tenant…
Hugging Face has announced official support for AWS Inferentia2 (Inf2) instances within its hosted Inference Endpoints service. This update gives developers…
Hugging Face and Dell Technologies have announced the launch of the "Dell Enterprise Hub," a new solution designed for enterprise on-premise AI deployment. As…
During Microsoft Build 2024, Hugging Face announced a further strategic collaboration with Microsoft, aimed at providing developers with a more seamless…
With the explosive growth of generative AI, demand for high-performance GPUs has reached an unprecedented level. To break hardware monopolies and reduce AI…
During large language model (LLM) inference, the self-attention mechanism must store the Key and Value vectors of previously processed tokens (i.e…
Hugging Face has announced the launch of the "Open Arabic LLM Leaderboard," an important initiative aimed at advancing Arabic natural language processing (NLP)…
Google has officially launched PaliGemma, a powerful yet lightweight open-source Vision-Language Model (VLM). The release of PaliGemma represents a significant…
Hugging Face and LangChain have jointly announced the launch of a new official partner package, `langchain-huggingface`. This collaboration aims to provide…
Hugging Face has officially launched Transformers Agents 2.0, a major refactoring and upgrade of its existing Agent framework, designed to provide developers…
Hugging Face has announced that its enterprise-focused collaboration platform, "Enterprise Hub," is now officially available on AWS Marketplace. This…
As enterprise demand for Retrieval-Augmented Generation (RAG) technology surges, how to maintain high performance while controlling hardware costs has become…
Hugging Face has officially launched the "Open Leaderboard for Hebrew LLMs," an open-source evaluation platform specifically designed for Hebrew large language…
Hugging Face has announced a partnership with the independent AI performance analytics firm Artificial Analysis, officially integrating its "LLM Performance…
This technical blog post from Hugging Face explains how to build a powerful and efficient speech processing system using Hugging Face Inference Endpoints, a…
When developing applications based on large language models (LLMs) — such as AI agents, RAG systems, or automated workflows — one of the biggest challenges…
In the field of code generation, instruction tuning is the key to improving a model's practical utility and alignment with human…
Hugging Face has announced the launch of the new "Open Chain of Thought (CoT) Leaderboard," a public platform specifically designed to evaluate and compare the…