Latest in AI

Showing:quantizationDevelopersClear ×

🔥 Trending today

anthropic4 open-source3 amazon3 ai-regulation2 government-policy2 export-controls2 geopolitics2 privacy2 python-packaging2 webassembly2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

使用 Intel Sapphire Rapids 加速 PyTorch Transformer 模型推論（第二部分）
Hugging Face Blog1,224 days agoTutorial
This article is the second installment of a Hugging Face series on accelerating PyTorch Transformer models on Intel's 4th-generation Xeon Scalable Processors…
加速 Document AI：Hugging Face 提升多模態文件理解模型的推論效率★ 70
Hugging Face Blog1,301 days agoTutorial
"Document AI" is a key driver of enterprise digital transformation in recent years, aimed at automating the processing of unstructured documents such as…
使用 🤗 Optimum Intel 與 OpenVINO 加速你的 Hugging Face 模型
Hugging Face Blog1,320 days agoNew Tool
As Transformer models become increasingly prevalent in natural language processing (NLP) and computer vision (CV), efficiently deploying these large models in…
優化故事：BLOOM 超大模型推理優化實踐
Hugging Face Blog1,341 days agoTutorial
This technical blog post from Hugging Face documents in detail the practical process of optimizing inference for BLOOM, the open-source multilingual large…
輕鬆上手 8-bit 矩陣乘法：使用 Transformers、Accelerate 與 bitsandbytes 實現超大規模 Transformer 模型量化★ 80
Hugging Face Blog1,397 days agoRelease
This article introduces the deep integration between Hugging Face and the bitsandbytes library, aimed at solving the enormous memory challenges posed by…
使用 Optimum 與 Transformers Pipelines 加速模型推論★ 75
Hugging Face Blog1,496 days agoRelease
When deploying Transformer models in production, reducing inference latency and increasing throughput while keeping computational costs under control has…
案例研究：使用 Hugging Face Infinity 與現代 CPU 實現毫秒級延遲
Hugging Face Blog1,613 days agoNew Tool
This case study focuses on the performance of "Hugging Face Infinity" — Hugging Face's high-performance inference container solution — on modern CPUs…
在現代 CPU 上擴展 BERT 類模型的推理效能 - 第二部分
Hugging Face Blog1,683 days agoTutorial
This blog post is the second part of a technical guide co-authored by Hugging Face and Intel, designed to show developers how to push the inference performance…
Hugging Face 推出 Optimum：專為大規模 Transformer 模型打造的硬體加速與優化工具包★ 75
Hugging Face Blog1,734 days agoRelease
Hugging Face has officially launched a new open-source toolkit called "Optimum" — an optimization and hardware acceleration library designed specifically for…
Hugging Face 如何為 API 客戶將 Transformer 推理速度提升 100 倍
Hugging Face Blog1,973 days agoRelease
In this technical blog post, the Hugging Face team reveals in detail how they achieved up to 100x speedup in inference for Transformer models for customers of…

← PreviousPage 3