使用 AWS Inferentia2 加速 Hugging Face Transformers 模型推理★ 70
Hugging Face Blog·1,154 days ago·Release
This article explains how to accelerate the deployment and inference of Hugging Face Transformers models using AWS Inferentia2 (Inf2 instances) — AWS's…