Hugging Face BlogJan 26, 2024, 12:00 AMimportant 75

Hugging Face 推出 AI Secure LLM 安全排行榜:基於 DecodingTrust 框架深度評估大模型信任度

Original: An Introduction to AI Secure LLM Safety Leaderboard

### Introduction: Capability Is Not Safety — A New Benchmark for LLM Safety Evaluation As large language models (LLMs) are adopted more…

Hugging Face 與學術團隊合作推出了全新的「AI Secure LLM 安全排行榜」(基於 DecodingTrust 評估框架)。該排行榜旨在填補現有 LLM 評測偏重「能力」而忽略「安全」的空白,從毒性、刻板印象偏見、對抗強健性、隱私保護及機器倫理等 8 大安全維度,對主流開源與閉源模型進行系統性評測,為開發者提供更全面的模型安全選擇依據。

### Introduction: Capability Is Not Safety — A New Benchmark for LLM Safety Evaluation

Full summary

Free shows the 3-line summary; Pro unlocks the full deep summary (~300 words) so you never have to click through.

See Pro plans →

Want the original English / full article?

Read on Hugging Face Blog →

Summaries are AI-generated; the original article is authoritative.