Google DeepMind 推出 FACTS 基準測試套件:系統化評估大型語言模型的真實性★ 80
Google DeepMind Blog·187 days ago·Release
As large language models (LLMs) are deployed across a wide range of industries, ensuring the "factuality" of model outputs and reducing "hallucination" has…