Latest in AI

Showing:llm-securityResearchersClear ×

🔥 Trending today

anthropic4 open-source3 amazon3 ai-regulation2 government-policy2 export-controls2 geopolitics2 privacy2 python-packaging2 webassembly2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

I built a vulnerable app and spent $1,500 seeing if LLMs could hack it
Hacker News (AI keywords)10 days agoBenchmark
The author built a vulnerable React Native app with a Python backend and a Firebase access-control flaw. GPT 5.5 solved 7 of 10 runs, while Deepseek and Claude variants solved fewer attempts. Many other models failed due to refusals, API-focused tunnel vision, false positives, or inability to use the exposed Firebase path correctly.
Gemini randomly dumped its system prompt
Hacker News (AI keywords)24 days agoIncident
The title suggests Gemini may have unexpectedly output its system prompt during use. Since no source text is provided, the trigger, interface, reproducibility, leaked content, and any Google response cannot be verified. Treat it as a cautious prompt-leakage incident signal relevant to LLM safety, product security, and developers building on hidden system instructions.
Llama Guard 4 正式登陸 Hugging Face Hub：全新一代開源 AI 安全防護模型★ 75
Hugging Face Blog412 days agoRelease
Meta's safety guardrail model family has welcomed its newest member — Llama Guard 4 — which is now officially available on the Hugging Face Hub. As a…
大型語言模型的紅隊演練（Red-Teaming LLMs）★ 75
Hugging Face Blog1,207 days agoTutorial
With the explosive growth of large language models (LLMs) such as ChatGPT, AI safety and ethics have become the most pressing concerns in the industry. This…