Latest in AI

Showing:cybersecurityResearchersClear ×

🔥 Trending today

anthropic6 export-controls4 model-access3 spacex3 amazon3 national-security2 open-source2 governance2 ai-regulation2 government-policy2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

U.S. Government Orders Anthropic to Disable Claude Fable 5 and Mythos 5★ 78
TechCrunch AIyesterdayRegulation
TechCrunch reports that the U.S. government ordered Anthropic to immediately disable Claude Fable 5 and Claude Mythos 5 worldwide, citing national security concerns. Anthropic says the order appears tied to a claimed narrow jailbreak of Fable 5, but argues the cited capability is already common in other public models. The move highlights a potential backlash against Anthropic’s safety-first messaging around especially powerful AI systems.
US Directive Targets Access to Fable 5 and Mythos 5★ 76
Simon Willison's WeblogyesterdayRegulation
Simon Willison comments on Anthropic’s statement that a US government export-control directive requires suspending access to Fable 5 and Mythos 5 for all foreign nationals, including Anthropic employees. Anthropic says the directive cites national security concerns but offers only verbal evidence of a narrow Fable 5 jailbreak. Willison notes that, as of 9:01pm ET, he still had access to Fable through claude.ai and Claude Code.
Security Researchers Criticize Anthropic Fable Safeguards as Too Strict
Hacker News (AI keywords)4 days agoEthics
Anthropic released Fable as a public but limited version of its cybersecurity-focused Mythos model. Security researchers say its guardrails trigger on broad cyber-related wording, blocking tasks like blog analysis, secure coding, and code review. The restrictions aim to reduce malware, software compromise, and biology-related misuse, but the current implementation may frustrate legitimate security work.
Cybersecurity Researchers Criticize Anthropic's Fable for Overly Strict Guardrails
TechCrunch AI4 days agoIncident
Anthropic's latest model Fable is drawing complaints from the cybersecurity research community over guardrails deemed excessively restrictive. Researchers say the model's content filters block even legitimate security tasks, hampering professional workflows. The incident highlights a persistent tension between AI safety measures and the practical needs of security professionals who must engage with offensive techniques defensively.
Microsoft's open source tools were hacked to steal passwords of AI developers★ 78
Hacker News (AI keywords)5 days agoIncident
Microsoft temporarily removed several open source GitHub projects while investigating suspected malicious content. The affected repos were linked to Azure and developer workflows involving AI coding tools such as Claude Code, Gemini CLI, and VS Code. Security researchers said the malware could steal passwords and sensitive credentials when compromised tools were opened, though Microsoft has not disclosed how many users were affected.
What We Learned Mapping a Year's Worth of AI-Enabled Cyber Threats★ 74
Anthropic News6 days agoEthics
Anthropic analyzed 832 accounts banned for malicious cyber activity from March 2025 to March 2026 and mapped them to MITRE ATT&CK. The report says attackers increasingly use AI beyond preparation, applying it to post-compromise tasks such as account discovery, lateral movement, and privilege escalation. Anthropic argues that frameworks need to capture agentic orchestration, chained attack stages, real-time decisions, and low-human-intervention operations.
U.S. Military Turned GPS into a Global "Numbers Station"
Hacker News (AI keywords)9 days agoCommentary
According to investigative outlet 404 Media, evidence suggests the U.S. military has repurposed the Global Positioning System (GPS) into a modern "numbers station." By embedding encrypted data within standard GPS broadcasts, the military can securely transmit covert messages to agents or assets worldwide. This technique leverages existing satellite infrastructure to achieve global coverage with near-perfect receiver anonymity.
Expanding Project Glasswing★ 76
Hacker News (AI keywords)12 days agoBusiness
Anthropic is expanding Project Glasswing, its program for using Claude Mythos Preview to find vulnerabilities in critical software. The new cohort includes around 150 organizations across more than 15 countries, including infrastructure providers, vendors, nonprofits, and open-source maintainers. Anthropic frames the expansion as preparation for a world where powerful cyber-capable AI models become cheaper and more widely available, shifting focus from finding bugs to validating, disclosing, patching, and deploying fixes.
From Map Data to National Security: A Deep Dive into Amap Risks in Taiwan★ 74
INSIDE 硬塞 AI16 days agoRegulation
INSIDE examines how China’s Amap has become controversial in Taiwan beyond ordinary mapping or navigation use. The article says its service relies on user data and AI-based inference rather than full official data integrations. That model could send movement traces and behavioral signals back to China, creating risks for hybrid warfare intelligence, influence operations, and Taiwan’s broader governance of map data and digital infrastructure.
Import AI 457：AI 版 Stuxnet 震網病毒、神祕的 Muon 優化器，以及積極對齊（Positive Alignment）★ 78
Import AI (Jack Clark)27 days agoCommentary
This issue of Import AI 457, written by Jack Clark, delves into three forward-looking and stylistically distinct topics in the field of artificial…
漏洞賞金計劃遭大量「AI 垃圾報告」轟炸，企業安全團隊不堪重負★ 70
Ars Technica AI27 days agoIncident
According to a report by Ars Technica, corporate bug bounty programs are currently being bombarded with an "endless" stream of AI-generated junk reports (AI…
AI 與網路安全的未來：為什麼「開放」至關重要★ 75
Hugging Face Blog54 days agoOpinion
As artificial intelligence (AI) technology undergoes explosive growth, cybersecurity has become a focal point of concern for governments and enterprises…
Import AI 452：網路戰的縮放定律、AI 自動化浪潮與 GDP 預測之謎★ 75
Import AI (Jack Clark)69 days agoCommentary
This issue of Import AI 452, written by Jack Clark, takes a deep dive into the far-reaching impact of artificial intelligence on three major areas: national…
Import AI 450：中國電子戰 AI 模型、受創傷的 LLM 與網路攻擊的規模法則★ 75
Import AI (Jack Clark)83 days agoCommentary
In this issue of Import AI 450, author Jack Clark explores three key topics with profound implications for the future of technology, security, and geopolitics…
Import AI 442：AI 經濟中的贏家與輸家、數學證明自動化，以及網路間諜活動的工業化★ 75
Import AI (Jack Clark)139 days agoCommentary
In this issue of Import AI 442, Jack Clark raises a core fundamental question: "Will the arrival of superintelligence be an instantaneous 'phase change,' or a…
Import AI 438：無聲的警報，為我們所有人閃爍（網路安全能力過剩與對話隱私）★ 75
Import AI (Jack Clark)174 days agoCommentary
In this issue of Import AI 438, Jack Clark examines two key issues concerning AI security and privacy: **1. You Are Your LLM History** As large language models…
Meta 推出 CyberSecEval 2：評估大語言模型網路安全風險與防護能力的全面性框架★ 75
Hugging Face Blog751 days agoRelease
As large language models (LLMs) become increasingly prevalent in software development and automated workflows, their "dual-use" risks in the cybersecurity…

Latest in AI

U.S. Government Orders Anthropic to Disable Claude Fable 5 and Mythos 5★ 78

US Directive Targets Access to Fable 5 and Mythos 5★ 76

Security Researchers Criticize Anthropic Fable Safeguards as Too Strict

Cybersecurity Researchers Criticize Anthropic's Fable for Overly Strict Guardrails

Microsoft's open source tools were hacked to steal passwords of AI developers★ 78

What We Learned Mapping a Year's Worth of AI-Enabled Cyber Threats★ 74

U.S. Military Turned GPS into a Global "Numbers Station"

Expanding Project Glasswing★ 76

From Map Data to National Security: A Deep Dive into Amap Risks in Taiwan★ 74

Import AI 457：AI 版 Stuxnet 震網病毒、神祕的 Muon 優化器，以及積極對齊（Positive Alignment）★ 78

漏洞賞金計劃遭大量「AI 垃圾報告」轟炸，企業安全團隊不堪重負★ 70

AI 與網路安全的未來：為什麼「開放」至關重要★ 75

Import AI 452：網路戰的縮放定律、AI 自動化浪潮與 GDP 預測之謎★ 75

Import AI 450：中國電子戰 AI 模型、受創傷的 LLM 與網路攻擊的規模法則★ 75

Import AI 442：AI 經濟中的贏家與輸家、數學證明自動化，以及網路間諜活動的工業化★ 75

Import AI 438：無聲的警報，為我們所有人閃爍（網路安全能力過剩與對話隱私）★ 75

Meta 推出 CyberSecEval 2：評估大語言模型網路安全風險與防護能力的全面性框架★ 75