Latest in AI

Showing:cybersecurityDevelopersClear ×

🔥 Trending today

anthropic6 export-controls4 model-access3 spacex3 amazon3 national-security2 open-source2 governance2 ai-regulation2 government-policy2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

U.S. Government Orders Anthropic to Disable Claude Fable 5 and Mythos 5★ 78
TechCrunch AIyesterdayRegulation
TechCrunch reports that the U.S. government ordered Anthropic to immediately disable Claude Fable 5 and Claude Mythos 5 worldwide, citing national security concerns. Anthropic says the order appears tied to a claimed narrow jailbreak of Fable 5, but argues the cited capability is already common in other public models. The move highlights a potential backlash against Anthropic’s safety-first messaging around especially powerful AI systems.
US Directive Targets Access to Fable 5 and Mythos 5★ 76
Simon Willison's WeblogyesterdayRegulation
Simon Willison comments on Anthropic’s statement that a US government export-control directive requires suspending access to Fable 5 and Mythos 5 for all foreign nationals, including Anthropic employees. Anthropic says the directive cites national security concerns but offers only verbal evidence of a narrow Fable 5 jailbreak. Willison notes that, as of 9:01pm ET, he still had access to Fable through claude.ai and Claude Code.
Security Researchers Criticize Anthropic Fable Safeguards as Too Strict
Hacker News (AI keywords)4 days agoEthics
Anthropic released Fable as a public but limited version of its cybersecurity-focused Mythos model. Security researchers say its guardrails trigger on broad cyber-related wording, blocking tasks like blog analysis, secure coding, and code review. The restrictions aim to reduce malware, software compromise, and biology-related misuse, but the current implementation may frustrate legitimate security work.
Cybersecurity Researchers Criticize Anthropic's Fable for Overly Strict Guardrails
TechCrunch AI4 days agoIncident
Anthropic's latest model Fable is drawing complaints from the cybersecurity research community over guardrails deemed excessively restrictive. Researchers say the model's content filters block even legitimate security tasks, hampering professional workflows. The incident highlights a persistent tension between AI safety measures and the practical needs of security professionals who must engage with offensive techniques defensively.
Microsoft's open source tools were hacked to steal passwords of AI developers★ 78
Hacker News (AI keywords)5 days agoIncident
Microsoft temporarily removed several open source GitHub projects while investigating suspected malicious content. The affected repos were linked to Azure and developer workflows involving AI coding tools such as Claude Code, Gemini CLI, and VS Code. Security researchers said the malware could steal passwords and sensitive credentials when compromised tools were opened, though Microsoft has not disclosed how many users were affected.
Defending Against Frontier Cyber Models: Cloudflare's Project Glasswing Architecture★ 70
Cloudflare Blog5 days agoCommentary
Cloudflare introduces its defense architecture under Project Glasswing, arguing that robust architectural defense around vulnerabilities is more critical than patching speed. By acting as its own "customer zero," Cloudflare demonstrates how to mitigate autonomous frontier cyber models through edge-based isolation, zero-trust principles, and proactive traffic filtering.
Turning Cloudflare’s threat indicators into real-time WAF rules
Cloudflare Blog6 days agoRelease
Cloudflare customers can now apply Cloudforce One threat intelligence inside the WAF to block high-risk traffic. New cf.intel fields let security teams automate protections based on specific threat actors and targeted industries. The update turns threat indicators into real-time enforcement signals, reducing the gap between intelligence and active blocking.
What We Learned Mapping a Year's Worth of AI-Enabled Cyber Threats★ 74
Anthropic News6 days agoEthics
Anthropic analyzed 832 accounts banned for malicious cyber activity from March 2025 to March 2026 and mapped them to MITRE ATT&CK. The report says attackers increasingly use AI beyond preparation, applying it to post-compromise tasks such as account discovery, lateral movement, and privilege escalation. Anthropic argues that frameworks need to capture agentic orchestration, chained attack stages, real-time decisions, and low-human-intervention operations.
U.S. Military Turned GPS into a Global "Numbers Station"
Hacker News (AI keywords)9 days agoCommentary
According to investigative outlet 404 Media, evidence suggests the U.S. military has repurposed the Global Positioning System (GPS) into a modern "numbers station." By embedding encrypted data within standard GPS broadcasts, the military can securely transmit covert messages to agents or assets worldwide. This technique leverages existing satellite infrastructure to achieve global coverage with near-perfect receiver anonymity.
Trend Micro Joins Anthropic Project Glasswing to Defend Taiwan’s AI Supply Chain★ 72
INSIDE 硬塞 AI9 days agoBusiness
Anthropic introduced Project Glasswing after Claude Mythos Preview showed the ability to rapidly find high-risk vulnerabilities and generate connected attack commands. Trend Micro’s TrendAI has joined the framework, becoming the first Taiwanese cybersecurity vendor to do so. The article frames the move around Taiwan’s strategic AI hardware role and a new defensive logic: using AI to counter malicious AI.
Trump Signs Executive Order to Review AI Models Before Release★ 76
The Verge AI12 days agoRegulation
President Donald Trump signed an executive order establishing a voluntary framework for AI companies. Companies may share frontier models with the federal government before public release. The order frames the initiative as a way to promote secure innovation and strengthen cybersecurity for critical infrastructure, while avoiding measures that stifle the US AI industry.
Anthropic scales Claude Mythos to critical infrastructure in 15+ countries★ 72
TechCrunch AI12 days agoBusiness
Anthropic is expanding its Project Glasswing security vulnerability program and access to Mythos. The rollout covers 150 organizations across 15 countries, focusing on power, water, healthcare, and communications infrastructure. The company is targeting sectors where a cyberattack could affect as many as 100 million people, although implementation details and participating organizations were not disclosed in the provided text.
Expanding Project Glasswing★ 76
Hacker News (AI keywords)12 days agoBusiness
Anthropic is expanding Project Glasswing, its program for using Claude Mythos Preview to find vulnerabilities in critical software. The new cohort includes around 150 organizations across more than 15 countries, including infrastructure providers, vendors, nonprofits, and open-source maintainers. Anthropic frames the expansion as preparation for a world where powerful cyber-capable AI models become cheaper and more widely available, shifting focus from finding bugs to validating, disclosing, patching, and deploying fixes.
From Map Data to National Security: A Deep Dive into Amap Risks in Taiwan★ 74
INSIDE 硬塞 AI16 days agoRegulation
INSIDE examines how China’s Amap has become controversial in Taiwan beyond ordinary mapping or navigation use. The article says its service relies on user data and AI-based inference rather than full official data integrations. That model could send movement traces and behavioral signals back to China, creating risks for hybrid warfare intelligence, influence operations, and Taiwan’s broader governance of map data and digital infrastructure.
MetaAge Shows Smart Enterprise AI Vision at COMPUTEX 2026
INSIDE 硬塞 AI19 days agoBusiness
MetaAge presented its “smart enterprise in the AI era” vision at COMPUTEX 2026, centered on AI Agent solutions for business deployment. The showcase focuses on core operations, intelligent customer service, and cybersecurity governance. By integrating resources from AWS, Microsoft, and Google Cloud, the company aims to help enterprises turn AI adoption into practical operational capability and competitive advantage.
Import AI 457：AI 版 Stuxnet 震網病毒、神祕的 Muon 優化器，以及積極對齊（Positive Alignment）★ 78
Import AI (Jack Clark)27 days agoCommentary
This issue of Import AI 457, written by Jack Clark, delves into three forward-looking and stylistically distinct topics in the field of artificial…
漏洞賞金計劃遭大量「AI 垃圾報告」轟炸，企業安全團隊不堪重負★ 70
Ars Technica AI27 days agoIncident
According to a report by Ars Technica, corporate bug bounty programs are currently being bombarded with an "endless" stream of AI-generated junk reports (AI…
AI 與網路安全的未來：為什麼「開放」至關重要★ 75
Hugging Face Blog54 days agoOpinion
As artificial intelligence (AI) technology undergoes explosive growth, cybersecurity has become a focal point of concern for governments and enterprises…
Import AI 452：網路戰的縮放定律、AI 自動化浪潮與 GDP 預測之謎★ 75
Import AI (Jack Clark)69 days agoCommentary
This issue of Import AI 452, written by Jack Clark, takes a deep dive into the far-reaching impact of artificial intelligence on three major areas: national…
Import AI 450：中國電子戰 AI 模型、受創傷的 LLM 與網路攻擊的規模法則★ 75
Import AI (Jack Clark)83 days agoCommentary
In this issue of Import AI 450, author Jack Clark explores three key topics with profound implications for the future of technology, security, and geopolitics…
Import AI 442：AI 經濟中的贏家與輸家、數學證明自動化，以及網路間諜活動的工業化★ 75
Import AI (Jack Clark)139 days agoCommentary
In this issue of Import AI 442, Jack Clark raises a core fundamental question: "Will the arrival of superintelligence be an instantaneous 'phase change,' or a…
Import AI 438：無聲的警報，為我們所有人閃爍（網路安全能力過剩與對話隱私）★ 75
Import AI (Jack Clark)174 days agoCommentary
In this issue of Import AI 438, Jack Clark examines two key issues concerning AI security and privacy: **1. You Are Your LLM History** As large language models…
Meta 推出 CyberSecEval 2：評估大語言模型網路安全風險與防護能力的全面性框架★ 75
Hugging Face Blog751 days agoRelease
As large language models (LLMs) become increasingly prevalent in software development and automated workflows, their "dual-use" risks in the cybersecurity…

Latest in AI

U.S. Government Orders Anthropic to Disable Claude Fable 5 and Mythos 5★ 78

US Directive Targets Access to Fable 5 and Mythos 5★ 76

Security Researchers Criticize Anthropic Fable Safeguards as Too Strict

Cybersecurity Researchers Criticize Anthropic's Fable for Overly Strict Guardrails

Microsoft's open source tools were hacked to steal passwords of AI developers★ 78

Defending Against Frontier Cyber Models: Cloudflare's Project Glasswing Architecture★ 70

Turning Cloudflare’s threat indicators into real-time WAF rules

What We Learned Mapping a Year's Worth of AI-Enabled Cyber Threats★ 74

U.S. Military Turned GPS into a Global "Numbers Station"

Trend Micro Joins Anthropic Project Glasswing to Defend Taiwan’s AI Supply Chain★ 72

Trump Signs Executive Order to Review AI Models Before Release★ 76

Anthropic scales Claude Mythos to critical infrastructure in 15+ countries★ 72

Expanding Project Glasswing★ 76

From Map Data to National Security: A Deep Dive into Amap Risks in Taiwan★ 74

MetaAge Shows Smart Enterprise AI Vision at COMPUTEX 2026

Import AI 457：AI 版 Stuxnet 震網病毒、神祕的 Muon 優化器，以及積極對齊（Positive Alignment）★ 78

漏洞賞金計劃遭大量「AI 垃圾報告」轟炸，企業安全團隊不堪重負★ 70

AI 與網路安全的未來：為什麼「開放」至關重要★ 75

Import AI 452：網路戰的縮放定律、AI 自動化浪潮與 GDP 預測之謎★ 75

Import AI 450：中國電子戰 AI 模型、受創傷的 LLM 與網路攻擊的規模法則★ 75

Import AI 442：AI 經濟中的贏家與輸家、數學證明自動化，以及網路間諜活動的工業化★ 75

Import AI 438：無聲的警報，為我們所有人閃爍（網路安全能力過剩與對話隱私）★ 75

Meta 推出 CyberSecEval 2：評估大語言模型網路安全風險與防護能力的全面性框架★ 75