TechCrunch reports that the U.S. government ordered Anthropic to immediately disable Claude Fable 5 and Claude Mythos 5 worldwide, citing national security concerns. Anthropic says the order appears tied to a claimed narrow jailbreak of Fable 5, but argues the cited capability is already common in other public models. The move highlights a potential backlash against Anthropic’s safety-first messaging around especially powerful AI systems.
Simon Willison comments on Anthropic’s statement that a US government export-control directive requires suspending access to Fable 5 and Mythos 5 for all foreign nationals, including Anthropic employees. Anthropic says the directive cites national security concerns but offers only verbal evidence of a narrow Fable 5 jailbreak. Willison notes that, as of 9:01 p.m. ET, he still had access to Fable through claude.ai and Claude Code.
Anthropic released Fable as a public but limited version of its cybersecurity-focused Mythos model. Security researchers say its guardrails trigger on broad cyber-related wording, blocking tasks like blog analysis, secure coding, and code review. The restrictions aim to reduce malware, software compromise, and biology-related misuse, but the current implementation may frustrate legitimate security work.
Anthropic's latest model, Fable, is drawing complaints from the cybersecurity research community over guardrails that researchers consider excessively restrictive. They say the model's content filters block even legitimate security tasks, hampering professional workflows. The incident highlights a persistent tension between AI safety measures and the practical needs of security professionals, who must work with offensive techniques for defensive purposes.
Microsoft temporarily removed several open source GitHub projects while investigating suspected malicious content. The affected repos were linked to Azure and developer workflows involving AI coding tools such as Claude Code, Gemini CLI, and VS Code. Security researchers said the malware could steal passwords and sensitive credentials when compromised tools were opened, though Microsoft has not disclosed how many users were affected.
Cloudflare introduces its defense architecture under Project Glasswing, arguing that robust architectural defense around vulnerabilities matters more than patching speed. By acting as its own "customer zero," Cloudflare demonstrates how to mitigate attacks driven by autonomous frontier cyber models through edge-based isolation, zero-trust principles, and proactive traffic filtering.
Cloudflare customers can now apply Cloudforce One threat intelligence inside the WAF to block high-risk traffic. New cf.intel fields let security teams automate protections based on specific threat actors and targeted industries. The update turns threat indicators into real-time enforcement signals, reducing the gap between intelligence and active blocking.
Anthropic analyzed 832 accounts banned for malicious cyber activity from March 2025 to March 2026 and mapped them to MITRE ATT&CK. The report says attackers increasingly use AI beyond preparation, applying it to post-compromise tasks such as account discovery, lateral movement, and privilege escalation. Anthropic argues that frameworks need to capture agentic orchestration, chained attack stages, real-time decisions, and low-human-intervention operations.
According to investigative outlet 404 Media, evidence suggests the U.S. military has repurposed the Global Positioning System (GPS) into a modern "numbers station." By embedding encrypted data within standard GPS broadcasts, the military can securely transmit covert messages to agents or assets worldwide. This technique leverages existing satellite infrastructure to achieve global coverage with near-perfect receiver anonymity.
Anthropic introduced Project Glasswing after Claude Mythos Preview showed the ability to rapidly find high-risk vulnerabilities and generate connected attack commands. Trend Micro’s TrendAI has joined the framework, becoming the first Taiwanese cybersecurity vendor to do so. The article frames the move around Taiwan’s strategic AI hardware role and a new defensive logic: using AI to counter malicious AI.
President Donald Trump signed an executive order establishing a voluntary framework for AI companies. Companies may share frontier models with the federal government before public release. The order frames the initiative as a way to promote secure innovation and strengthen cybersecurity for critical infrastructure, while avoiding measures that stifle the US AI industry.
Anthropic is expanding its Project Glasswing security vulnerability program and access to Mythos. The rollout covers 150 organizations across 15 countries, focusing on power, water, healthcare, and communications infrastructure. The company is targeting sectors where a cyberattack could affect as many as 100 million people, although implementation details and participating organizations were not disclosed.
Anthropic is expanding Project Glasswing, its program for using Claude Mythos Preview to find vulnerabilities in critical software. The new cohort includes around 150 organizations across more than 15 countries, including infrastructure providers, vendors, nonprofits, and open-source maintainers. Anthropic frames the expansion as preparation for a world where powerful cyber-capable AI models become cheaper and more widely available, shifting focus from finding bugs to validating, disclosing, patching, and deploying fixes.
INSIDE examines how China’s Amap has become controversial in Taiwan beyond ordinary mapping or navigation use. The article says its service relies on user data and AI-based inference rather than full official data integrations. That model could send movement traces and behavioral signals back to China, creating risks for hybrid warfare intelligence, influence operations, and Taiwan’s broader governance of map data and digital infrastructure.
MetaAge presented its “smart enterprise in the AI era” vision at COMPUTEX 2026, centered on AI Agent solutions for business deployment. The showcase focuses on core operations, intelligent customer service, and cybersecurity governance. By integrating resources from AWS, Microsoft, and Google Cloud, the company aims to help enterprises turn AI adoption into practical operational capability and competitive advantage.
This issue of Import AI 457, written by Jack Clark, delves into three forward-looking and stylistically distinct topics in the field of artificial…
According to a report by Ars Technica, corporate bug bounty programs are currently being bombarded with an "endless" stream of AI-generated junk reports (AI…
As artificial intelligence (AI) technology undergoes explosive growth, cybersecurity has become a focal point of concern for governments and enterprises…
This issue of Import AI 452, written by Jack Clark, takes a deep dive into the far-reaching impact of artificial intelligence on three major areas: national…
In this issue of Import AI 450, author Jack Clark explores three key topics with profound implications for the future of technology, security, and geopolitics…
In this issue of Import AI 442, Jack Clark raises a core fundamental question: "Will the arrival of superintelligence be an instantaneous 'phase change,' or a…
In this issue of Import AI 438, Jack Clark examines two key issues concerning AI security and privacy: **1. You Are Your LLM History** As large language models…
As large language models (LLMs) become increasingly prevalent in software development and automated workflows, their "dual-use" risks in the cybersecurity…