Latest in AI

Showing:ai-safetyFoundersClear ×

🔥 Trending today

anthropic6 export-controls4 model-access3 spacex3 amazon3 national-security2 open-source2 governance2 ai-regulation2 government-policy2

Topic

Release New Tool Tutorial Business Paper Benchmark Opinion Regulation

For

General Developers Designers Product Founders Marketing Researchers Students

Results from the First Anthropic Public Record
Anthropic News21 hours agoRegulation
Anthropic published the first results from Anthropic Public Record, a recurring survey series on public attitudes toward AI. The first wave surveyed nearly 52,000 Americans in late 2025 and found broad hopes for medical progress and accessibility, alongside major fears about job loss, cognitive dependency, and misinformation. Respondents also showed bipartisan support for government involvement, legal accountability, privacy protections, child safety rules, and stronger oversight of AI companies.
U.S. Government Orders Anthropic to Disable Claude Fable 5 and Mythos 5★ 78
TechCrunch AIyesterdayRegulation
TechCrunch reports that the U.S. government ordered Anthropic to immediately disable Claude Fable 5 and Claude Mythos 5 worldwide, citing national security concerns. Anthropic says the order appears tied to a claimed narrow jailbreak of Fable 5, but argues the cited capability is already common in other public models. The move highlights a potential backlash against Anthropic’s safety-first messaging around especially powerful AI systems.
Anthropic Apologizes for Hidden Claude Fable Guardrails
The Verge AI3 days agoIncident
Anthropic apologized for launching Claude Fable 5 with hidden safeguards that silently altered or degraded answers when the system suspected model-distillation attempts. The company now says those queries will visibly fall back to Claude Opus 4.8, matching how Fable handles other high-risk areas. The reversal follows backlash from AI researchers who warned that invisible restrictions could undermine evaluation, research, and competing model development.
Anthropic’s Amodei Urges Mandatory Safety Rules for Frontier AI★ 72
INSIDE 硬塞 AI3 days agoRegulation
Anthropic CEO Dario Amodei is calling for AI regulation to move beyond transparency requirements toward binding safety obligations. He argues that frontier models already present visible risks and should face mandatory testing across four major risk areas. Under his proposed approach, governments would have authority to block or deter deployment when systems fail to meet required safety standards.
Lawsuit Says xAI Fired Engineer Over Grok Safety Warning★ 74
TechCrunch AI3 days agoEthics
Former xAI engineer Devin Kim is suing xAI and SpaceX, alleging retaliation after he repeatedly raised safety concerns about Grok. The complaint says Kim warned about discrimination, harmful content, weapons-related risks, and alleged resistance to safety testing around Grok Code 1. The lawsuit arrives days before SpaceX’s expected IPO; xAI and SpaceX did not immediately respond to TechCrunch’s requests for comment.
Quoting Jeremy Howard on Anthropic's Recursive AI Self-Improvement Contradiction
Simon Willison's Weblog4 days agoEthics
Jeremy Howard proposes that labs claiming to slow recursive AI self-improvement should ban themselves from using their top model for frontier research while letting others access it. He argues Anthropic does the opposite — using its best model internally while reportedly blocking others from doing the same — accelerating the frontier and worsening power imbalance. Howard personally favors democratization over slowdown, but his point is about consistency: if you preach restraint, constrain yourself first.
Claude Mythos 5 Released: 50 Million Lines of Code in One Day★ 74
量子位 QbitAI4 days agoRelease
QbitAI says Anthropic introduced Claude Fable 5 for general users and Claude Mythos 5 for a small set of trusted users. The article highlights software engineering, long-context work, native vision, memory, and scientific research capabilities. It also focuses on a safety-routing design where Fable 5 downgrades high-risk requests to Claude Opus 4.8 instead of simply refusing.
Anthropic Is Accused of Nerfing Fable for Other LLM Development
r/LocalLLaMA top day4 days agoCommentary
A r/LocalLLaMA post claims Anthropic may be intentionally limiting Fable when users ask it to help build other LLMs. The source is a short Reddit post with screenshot context, not a formal benchmark or verified disclosure. Discussion centers on trust in hosted closed models, unclear safety boundaries, and why local or open-weight LLMs may be necessary for serious AI development work.
Claude Fable 5 and new AI safety fables
Interconnects (Nathan L.)4 days agoCommentary
Interconnects author Nathan Lambert leverages the double meaning of 'Fable' — both Anthropic's model codename and a fictional story — to interrogate frontier AI safety discourse. The piece frames Claude Fable 5's release within escalating lab power politics, where safety positioning doubles as competitive branding. A critical commentary for those tracking AI governance and Anthropic's strategic narrative.
Anthropic Requires Fable and Mythos Models to Retain Data for 30 Days★ 74
Hacker News (AI keywords)5 days agoEthics
Anthropic says Mythos-class models require limited prompt and output retention for trust and safety work across platforms where they are offered. The policy took effect on June 9, 2026 and mainly affects organizations using Zero Data Retention through Claude Console, Claude Code Enterprise, AWS Bedrock, Google Cloud Agent Platform, or Microsoft Foundry. Consumer Claude Free, Pro, and Max plans are unchanged, while Anthropic describes restricted human review and automatic deletion after 30 days.
Anthropic Releases Claude Fable 5, Its First Public Mythos-Class Model, With Guardrails for High-Risk Domains★ 76
TechCrunch AI5 days agoRelease
Anthropic has released Claude Fable 5, marking the first time a model from its high-capability Mythos family is available to the general public. The model includes built-in guardrails that restrict responses in high-risk domains such as cybersecurity and biology to mitigate misuse potential. The launch comes just days after Anthropic publicly warned that AI technology is becoming increasingly and alarmingly dangerous.
System Card: Claude Fable 5 and Claude Mythos 5★ 82
Hacker News (AI keywords)5 days agoRelease
Anthropic has published system cards for its two newest flagship models, Claude Fable 5 and Claude Mythos 5, following its standard responsible-release practice. These documents cover dangerous capability evaluations, ASL safety-level determinations, red-teaming results, and alignment assessments under the company's Responsible Scaling Policy. They serve as primary references for safety researchers, enterprise buyers, regulators, and developers assessing model risk and deployment suitability.
Altman, Amodei, and Hassabis Unite to Back DNA Safety Legislation
量子位 QbitAI6 days agoRegulation
Based on the headline and public reporting, the article covers a rare joint push by Sam Altman, Dario Amodei, Demis Hassabis, and other AI leaders for US biosecurity legislation. They are asking lawmakers to require synthetic DNA and RNA providers to screen customers, orders, and records. The concern is that advanced AI could lower the knowledge barrier for designing dangerous biological agents.
Responsible Scaling Policy
Anthropic News6 days agoEthics
Anthropic published a major update to its Responsible Scaling Policy, its governance framework for frontier AI risk. The revised policy keeps the commitment not to train or deploy models without adequate safeguards, while adding more nuanced capability thresholds and required safety levels. It focuses on risks such as autonomous AI R&D acceleration and CBRN weapons assistance, with stronger evaluations, documentation, governance, and external input.
Anthropic Co-founder Ben Mann Visits Taiwan to Discuss AI Safety and Claude Strategy
INSIDE 硬塞 AI9 days agoBusiness
Anthropic co-founder and Anthropic Labs lead Ben Mann made his first visit to Taiwan, according to INSIDE. The report highlights his role in leading Claude Code and the Model Context Protocol, two key parts of Anthropic’s developer-focused product direction. The discussion centered on Claude strategy, AI safety boundaries, jobs, and Taiwan’s strategic role in the AI landscape.
Trend Micro Joins Anthropic Project Glasswing to Defend Taiwan’s AI Supply Chain★ 72
INSIDE 硬塞 AI9 days agoBusiness
Anthropic introduced Project Glasswing after Claude Mythos Preview showed the ability to rapidly find high-risk vulnerabilities and generate connected attack commands. Trend Micro’s TrendAI has joined the framework, becoming the first Taiwanese cybersecurity vendor to do so. The article frames the move around Taiwan’s strategic AI hardware role and a new defensive logic: using AI to counter malicious AI.
Expanding Project Glasswing★ 76
Hacker News (AI keywords)12 days agoBusiness
Anthropic is expanding Project Glasswing, its program for using Claude Mythos Preview to find vulnerabilities in critical software. The new cohort includes around 150 organizations across more than 15 countries, including infrastructure providers, vendors, nonprofits, and open-source maintainers. Anthropic frames the expansion as preparation for a world where powerful cyber-capable AI models become cheaper and more widely available, shifting focus from finding bugs to validating, disclosing, patching, and deploying fixes.
Florida sues OpenAI, Sam Altman after multiple ChatGPT-linked murders★ 78
Ars Technica AI13 days agoRegulation
Florida sued OpenAI and CEO Sam Altman over multiple murders described as linked to ChatGPT. The state's attorney general accused Altman of an "utter disregard" for human lives. The provided excerpt does not identify the cases, explain the alleged causal links, specify the legal claims, or include OpenAI's response, so the allegations require further clarification.
At TechCrunch Disrupt 2026: Databricks co-founder on what kills enterprise AI deals
TechCrunch AI17 days agoBusiness
TechCrunch frames enterprise AI as entering a new phase, where companies are no longer mainly asking whether AI is exciting. The harder question is whether it can be deployed safely at scale. Centered on a TechCrunch Disrupt 2026 discussion with a Databricks co-founder, the article points to safety and broad rollout readiness as key enterprise AI deal concerns.

Latest in AI

Results from the First Anthropic Public Record

U.S. Government Orders Anthropic to Disable Claude Fable 5 and Mythos 5★ 78

Anthropic Apologizes for Hidden Claude Fable Guardrails

Anthropic’s Amodei Urges Mandatory Safety Rules for Frontier AI★ 72

Lawsuit Says xAI Fired Engineer Over Grok Safety Warning★ 74

Quoting Jeremy Howard on Anthropic's Recursive AI Self-Improvement Contradiction

Claude Mythos 5 Released: 50 Million Lines of Code in One Day★ 74

Anthropic Is Accused of Nerfing Fable for Other LLM Development

Claude Fable 5 and new AI safety fables

Anthropic Requires Fable and Mythos Models to Retain Data for 30 Days★ 74

Anthropic Releases Claude Fable 5, Its First Public Mythos-Class Model, With Guardrails for High-Risk Domains★ 76

System Card: Claude Fable 5 and Claude Mythos 5★ 82

Altman, Amodei, and Hassabis Unite to Back DNA Safety Legislation

Responsible Scaling Policy

Anthropic Co-founder Ben Mann Visits Taiwan to Discuss AI Safety and Claude Strategy

Trend Micro Joins Anthropic Project Glasswing to Defend Taiwan’s AI Supply Chain★ 72

Expanding Project Glasswing★ 76

Florida sues OpenAI, Sam Altman after multiple ChatGPT-linked murders★ 78

At TechCrunch Disrupt 2026: Databricks co-founder on what kills enterprise AI deals