Simon Willison highlights a WIRED scoop reporting that Anthropic is changing Claude Fable 5 safeguards for frontier LLM development. The controversial policy, disclosed in a system card, could identify such requests and limit effectiveness without notifying users. Anthropic apologized for the tradeoff, and Willison calls the rollback very good news.
Anthropic has published system cards for its two newest flagship models, Claude Fable 5 and Claude Mythos 5, following its standard responsible-release practice. These documents cover dangerous capability evaluations, ASL safety-level determinations, red-teaming results, and alignment assessments under the company's Responsible Scaling Policy. They serve as primary references for safety researchers, enterprise buyers, regulators, and developers assessing model risk and deployment suitability.