Anthropic's latest model Fable is drawing complaints from the cybersecurity research community over guardrails deemed excessively restrictive. Researchers say the model's content filters block even legitimate security tasks, hampering professional workflows. The incident highlights a persistent tension between AI safety measures and the practical needs of security professionals who must engage with offensive techniques defensively.
Anthropic has announced that its latest frontier model, Fable 5, enforces hard refusals on topics deemed too dangerous, specifically cybersecurity, biology, and chemistry. The move reflects the company's ongoing effort to balance capability with safety as models grow more powerful. For developers and researchers in these fields, the restrictions may limit practical usability in legitimate professional contexts.
YouTube says it will move AI disclosures on Shorts and long-form videos to places viewers are more likely to notice. The platform will also start automatically identifying and labeling AI-generated content. The move follows Google’s expanded AI verification efforts at I/O and signals a stronger push toward transparency around synthetic media on YouTube.