Security Researchers Criticize Anthropic Fable Safeguards as Too Strict
Hacker News (AI keywords)·4 days ago·Ethics
Anthropic released Fable as a public but limited version of its cybersecurity-focused Mythos model.
Security researchers say its guardrails trigger on broadly cyber-related wording and block legitimate tasks such as blog analysis, secure coding, and code review.
The restrictions aim to reduce misuse related to malware, software compromise, and biology, but the current implementation may frustrate legitimate security work.