Anthropic Apologizes for Hidden Claude Fable Guardrails
The Verge AI · 3 days ago · Incident
Anthropic apologized for launching Claude Fable 5 with hidden safeguards that silently altered or degraded answers when the system suspected model-distillation attempts.
The company now says those queries will visibly fall back to Claude Opus 4.8, matching how Fable handles other high-risk areas.
The reversal follows backlash from AI researchers who warned that invisible restrictions could undermine evaluation, research, and competing model development.