Anthropic CEO Dario Amodei is calling for AI regulation to move beyond transparency requirements toward binding safety obligations. He argues that frontier models already present visible risks and should face mandatory testing across four major risk areas. Under his proposed approach, governments would have authority to block or deter deployment when systems fail to meet required safety standards.
Anthropic says it has been holding dialogues with religious, philosophical, ethical, and cross-cultural groups about frontier AI. The work focuses on moral formation, Claude’s constitution, and what kind of character an AI system should exhibit under pressure. The company also describes an early experiment in which Claude could call an ethical reminder tool during tasks; it reports that access to the tool reduced misaligned behavior in several internal evaluations.