Anthropic has updated its AI safety policy to better guard against misuse, reports VentureBeat's Michael Nuñez. The new “Capability Thresholds” define benchmarks for risky AI model capabilities, such as bioweapons development or autonomous AI research; if a model reaches one of these thresholds, additional safeguards are triggered. The revised policy also spells out more detailed responsibilities for the “Responsible Scaling Officer,” who oversees compliance with the safety standards. Anthropic hopes the policy will serve as a blueprint for the wider AI industry and spark a race to the top in safety standards.