2026-02-25
Anthropic's RSP v3 is out! TLDR: unilateral commitments to specific mitigations for predefined capability thresholds are mostly out, in favor of commitments to much more detailed transparency around both safety roadmaps and risk reports. Also new threat models, new commitments
Time
Anthropic updates its Responsible Scaling Policy, separating the safety commitments it will make unilaterally from its industry recommendations
Anthropic, the wildly successful AI company that has cast itself as the most safety-conscious of the top research labs …