maskedtorah · TEXXR

2026-02-25

Anthropic's RSP v3 is out! TLDR: unilateral commitments to specific mitigations for predefined capability thresholds are mostly out, in favor of commitments to much more detailed transparency around both safety roadmaps and risk reports. Also new threat models, new commitments

2026-02-25 View on X

Time

Anthropic updates its Responsible Scaling Policy, including separating the safety commitments it will make unilaterally and its industry recommendations

Anthropic, the wildly successful AI company that has cast itself as the most safety-conscious of the top research labs …

View original