Anthropic's moral compass architect suggested AI overcorrection could address historical injustices

Apr 22, 2026, 3:30 PM

Amanda Askell wrote the paper before joining Anthropic, where she now helps train Claude's character traits...

Redirecting to full article...