Using Consensus Mechanisms as an approach to Alignment — LessWrong

Foundational Challenges in Assuring Alignment and Safety of Large ...
arxiv.orgBuilding Effective AI Agents | Anthropic
anthropic.com
Using Consensus Mechanisms as an approach to Alignment — LessWrong