Using Consensus Mechanisms as an approach to Alignment — LessWrong

"Aligned to whom?" remains a fundamental question with no consensus answer. Should AI systems align to the immediate operator (Christiano, 2018), the system designer (Gil, 2023), a specific group of humans, humanity as a whole (Miller, 2022), objective ethical principles, or the operator's hypothetical informed preferences? There are no agreed upon...
AI Safety Atlas
The field of game theory models interactions between rational decision makers, identifying dilemmas of cooperative, zero-sum, and symmetric games and their inverse. As a counterforce, decentralized autonomous organizations broadly apply mechanism design through economic incentives or social norms to achieve rough consensus and running orgs....
Kei Kreutler • Eight Qualities of Decentralized Autonomous Organizations
