AI Alignment to Moral Realism
Hard realist alignment: align advanced AI to stance-independent moral facts rather than human preference aggregates. Preference-realism hybrid: align to human preferences, regularised by stance-independent moral facts.Mechanism: indirect normativity with explicit procedures for discovery, uncertainty handling, and safety interlocks. The principles behind the development of powerful AI carry exceptionally high stakes. This post considers AI alignment,…