Understanding V-Risk: Navigating the Complex Landscape of Value in AI
In this post I explore what I explore what I broadly define as V-Risk (Value Risk), which I think is a critical and underrepresented concept in general, but especially for the alignment of artificial intelligence. There are two main areas of AI alignment: capability control and motivation selection. Values are what motivates approaches to fulfilling…