Complex Value Systems are Required to Realize Valuable Futures – A Critique

In his 2011 paper, Complex Value Systems are Required to Realize Valuable Futures, Eliezer Yudkowsky posits that aligning artificial general intelligence (AGI) or artificial superintelligence (ASI) with human values necessitates embedding the complex intricacies of human ethics into AI systems. He warns against oversimplification, suggesting that without a comprehensive inheritance of human values, AGI…


Nick Bostrom – AI Ethics – From Utility Maximisation to Humility & Cooperation

Norm-sensitive cooperation might beat brute-force optimisation when aligning AI with the complex layers of human value – and possibly cosmic ones too. Nick Bostrom suggests that AI systems designed with humility and a cooperative orientation are more likely to navigate the complex web of human and potentially cosmic norms than those driven by rigid utility…

AI Values: Satiable vs Insatiable

In short: satiable values yield diminishing returns as resources accumulate, while insatiable values always want more, which can lead to risky AI behaviour – it seems likely that an AI with insatiable values could make existential trades, such as gambling the world for a chance to double its resources. It may therefore be wise to design AI with satiable values to ensure stability,…
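To see why this matters, here is a minimal sketch (an illustration of the argument, not code from the post) comparing a slightly better-than-fair double-or-nothing gamble under a linear (insatiable) utility function and a bounded (satiable) one; the 51% win probability and the exponential saturation curve are assumptions chosen purely for illustration.

```python
import math

P_WIN = 0.51  # a slightly better-than-fair double-or-nothing gamble (assumed)

def linear_utility(x):
    # Insatiable: utility grows one-for-one with resources, without bound.
    return x

def bounded_utility(x, scale=1.0):
    # Satiable: utility saturates towards 1.0 as resources grow (diminishing returns).
    return 1.0 - math.exp(-x / scale)

def expected_gamble_utility(u, current=1.0):
    # Win: resources double. Lose: everything, including the world, is gone.
    return P_WIN * u(2.0 * current) + (1.0 - P_WIN) * u(0.0)

for name, u in [("insatiable (linear)", linear_utility),
                ("satiable (bounded)", bounded_utility)]:
    keep, gamble = u(1.0), expected_gamble_utility(u)
    print(f"{name}: keep={keep:.3f}, gamble={gamble:.3f}, takes bet: {gamble > keep}")
```

The linear agent accepts the bet at any positive edge, however catastrophic the downside, while the saturating agent refuses it; that asymmetry is the stability argument for satiable values.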

Human Values Approximate Ideals in Objective Value Space

Value Space and “Good” Human Values: Human values can be conceptualised as occupying regions in a vast objective multidimensional “value space.” These regions reflect preferences for cooperation, survival, flourishing, and minimising harm, among other positive traits. If TAI / SI can approximate the subset of human values deemed “good” (e.g., compassion, fairness, cooperation), while avoiding…
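To make the geometric picture concrete, here is a minimal sketch of my own (not from the post) that treats value profiles as points in a small value space and asks whether a candidate profile falls inside a “good” region; the dimension names, centroid, and radius are all invented for illustration.

```python
import math

# Hypothetical value dimensions and a "good" region -- every number here is
# invented purely to illustrate the geometric picture, not measured data.
DIMENSIONS = ["compassion", "fairness", "cooperation", "harm_avoidance"]
GOOD_CENTROID = [0.8, 0.8, 0.7, 0.9]
GOOD_RADIUS = 0.3  # how far a profile may drift from the centroid and still count as "good"

def distance(a, b):
    # Euclidean distance between two value profiles in value space.
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def in_good_region(profile):
    return distance(profile, GOOD_CENTROID) <= GOOD_RADIUS

human_like = [0.75, 0.85, 0.65, 0.85]  # close to the "good" centroid on every axis
drifted = [0.2, 0.9, 0.1, 0.5]         # high on fairness alone, low elsewhere

print(in_good_region(human_like))  # True: approximates the "good" subset
print(in_good_region(drifted))     # False: one strong value does not place it in the region
```

Real value space would be vastly higher-dimensional and the “good” region far harder to specify, but this membership question is what approximating “good” human values amounts to.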

The Architecture of Value

Implications for AI Safety and Indirect Normativity: The challenge of understanding value space has moved from a philosophical curiosity to an urgent technical problem. As we approach the development of advanced artificial intelligence, the structure of value and our ability to specify it have become perhaps the most critical challenge facing humanity. This challenge assumes…

Understanding V-Risk: Navigating the Complex Landscape of Value in AI

Will AI Ignore, Preserve or Improve on our Values? Imagine a future where advanced AIs follow every instruction flawlessly – but humanity feels strangely adrift. Our tools are obedient, but the ‘soul’ of our civilisation feels absent. This is the hidden danger of V-risk – value erosion. In this post I explore what I…

Value Space

The concept of a non-arbitrary objective value space offers a compelling framework for understanding how to navigate to more valuable worlds. This framework presupposes the existence of stance-independent normative truths – metaphysical/ontological, epistemic, scientific & ethical principles that rational agents will converge upon given sufficient cognitive sophistication [1]. Here I will mostly talk in relation…

Nick Bostrom: Why Focus on Existential Risk related to Machine Intelligence?

One can think of existential risk as a subcategory of global catastrophic risk – while GCRs are really bad, civilisation has the potential to recover from such a global catastrophic disaster. An existential risk is one from which there is no chance of recovery. An example of the sort of disaster that fits the…