Essay

Article | Essay

Value Space
By Adam Ford 2024-12-15 2025-05-18

The concept of a non-arbitrary objective value space offers a compelling framework for understanding how to navigate to more valuable worlds. This framework presupposes the existence of stance-independent normative truths – metaphysical/ontological, epistemic, scientific & ethical principles that rational agents will converge upon given sufficient cognitive sophistication [1]. Here I will mostly talk in relation…

Article | Essay

Will Superintelligence solely be Motivated by Brute Self-Interest?
By Adam Ford 2024-12-07 2024-12-09

Is self-interest by necessity the default motivation for all agents? Will Superintelligence necessarily become a narcissistic utility monster? No, self-interest is not necessarily the default motivation of all agents. While self-interest is a common and often foundational drive in many agents (biological or artificial), other motivations can and do arise, either naturally or through design, depending…

Article | Essay

Capability Control vs Motivation Selection: Contrasting Strategies for AI Safety
By Adam Ford 2024-12-05 2026-04-13

In AI safety, Control and Motivation are the two primary strategies used to prevent advanced AI systems from causing harm. While control focuses on external constraints, motivation addresses the internal goals and values of the AI. Most researchers, including those at Anthropic and the Future of Life Institute, suggest a hybrid approach: using strict capability…

Article | Essay

Reverse Wireheading
By Adam Ford 2024-11-18 2024-11-18

Concerning sentient AI, we would like to avoid unnecessary suffering in artificial systems. It’s hard for biological systems like humans to turn off suffering without appropriate pharmacology, e.g. anything from aspirin to anesthetics. AI may be able to self-administer painkillers – a kind of wireheading in reverse. Similarly to wireheading in AI systems…

Article | Essay

Securing Tomorrow: An Iterative Framework for Achieving Utopia
By Adam Ford 2024-11-15 2024-11-15

There is so much wrong with the world right now – I don’t know where to begin. I wake up early with dread sometimes – it’s hard to know what thread to tug at – the amorphous confusion festooning the great big everything seems so hopeless. It would be a lot easier if everyone were…

Article | Essay

Understanding the moral status of digital minds requires a mature understanding of sentience
By Adam Ford 2024-11-07 2024-11-07

Turning off all AI won’t happen in the real world. If we understand the signatures of sentience, we’ll be in a better position to know what to do to circumstantially prevent/mitigate it or encourage it. AI, especially LLMs, ‘claiming sentience’ isn’t enough… We need deep operational understandings of it. See the article by 80k…

Article | Essay

AI Alignment to Higher Values, not Human Values
By Adam Ford 2024-10-27 2026-05-23

The question is not, can AI be ethical, but can AI clear the bar humans have set? And once you see the bar – you realise we’ve been arguing about whether AI can limbo. “Align AI to human values” sounds sensible, even comforting. But once you look closely, it becomes far less clear that this…

Article | Essay | Opinion

On ASI Indifference
By Adam Ford 2024-10-22 2025-02-25

What if superintelligent AI didn’t care about humans? Does intelligence necessitate care? Probably not. While I find it hard to imagine care in the absence of cognitive capacity, I can’t see anything that guarantees all cognitively apt agents will harbour care. How, if at all possible, can we from our current position assess the likelihood that…

Article | Essay | Uncategorised

MRisk – Motivation Risk
By Adam Ford 2024-10-20 2026-04-08

What is M-Risk? M-Risk is the risk that a transformative AI may not be adequately motivated to pursue good values, even if it knows what they are or how to discover them. The AI might possess the cognitive capability to understand values (including their preference hierarchy to sentient agents) but lack the motivation to pursue…

Article | Essay

VRisk – Value Risk
By Adam Ford 2024-09-19 2025-10-10

This is an attempt to formally introduce the concept of value risk (v-risk or VRisk). This is not an attempt at repackaging well-known, established risk categories like x-risk and s-risk; it’s an attempt to carve out an axis of concern about value choice, hopefully to help add clarity and weight to what might otherwise sound…
