Complex Value Systems are Required to Realize Valuable Futures – A Critique

In his 2011 paper, Complex Value Systems are Required to Realize Valuable Futures, Eliezer Yudkowsky posits that aligning artificial general intelligence (AGI) or artificial superintelligence (ASI) with human values necessitates embedding the complex intricacies of human ethics into AI systems. He warns against oversimplification, suggesting that without a comprehensive inheritance of human values, AGI…

AI Values: Satiable vs Insatiable

In short: Satiable values have diminishing returns as resources grow, while insatiable values always want more, which can lead to risky AI behaviour – it seems likely that an AI with insatiable values could make existential trades, like gambling the world for a chance to double its resources. It may therefore be wise to design AI with satiable values to ensure stability,…
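A toy expected-utility calculation makes the excerpt's core claim concrete. The sketch below is not from the post: the bounded form u(r) = r/(r + k), the resource figures, and the 51% win odds are assumptions chosen for illustration. It shows a maximiser with linear (insatiable) utility accepting a slightly favourable double-or-nothing bet on everything it has, while one with bounded (satiable) utility refuses.

```python
# Hypothetical illustration: expected utility of a "double or nothing" gamble
# under a satiable (bounded) utility function versus an insatiable (linear) one.
# The bounded form u(r) = r / (r + k) and all numbers are assumptions.

def u_insatiable(resources: float) -> float:
    """Linear utility: every extra unit of resources is worth the same."""
    return resources

def u_satiable(resources: float, k: float = 100.0) -> float:
    """Bounded utility: approaches 1 as resources grow, so gains saturate."""
    return resources / (resources + k)

def expected_utility(u, current: float, p_win: float) -> float:
    """EU of a gamble that doubles resources with probability p_win, else loses all."""
    return p_win * u(2 * current) + (1 - p_win) * u(0.0)

current = 1000.0  # stand-in for "the world" in resource units

for name, u in [("insatiable", u_insatiable), ("satiable", u_satiable)]:
    eu_gamble = expected_utility(u, current, p_win=0.51)  # slightly favourable odds
    eu_keep = u(current)
    decision = "gamble" if eu_gamble > eu_keep else "keep"
    print(f"{name:10s}: EU(keep)={eu_keep:.4f}  EU(gamble)={eu_gamble:.4f}  -> {decision}")
```

Running this, the linear agent takes the bet (1020 expected vs 1000 kept) while the bounded agent keeps what it has, since doubling resources barely raises its utility but losing everything drops it to zero.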

On the Emergence of Biased Coherent Value Systems in AI as Value Risk

Firstly, values are important because they are the fundamental principles that guide human decisions and actions. They shape our understanding of right and wrong, and they influence how we interact with the world around us. In the context of AI, values are particularly important because they determine how AI systems will behave, what goals…

The Architecture of Value

Implications for AI Safety and Indirect Normativity

The challenge of understanding value space has moved from a philosophical curiosity to an urgent technical problem. As we approach the development of advanced artificial intelligence, the structure of value and our ability to specify it has become perhaps the most critical challenge facing humanity. This challenge assumes…

Capability Control vs Motivation Selection: Contrasting Strategies for AI Safety

Capability Control: The Tool or Slave Model

Capability control focuses on limiting what an AI can do, rather than shaping why it does it. The idea is to ensure AI systems behave predictably and stay within strict bounds – like a powerful but obedient tool. This includes:

In this model, the AI is treated less…
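One way to picture the "obedient tool" framing is as a hard whitelist wrapped around whatever the system proposes to do. The sketch below is purely illustrative and not from the post: the Agent interface, the action names, and the CapabilityError type are all hypothetical. The point it makes is that capability control ignores the system's motivations entirely and simply refuses anything outside the permitted set.

```python
# Minimal sketch of capability control as an action whitelist. All names
# (ALLOWED_ACTIONS, CapabilityError, controlled_execute) are illustrative
# assumptions; the post describes the idea only at a conceptual level.

ALLOWED_ACTIONS = {"read_file", "summarise", "answer_question"}

class CapabilityError(Exception):
    """Raised when a proposed action falls outside the capability boundary."""

def controlled_execute(proposed_action: str, payload: str) -> str:
    """Execute an action only if it is inside the hard capability boundary.

    The agent's motivations are irrelevant here: anything outside the
    whitelist is refused, which is the 'tool' framing of capability control.
    """
    if proposed_action not in ALLOWED_ACTIONS:
        raise CapabilityError(f"action {proposed_action!r} is outside the permitted set")
    return f"executed {proposed_action} on {payload!r}"

print(controlled_execute("summarise", "alignment paper"))
# controlled_execute("send_network_request", "...")  # would raise CapabilityError
```

Motivation selection, by contrast, would aim to make the agent never *want* to propose the disallowed action in the first place, rather than filtering it after the fact.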

VRisk – Value Risk

This is not an attempt at repackaging known concepts; it is an attempt to carve out an axis of concern, hopefully adding clarity and weight to concerns about value choice – concerns that would otherwise sound like a vague fear of “wrong values.”

What is Value Risk?

Value Risk (VRisk) is the risk that a system—technological…