The Problems of Direct Specification
In AI safety, direct specification refers to the attempt to program specific goals, rules, or behaviours directly into an AI system. AI trained to optimise values may over-optimise those we specify at the expense of real value we don’t specify. Political or market dynamics incentivise racing to powerful AI – the race dynamics come with…