Open Access BASE2022

Human-centred mechanism design with democratic AI

Abstract

This is the final version. Available on open access from Nature Research via the DOI in this record ; Data availability: All human data is available at https://github.com/deepmind/hcmd_dai ; Code availability: Code for reproducing figures is available at https://github.com/deepmind/hcmd_dai ; Building artificial intelligence (AI) that aligns with human values is an unsolved problem. Here, we developed a human-in-the-loop research pipeline called Democratic AI, in which reinforcement learning is used to design a social mechanism that humans prefer by majority. A large group of humans played an online investment game that involved deciding whether to keep a monetary endowment or to share it with others for collective benefit. Shared revenue was returned to players under two different redistribution mechanisms, one designed by the AI and the other by humans. The AI discovered a mechanism that redressed initial wealth imbalance, sanctioned free riders, and successfully won the majority vote. By optimizing for human preferences, Democratic AI offers a proof of concept for value-aligned policy innovation

Sprachen

Englisch

Verlag

Nature Research

DOI

10.1038/s41562-022-01383-x

Problem melden

Wenn Sie Probleme mit dem Zugriff auf einen gefundenen Titel haben, können Sie sich über dieses Formular gern an uns wenden. Schreiben Sie uns hierüber auch gern, wenn Ihnen Fehler in der Titelanzeige aufgefallen sind.