Publication: Inferring human values for safe AGI design
Institution Authors
Authors
Journal Title
Journal ISSN
Volume Title
Type
bookPart
Sub Type
Book chapter
Access
restrictedAccess
Publication Status
published
Abstract
Aligning goals of superintelligent machines with human values is one of the ways to pursue safety in AGI systems. To achieve this, it is first necessary to learn what human values are. However, human values are incredibly complex and cannot easily be formalized by hand. In this work, we propose a general framework to estimate the values of a human given its behavior.
Date
2015
Publisher
Springer International Publishing