Publication:
Inferring human values for safe AGI design

Placeholder

Institution Authors

Research Projects

Journal Title

Journal ISSN

Volume Title

Type

bookPart

Sub Type

Book chapter

Access

restrictedAccess

Publication Status

published

Journal Issue

Abstract

Aligning goals of superintelligent machines with human values is one of the ways to pursue safety in AGI systems. To achieve this, it is first necessary to learn what human values are. However, human values are incredibly complex and cannot easily be formalized by hand. In this work, we propose a general framework to estimate the values of a human given its behavior.

Date

2015

Publisher

Springer International Publishing

Description

Keywords

Citation

Collections


0

Views

0

Downloads