Inferring human values for safe AGI design

Sezener, Can Eren

Authors

Sezener, Can Eren

Type

bookPart

Sub Type

Book chapter

Access

restrictedAccess

Publication Status

published

Abstract

Aligning goals of superintelligent machines with human values is one of the ways to pursue safety in AGI systems. To achieve this, it is first necessary to learn what human values are. However, human values are incredibly complex and cannot easily be formalized by hand. In this work, we propose a general framework to estimate the values of a human given its behavior.

Date

2015

Publisher

Springer International Publishing

URI

http://hdl.handle.net/10679/4184
https://doi.org/10.1007/978-3-319-21365-1_16

Collections

Computer Science
