Exploration with intrinsic motivation using object–action–outcome latent space

Sener, M. İ.; Nagai, Y.; Öztop, Erhan; Uğur, E.


dc.contributor.author	Sener, M. İ.
dc.contributor.author	Nagai, Y.
dc.contributor.author	Öztop, Erhan
dc.contributor.author	Uğur, E.
dc.contributor.department	Computer Science
dc.contributor.ozuauthor	ÖZTOP, Erhan
dc.date.accessioned	2023-05-26T06:21:03Z
dc.date.available	2023-05-26T06:21:03Z
dc.date.issued	2023-06
dc.description.abstract	One effective approach for equipping artificial agents with sensorimotor skills is to use self-exploration. To do this efficiently is critical, as time and data collection are costly. In this study, we propose an exploration mechanism that blends action, object, and action outcome representations into a latent space, where local regions are formed to host forward model learning. The agent uses intrinsic motivation to select the forward model with the highest learning progress to adopt at a given exploration step. This parallels how infants learn, as high learning progress indicates that the learning problem is neither too easy nor too difficult in the selected region. The proposed approach is validated with a simulated robot in a table-top environment. The simulation scene comprises a robot and various objects, where the robot interacts with one of them each time using a set of parameterized actions and learns the outcomes of these interactions. With the proposed approach, the robot organizes its curriculum of learning as in existing intrinsic motivation approaches and outperforms them in learning speed. Moreover, the learning regime demonstrates features that partially match infant development; in particular, the proposed system learns to predict the outcomes of different skills in a staged manner.
dc.identifier.doi	10.1109/TCDS.2021.3062728
dc.identifier.endpage	336
dc.identifier.issn	2379-8920
dc.identifier.issue	2
dc.identifier.scopus	2-s2.0-85102270578
dc.identifier.startpage	325
dc.identifier.uri	http://hdl.handle.net/10679/8341
dc.identifier.uri	https://doi.org/10.1109/TCDS.2021.3062728
dc.identifier.volume	15
dc.identifier.wos	001005746000002
dc.language.iso	eng
dc.peerreviewed	yes
dc.publicationstatus	Published
dc.publisher	IEEE
dc.relation.ispartof	IEEE Transactions on Cognitive and Developmental Systems
dc.relation.publicationcategory	International Refereed Journal
dc.rights	restrictedAccess
dc.subject.keywords	Developmental robotics
dc.subject.keywords	Effect prediction
dc.subject.keywords	Intrinsic motivation (IM)
dc.subject.keywords	Open-ended learning
dc.subject.keywords	Representation learning
dc.title	Exploration with intrinsic motivation using object–action–outcome latent space
dc.type	article
dspace.entity.type	Publication
relation.isOrgUnitOfPublication	85662e71-2a61-492a-b407-df4d38ab90d7
relation.isOrgUnitOfPublication.latestForDiscovery	85662e71-2a61-492a-b407-df4d38ab90d7
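
The abstract describes selecting, at each exploration step, the forward model with the highest learning progress. The record does not give the paper's exact formulation, so what follows is only a minimal sketch under stated assumptions: learning progress is taken to be the drop in mean prediction error between two consecutive windows, and each region is reduced to a named list of past errors. The names `learning_progress`, `select_region`, `window`, and the three example regions are hypothetical and do not come from the paper.

```python
from typing import Dict, List, Sequence


def learning_progress(errors: Sequence[float], window: int = 5) -> float:
    """Drop in mean prediction error between the previous and the latest window.

    Assumption: this windowed difference stands in for the paper's measure.
    Returns 0.0 until at least 2 * window errors have been recorded.
    """
    if len(errors) < 2 * window:
        return 0.0
    older = errors[-2 * window:-window]
    recent = errors[-window:]
    return sum(older) / window - sum(recent) / window


def select_region(histories: Dict[str, List[float]], window: int = 5) -> str:
    """Return the name of the region whose forward model is improving fastest."""
    return max(histories, key=lambda name: learning_progress(histories[name], window))


if __name__ == "__main__":
    # Hypothetical error histories for three regions of the latent space.
    histories = {
        "too_easy": [0.01] * 10,   # already learned: error is low and flat
        "learnable": [1.0, 0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1],
        "too_hard": [1.0] * 10,    # not learnable yet: error is high and flat
    }
    print(select_region(histories))
```

In this toy setting, both the flat low-error region and the flat high-error region have zero progress, so the region with steadily falling error is chosen. This mirrors the abstract's remark that high learning progress marks a problem that is neither too easy nor too difficult.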

Collections

Computer Science
