Exploration with intrinsic motivation using object–action–outcome latent space

Sener, M. İ.; Nagai, Y.; Öztop, Erhan; Uğur, E.

dc.contributor.author	Sener, M. İ.
dc.contributor.author	Nagai, Y.
dc.contributor.author	Öztop, Erhan
dc.contributor.author	Uğur, E.
dc.date.accessioned	2023-05-26T06:21:03Z
dc.date.available	2023-05-26T06:21:03Z
dc.date.issued	2023-06
dc.identifier.issn	2379-8920	en_US
dc.identifier.uri	http://hdl.handle.net/10679/8341
dc.identifier.uri	https://ieeexplore.ieee.org/document/9365689
dc.description.abstract	One effective approach for equipping artificial agents with sensorimotor skills is to use self-exploration. To do this efficiently is critical, as time and data collection are costly. In this study, we propose an exploration mechanism that blends action, object, and action outcome representations into a latent space, where local regions are formed to host forward model learning. The agent uses intrinsic motivation to select the forward model with the highest learning progress to adopt at a given exploration step. This parallels how infants learn, as high learning progress indicates that the learning problem is neither too easy nor too difficult in the selected region. The proposed approach is validated with a simulated robot in a table-top environment. The simulation scene comprises a robot and various objects, where the robot interacts with one of them each time using a set of parameterized actions and learns the outcomes of these interactions. With the proposed approach, the robot organizes its curriculum of learning as in existing intrinsic motivation approaches and outperforms them in learning speed. Moreover, the learning regime demonstrates features that partially match infant development; in particular, the proposed system learns to predict the outcomes of different skills in a staged manner.	en_US
dc.language.iso	eng	en_US
dc.publisher	IEEE	en_US
dc.relation.ispartof	IEEE Transactions on Cognitive and Developmental Systems
dc.rights	restrictedAccess
dc.title	Exploration with intrinsic motivation using object–action–outcome latent space	en_US
dc.type	Article	en_US
dc.peerreviewed	yes	en_US
dc.publicationstatus	Published	en_US
dc.contributor.department	Özyeğin University
dc.contributor.authorID	(ORCID 0000-0002-3051-6038 & YÖK ID 45227) Öztop, Erhan
dc.contributor.ozuauthor	Öztop, Erhan
dc.identifier.volume	15
dc.identifier.issue	2
dc.identifier.startpage	325
dc.identifier.endpage	336
dc.identifier.wos	WOS:001005746000002
dc.identifier.doi	10.1109/TCDS.2021.3062728	en_US
dc.subject.keywords	Developmental robotics	en_US
dc.subject.keywords	Effect prediction	en_US
dc.subject.keywords	Intrinsic motivation (IM)	en_US
dc.subject.keywords	Open-ended learning	en_US
dc.subject.keywords	Representation learning	en_US
dc.identifier.scopus	SCOPUS:2-s2.0-85102270578
dc.relation.publicationcategory	Article - International Refereed Journal - Institutional Academic Staff