Show simple item record

dc.contributor.authorSener, M. İ.
dc.contributor.authorNagai, Y.
dc.contributor.authorÖztop, Erhan
dc.contributor.authorUğur, E.
dc.date.accessioned2023-05-26T06:21:03Z
dc.date.available2023-05-26T06:21:03Z
dc.date.issued2023-06
dc.identifier.issn2379-8920en_US
dc.identifier.urihttp://hdl.handle.net/10679/8341
dc.identifier.urihttps://ieeexplore.ieee.org/document/9365689
dc.description.abstractOne effective approach for equipping artificial agents with sensorimotor skills is to use self-exploration. To do this efficiently is critical, as time and data collection are costly. In this study, we propose an exploration mechanism that blends action, object, and action outcome representations into a latent space, where local regions are formed to host forward model learning. The agent uses intrinsic motivation to select the forward model with the highest learning progress to adopt at a given exploration step. This parallels how infants learn, as high learning progress indicates that the learning problem is neither too easy nor too difficult in the selected region. The proposed approach is validated with a simulated robot in a table-top environment. The simulation scene comprises a robot and various objects, where the robot interacts with one of them each time using a set of parameterized actions and learns the outcomes of these interactions. With the proposed approach, the robot organizes its curriculum of learning as in existing intrinsic motivation approaches and outperforms them in learning speed. Moreover, the learning regime demonstrates features that partially match infant development; in particular, the proposed system learns to predict the outcomes of different skills in a staged manner.en_US
dc.language.isoengen_US
dc.publisherIEEEen_US
dc.relation.ispartofIEEE Transactions on Cognitive and Developmental Systems
dc.rightsrestrictedAccess
dc.titleExploration with intrinsic motivation using object–action–outcome latent spaceen_US
dc.typeArticleen_US
dc.peerreviewedyesen_US
dc.publicationstatusPublisheden_US
dc.contributor.departmentÖzyeğin University
dc.contributor.authorID(ORCID 0000-0002-3051-6038 & YÖK ID 45227) Öztop, Erhan
dc.contributor.ozuauthorÖztop, Erhan
dc.identifier.volume15
dc.identifier.issue2
dc.identifier.startpage325
dc.identifier.endpage336
dc.identifier.wosWOS:001005746000002
dc.identifier.doi10.1109/TCDS.2021.3062728en_US
dc.subject.keywordsDevelopmental roboticsen_US
dc.subject.keywordsEffect predictionen_US
dc.subject.keywordsIntrinsic motivation (IM)en_US
dc.subject.keywordsOpen-ended learningen_US
dc.subject.keywordsRepresentation learningen_US
dc.identifier.scopusSCOPUS:2-s2.0-85102270578
dc.relation.publicationcategoryArticle - International Refereed Journal - Institutional Academic Staff


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record


Share this page