Publication:
Modeling the development of infant imitation using inverse reinforcement learning

dc.contributor.authorTekden, A. E.
dc.contributor.authorUgur, E.
dc.contributor.authorNagai, Y.
dc.contributor.authorÖztop, Erhan
dc.contributor.departmentComputer Science
dc.contributor.ozuauthorÖZTOP, Erhan
dc.date.accessioned2020-04-20T10:45:05Z
dc.date.available2020-04-20T10:45:05Z
dc.date.issued2018-09
dc.description.abstractLittle is known about the computational mechanisms of how imitation skills develop along with infant sensorimotor learning. In robotics, there are several well developed frameworks for imitation learning or so called learning by demonstration. Two paradigms dominate: Direct Learning (DL) and Inverse Reinforcement Learning (IRL). The former is a simple mechanism where the observed state and action pairs are associated to construct a copy of the action policy of the demonstrator. In the latter, an optimality principle or reward structure is sought that would explain the observed behavior as the optimal solution governed by the optimality principle or the reward function found. In this study, we explore the plausibility of whether some form of IRL mechanism in infants can facilitate imitation learning and understanding of others' behaviours. We propose that infants project the events taking place in the environment into their internal representations through a set of features that evolve during development. We implement this idea on a grid world environment, which can be considered as a simple model for reaching with obstacle avoidance. The observing infant has to imitate the demonstrator's reaching behavior through IRL by using various set of features that correspond to different stages of development. Our simulation results indicate that the U-shape performance change during imitation development observed in infants can be reproduced with the proposed model.en_US
dc.description.sponsorshipBogazici Resarch Fund (BAP) Startup project ; Slovenia/ARRS -Turkey/TUBITAK bilateral collaboration grant (ARRS Project) ; TÜBİTAK ; JST CREST Cognitive Mirroring, Japan
dc.identifier.doi10.1109/DEVLRN.2018.8761045en_US
dc.identifier.endpage160en_US
dc.identifier.isbn978-1-5386-6110-9
dc.identifier.issn2161-9484en_US
dc.identifier.scopus2-s2.0-85070382645
dc.identifier.startpage155en_US
dc.identifier.urihttp://hdl.handle.net/10679/6525
dc.identifier.urihttps://doi.org/10.1109/DEVLRN.2018.8761045
dc.identifier.wos000492050700023
dc.language.isoengen_US
dc.publicationstatusPublisheden_US
dc.publisherIEEEen_US
dc.relationinfo:turkey/grantAgreement/TUBITAK/215E271
dc.relation.ispartof2018 Joint IEEE 8th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)
dc.relation.publicationcategoryInternational
dc.rightsinfo:eu-repo/semantics/restrictedAccess
dc.subject.keywordsObserversen_US
dc.subject.keywordsTask analysisen_US
dc.subject.keywordsReinforcement learningen_US
dc.subject.keywordsTrajectoryen_US
dc.subject.keywordsRobot sensing systemsen_US
dc.subject.keywordsEntropyen_US
dc.titleModeling the development of infant imitation using inverse reinforcement learningen_US
dc.typeConference paperen_US
dspace.entity.typePublication
relation.isOrgUnitOfPublication85662e71-2a61-492a-b407-df4d38ab90d7
relation.isOrgUnitOfPublication.latestForDiscovery85662e71-2a61-492a-b407-df4d38ab90d7

Files

License bundle

Now showing 1 - 1 of 1
Placeholder
Name:
license.txt
Size:
1.45 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections