Publication: Modeling the development of infant imitation using inverse reinforcement learning
dc.contributor.author | Tekden, A. E. | |
dc.contributor.author | Ugur, E. | |
dc.contributor.author | Nagai, Y. | |
dc.contributor.author | Öztop, Erhan | |
dc.contributor.department | Computer Science | |
dc.contributor.ozuauthor | ÖZTOP, Erhan | |
dc.date.accessioned | 2020-04-20T10:45:05Z | |
dc.date.available | 2020-04-20T10:45:05Z | |
dc.date.issued | 2018-09 | |
dc.description.abstract | Little is known about the computational mechanisms through which imitation skills develop alongside infant sensorimotor learning. In robotics, there are several well-developed frameworks for imitation learning, also known as learning by demonstration. Two paradigms dominate: Direct Learning (DL) and Inverse Reinforcement Learning (IRL). The former is a simple mechanism in which the observed state and action pairs are associated to construct a copy of the demonstrator's action policy. In the latter, an optimality principle or reward structure is sought that explains the observed behavior as the optimal solution under that principle or reward function. In this study, we explore whether some form of IRL mechanism in infants could facilitate imitation learning and the understanding of others' behaviors. We propose that infants project the events taking place in the environment onto their internal representations through a set of features that evolve during development. We implement this idea in a grid world environment, which can be considered a simple model of reaching with obstacle avoidance. The observing infant has to imitate the demonstrator's reaching behavior through IRL, using various sets of features that correspond to different stages of development. Our simulation results indicate that the U-shaped change in imitation performance observed during infant development can be reproduced with the proposed model. | en_US |
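For readers unfamiliar with the setup described in the abstract, the following is a minimal, self-contained sketch of linear feature-based IRL on a small grid world. It illustrates the general technique only; the grid layout, obstacle positions, feature map, and hyperparameters are assumptions made for demonstration and do not reflect the authors' implementation.

```python
# Illustrative sketch only: linear feature-based IRL on a 5x5 grid world.
import numpy as np

N = 5                                            # 5x5 grid, states 0..24
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]     # up, down, left, right
GOAL = (4, 4)
OBSTACLES = {(2, 2), (2, 3)}                     # assumed obstacle cells

def step(s, a):
    """Deterministic transition; moves off the grid leave the state unchanged."""
    r, c = divmod(s, N)
    nr = min(max(r + a[0], 0), N - 1)
    nc = min(max(c + a[1], 0), N - 1)
    return nr * N + nc

def features(s):
    """Hypothetical feature map: goal proximity and obstacle contact."""
    r, c = divmod(s, N)
    return np.array([
        -(abs(r - GOAL[0]) + abs(c - GOAL[1])) / (2 * (N - 1)),
        -1.0 if (r, c) in OBSTACLES else 0.0,
    ])

PHI = np.array([features(s) for s in range(N * N)])   # (25, 2) feature matrix

def soft_policy(w, gamma=0.95, iters=80):
    """Soft (maximum-entropy style) value iteration under reward R = PHI @ w."""
    R = PHI @ w
    V = np.zeros(N * N)
    for _ in range(iters):
        Q = np.array([[R[s] + gamma * V[step(s, a)] for a in ACTIONS]
                      for s in range(N * N)])
        m = Q.max(axis=1, keepdims=True)
        V = (m + np.log(np.exp(Q - m).sum(axis=1, keepdims=True))).ravel()
    return np.exp(Q - V[:, None])                     # stochastic policy over actions

def rollout(policy, rng, start=0, T=15):
    """Sample one trajectory (list of visited states) from a stochastic policy."""
    traj, s = [], start
    for _ in range(T):
        traj.append(s)
        a = rng.choice(len(ACTIONS), p=policy[s] / policy[s].sum())
        s = step(s, ACTIONS[a])
    return traj

def feature_expectations(policy, n_trajs=30, seed=0):
    """Monte Carlo estimate of the average feature vector visited under a policy."""
    rng = np.random.default_rng(seed)
    return np.mean([PHI[rollout(policy, rng)].mean(axis=0) for _ in range(n_trajs)], axis=0)

# "Demonstrator": trajectories generated under a hand-chosen reward (an assumption).
mu_expert = feature_expectations(soft_policy(np.array([5.0, 5.0])), seed=1)

# IRL loop: adjust reward weights so the observer's feature expectations match the expert's.
w = np.zeros(2)
for it in range(40):
    mu_learner = feature_expectations(soft_policy(w), seed=2)
    w += 2.0 * (mu_expert - mu_learner)               # feature-matching gradient step

print("recovered reward weights:", w)
```

Swapping in a different features() function, for example one that omits the obstacle cue, loosely mimics the paper's idea of projecting observations through feature sets of different developmental stages and observing how imitation quality changes.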
dc.description.sponsorship | Bogazici Research Fund (BAP) Startup project ; Slovenia/ARRS - Turkey/TUBITAK bilateral collaboration grant (ARRS Project) ; TÜBİTAK ; JST CREST Cognitive Mirroring, Japan | |
dc.identifier.doi | 10.1109/DEVLRN.2018.8761045 | en_US |
dc.identifier.endpage | 160 | en_US |
dc.identifier.isbn | 978-1-5386-6110-9 | |
dc.identifier.issn | 2161-9484 | en_US |
dc.identifier.scopus | 2-s2.0-85070382645 | |
dc.identifier.startpage | 155 | en_US |
dc.identifier.uri | http://hdl.handle.net/10679/6525 | |
dc.identifier.uri | https://doi.org/10.1109/DEVLRN.2018.8761045 | |
dc.identifier.wos | 000492050700023 | |
dc.language.iso | eng | en_US |
dc.publicationstatus | Published | en_US |
dc.publisher | IEEE | en_US |
dc.relation | info:turkey/grantAgreement/TUBITAK/215E271 | |
dc.relation.ispartof | 2018 Joint IEEE 8th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob) | |
dc.relation.publicationcategory | International | |
dc.rights | info:eu-repo/semantics/restrictedAccess | |
dc.subject.keywords | Observers | en_US |
dc.subject.keywords | Task analysis | en_US |
dc.subject.keywords | Reinforcement learning | en_US |
dc.subject.keywords | Trajectory | en_US |
dc.subject.keywords | Robot sensing systems | en_US |
dc.subject.keywords | Entropy | en_US |
dc.title | Modeling the development of infant imitation using inverse reinforcement learning | en_US |
dc.type | Conference paper | en_US |
dspace.entity.type | Publication | |
relation.isOrgUnitOfPublication | 85662e71-2a61-492a-b407-df4d38ab90d7 | |
relation.isOrgUnitOfPublication.latestForDiscovery | 85662e71-2a61-492a-b407-df4d38ab90d7 |
Files
License bundle (1 - 1 of 1)
- Name: license.txt
- Size: 1.45 KB
- Format: Item-specific license agreed upon to submission