ACNMP: skill transfer and task extrapolation through learning from demonstration and reinforcement learning via representation sharing

Akbulut, M. T.; Öztop, Erhan; Xue, H.; Tekden, A. E.; Şeker, M. Y.; Uğur, E.

Publication:
ACNMP: skill transfer and task extrapolation through learning from demonstration and reinforcement learning via representation sharing

dc.contributor.author	Akbulut, M. T.
dc.contributor.author	Öztop, Erhan
dc.contributor.author	Xue, H.
dc.contributor.author	Tekden, A. E.
dc.contributor.author	Şeker, M. Y.
dc.contributor.author	Uğur, E.
dc.contributor.department	Computer Science
dc.contributor.ozuauthor	ÖZTOP, Erhan
dc.date.accessioned	2024-03-06T04:58:52Z
dc.date.available	2024-03-06T04:58:52Z
dc.date.issued	2020
dc.description.abstract	To equip robots with dexterous skills, an effective approach is to first transfer the desired skill via Learning from Demonstration (LfD), then let the robot improve it by self-exploration via Reinforcement Learning (RL). In this paper, we propose a novel LfD+RL framework, namely Adaptive Conditional Neural Movement Primitives (ACNMP), that allows efficient policy improvement in novel environments and effective skill transfer between different agents. This is achieved through exploiting the latent representation learned by the underlying Conditional Neural Process (CNP) model, and simultaneous training of the model with supervised learning (SL) for acquiring the demonstrated trajectories and via RL for new trajectory discovery. Through simulation experiments, we show that (i) ACNMP enables the system to extrapolate to situations where pure LfD fails; (ii) Simultaneous training of the system through SL and RL preserves the shape of demonstrations while adapting to novel situations due to the shared representations used by both learners; (iii) ACNMP enables order-of-magnitude sample-efficient RL in extrapolation of reaching tasks compared to the existing approaches; (iv) ACNMPs can be used to implement skill transfer between robots having different morphology, with competitive learning speeds and importantly with less number of assumptions compared to the state-of-the-art approaches. Finally, we show the real-world suitability of ACNMPs through real robot experiments that involve obstacle avoidance, pick and place and pouring actions.
dc.description.sponsorship	Horizon 2020 Framework Programme ; Core Research for Evolutional Science and Technology ; Osaka University ; TÜBİTAK
dc.identifier.endpage	1907
dc.identifier.issn	2640-3498
dc.identifier.scopus	2-s2.0-85175852693
dc.identifier.startpage	1896
dc.identifier.uri	http://hdl.handle.net/10679/9264
dc.identifier.volume	155
dc.language.iso	eng
dc.publicationstatus	Published
dc.publisher	ML Research Press
dc.relation.ispartof	Proceedings of Machine Learning Research
dc.relation.project	info:eu-repo/grantAgreement/EC/H2020/731761
dc.relation.publicationcategory	International
dc.rights	openAccess
dc.subject.keywords	Deep learning
dc.subject.keywords	Learning from demonstration
dc.subject.keywords	Reinforcement learning
dc.subject.keywords	Representation learning
dc.title	ACNMP: skill transfer and task extrapolation through learning from demonstration and reinforcement learning via representation sharing
dc.type	conferenceObject
dc.type.subtype	Conference paper
dspace.entity.type	Publication
relation.isOrgUnitOfPublication	85662e71-2a61-492a-b407-df4d38ab90d7
relation.isOrgUnitOfPublication.latestForDiscovery	85662e71-2a61-492a-b407-df4d38ab90d7

Files

Original bundle

Now showing 1 - 1 of 1

Name:: ACNMP skill transfer and task extrapolation through learning from demonstration and reinforcement learning via representation sharing.pdf
Size:: 3.83 MB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.45 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Computer Science

Publication: ACNMP: skill transfer and task extrapolation through learning from demonstration and reinforcement learning via representation sharing

Files

Original bundle

License bundle

Collections

Publication:
ACNMP: skill transfer and task extrapolation through learning from demonstration and reinforcement learning via representation sharing