Publication: High-level features for resource economy and fast learning in skill transfer
dc.contributor.author | Ahmetoglu, A. | |
dc.contributor.author | Uğur, E. | |
dc.contributor.author | Asada, M. | |
dc.contributor.author | Öztop, Erhan | |
dc.contributor.department | Computer Science | |
dc.contributor.ozuauthor | ÖZTOP, Erhan | |
dc.date.accessioned | 2023-08-17T12:22:27Z | |
dc.date.available | 2023-08-17T12:22:27Z | |
dc.date.issued | 2022 | |
dc.description.abstract | Abstraction is an important aspect of intelligence which enables agents to construct robust representations for effective and efficient decision making. Although, deep neural networks are proven to be effective learning systems due to their ability to form increasingly complex abstractions at successive layers these abstractions are mostly distributed over many neurons, making the re-use of a learned skill costly and blind to the insights that can be obtained on the emergent representations. For avoiding designer bias and unsparing resource use, we propose to exploit neural response dynamics to form compact representations to use in skill transfer. For this, we consider two competing methods based on (1) maximum information compression principle and (2) the notion that abstract events tend to generate slowly changing signals, and apply them to the neural signals generated during task execution. To be concrete, in our simulation experiments, we either apply principal component analysis (PCA) or slow feature analysis (SFA) on the signals collected from the last hidden layer of a deep neural network while it performs a source task, and use these features for skill transfer in a new, target, task. We then compare the generalization and learning performance of these alternatives with the baselines of skill transfer with full layer output and no-transfer settings. Our experimental results on a simulated tabletop robot arm navigation task show that units that are created with SFA are the most successful for skill transfer. SFA as well as PCA, incur less resources compared to usual skill transfer where full layer outputs are used in the new task learning, whereby many units formed show a localized response reflecting end-effector-obstacle-goal relations. Finally, SFA units with the lowest eigenvalues resemble symbolic representations that highly correlate with high-level features such as joint angles and end-effector position which might be thought of as precursors for fully symbolic systems. | en_US |
dc.description.sponsorship | TÜBİTAK | |
dc.identifier.doi | 10.1080/01691864.2021.2019613 | en_US |
dc.identifier.endpage | 303 | en_US |
dc.identifier.issn | 0169-1864 | en_US |
dc.identifier.issue | 5-6 | en_US |
dc.identifier.scopus | 2-s2.0-85122870929 | |
dc.identifier.startpage | 291 | en_US |
dc.identifier.uri | http://hdl.handle.net/10679/8715 | |
dc.identifier.uri | https://doi.org/10.1080/01691864.2021.2019613 | |
dc.identifier.volume | 36 | en_US |
dc.identifier.wos | 000742298200001 | |
dc.language.iso | eng | en_US |
dc.peerreviewed | yes | en_US |
dc.publicationstatus | Published | en_US |
dc.publisher | Taylor & Francis | en_US |
dc.relation.ispartof | Advanced Robotics | |
dc.relation.publicationcategory | International Refereed Journal | |
dc.rights | openAccess | |
dc.subject.keywords | Reinforcement learning | en_US |
dc.subject.keywords | Symbol emergence | en_US |
dc.subject.keywords | Transfer learning | en_US |
dc.title | High-level features for resource economy and fast learning in skill transfer | en_US |
dc.type | article | en_US |
dspace.entity.type | Publication | |
relation.isOrgUnitOfPublication | 85662e71-2a61-492a-b407-df4d38ab90d7 | |
relation.isOrgUnitOfPublication.latestForDiscovery | 85662e71-2a61-492a-b407-df4d38ab90d7 |
Files
License bundle
1 - 1 of 1
- Name:
- license.txt
- Size:
- 1.45 KB
- Format:
- Item-specific license agreed upon to submission
- Description: