Generalization to unseen viewpoint images of objects via alleviated pose attentive capsule agreement

Özcan, Barış; Kınlı, Osman Furkan; Kıraç, Mustafa Furkan

Publication:
Generalization to unseen viewpoint images of objects via alleviated pose attentive capsule agreement

dc.contributor.author	Özcan, Barış
dc.contributor.author	Kınlı, Osman Furkan
dc.contributor.author	Kıraç, Mustafa Furkan
dc.contributor.department	Computer Science
dc.contributor.ozuauthor	KINLI, Osman Furkan
dc.contributor.ozuauthor	KIRAÇ, Mustafa Furkan
dc.contributor.ozugradstudent	Özcan, Barış
dc.date.accessioned	2024-01-11T06:51:57Z
dc.date.available	2024-01-11T06:51:57Z
dc.date.issued	2023-02
dc.description.abstract	Despite their achievements in object recognition, Convolutional Neural Networks (CNNs) particularly fail to generalize to unseen viewpoints of a learned object even with substantial samples. On the other hand, recently emerged capsule networks outperform CNNs in novel viewpoint generalization tasks even with significantly fewer parameters. Capsule networks group the neuron activations for representing higher level attributes and their interactions for achieving equivariance to visual transformations. However, capsule networks have a high computational cost for learning the interactions of capsules in consecutive layers via the, so called, routing algorithm. To address these issues, we propose a novel routing algorithm, Alleviated Pose Attentive Capsule Agreement (ALPACA) which is tailored for capsules that contain pose, feature and existence probability information together to enhance novel viewpoint generalization of capsules on 2D images. For this purpose, we have created a Novel ViewPoint Dataset (NVPD) a viewpoint-controlled texture-free dataset that has 8 different setups where training and test samples are formed by different viewpoints. In addition to NVPD, we have conducted experiments on iLab2M dataset where the dataset is split in terms of the object instances. Experimental results show that ALPACA outperforms its capsule network counterparts and state-of-the-art CNNs on iLab2M and NVPD datasets. Moreover, ALPACA is 10 times faster when compared to routing-based capsule networks. It also outperforms attention-based routing algorithms of the domain while keeping the inference and training times comparable. Lastly, our code, the NVPD dataset, test setups, and implemented models are freely available at https://github.com/Boazrciasn/ALPACA.
dc.identifier.doi	10.1007/s00521-022-07900-3
dc.identifier.endpage	3536
dc.identifier.issn	0941-0643
dc.identifier.issue	4
dc.identifier.scopus	2-s2.0-85139871182
dc.identifier.startpage	3521
dc.identifier.uri	http://hdl.handle.net/10679/9030
dc.identifier.uri	https://doi.org/10.1007/s00521-022-07900-3
dc.identifier.volume	35
dc.identifier.wos	000867543900001
dc.language.iso	eng
dc.peerreviewed	yes
dc.publicationstatus	Published
dc.publisher	Springer
dc.relation.ispartof	Neural Computing and Applications
dc.relation.publicationcategory	International Refereed Journal
dc.rights	restrictedAccess
dc.subject.keywords	Capsule networks
dc.subject.keywords	Neural networks
dc.subject.keywords	Novel viewpoint generalization
dc.subject.keywords	Quaternion neural networks
dc.title	Generalization to unseen viewpoint images of objects via alleviated pose attentive capsule agreement
dc.type	article
dspace.entity.type	Publication
relation.isOrgUnitOfPublication	85662e71-2a61-492a-b407-df4d38ab90d7
relation.isOrgUnitOfPublication.latestForDiscovery	85662e71-2a61-492a-b407-df4d38ab90d7

Files

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.45 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Computer Science

Publication: Generalization to unseen viewpoint images of objects via alleviated pose attentive capsule agreement

Files

License bundle

Collections

Publication:
Generalization to unseen viewpoint images of objects via alleviated pose attentive capsule agreement