Publication: Description-aware fashion image inpainting with convolutional neural networks in coarse-to-fine manner
dc.contributor.author | Kınlı, Osman Furkan | |
dc.contributor.author | Özcan, Barış | |
dc.contributor.author | Kıraç, Mustafa Furkan | |
dc.contributor.department | Computer Science | |
dc.contributor.ozuauthor | KINLI, Osman Furkan | |
dc.contributor.ozuauthor | KIRAÇ, Mustafa Furkan | |
dc.contributor.ozugradstudent | Özcan, Barış | |
dc.date.accessioned | 2021-06-23T09:31:40Z | |
dc.date.available | 2021-06-23T09:31:40Z | |
dc.date.issued | 2020-04-14 | |
dc.description.abstract | Inpainting a particular missing region in an image is a challenging vision task, and promising improvements on this task have been achieved with the help of recent developments in vision-related deep learning studies. Although it may have a direct impact on the decisions of AI-based fashion analysis systems, only a limited number of image inpainting studies have been conducted in the fashion domain so far. In this study, we propose a multi-modal generative deep learning approach for filling the missing parts in fashion images by constraining visual features with textual features extracted from image descriptions. Our model is composed of four main blocks: a textual feature extractor, a coarse image generator guided by textual features, a fine image generator that enhances the coarse output, and global and local discriminators that improve the refined outputs. Several experiments conducted on the FashionGen dataset with different combinations of neural network components show that our multi-modal approach is able to generate visually plausible patches to fill the missing parts in the images. | en_US |
dc.identifier.doi | 10.1145/3397125.3397155 | en_US |
dc.identifier.endpage | 79 | en_US |
dc.identifier.isbn | 978-145037749-2 | |
dc.identifier.scopus | 2-s2.0-85086180951 | |
dc.identifier.startpage | 74 | en_US |
dc.identifier.uri | http://hdl.handle.net/10679/7447 | |
dc.identifier.uri | https://doi.org/10.1145/3397125.3397155 | |
dc.language.iso | eng | en_US |
dc.publicationstatus | Published | en_US |
dc.publisher | The ACM Digital Library | en_US |
dc.relation.ispartof | ICCTA '20: Proceedings of the 2020 6th International Conference on Computer and Technology Applications | |
dc.relation.publicationcategory | International | |
dc.rights | restrictedAccess | |
dc.subject.keywords | Deep learning | en_US |
dc.subject.keywords | Fashion analysis | en_US |
dc.subject.keywords | Generative learning | en_US |
dc.subject.keywords | Image inpainting | en_US |
dc.subject.keywords | Image reconstruction | en_US |
dc.subject.keywords | Multi-modal neural networks | en_US |
dc.title | Description-aware fashion image inpainting with convolutional neural networks in coarse-to-fine manner | en_US |
dc.type | conferenceObject | en_US |
dspace.entity.type | Publication | |
relation.isOrgUnitOfPublication | 85662e71-2a61-492a-b407-df4d38ab90d7 | |
relation.isOrgUnitOfPublication.latestForDiscovery | 85662e71-2a61-492a-b407-df4d38ab90d7 |