Supervising topic models with Gaussian processes

Kandemir, Melih; Kekeç, T.; Yeniterzi, Reyyan

dc.contributor.author	Kandemir, Melih
dc.contributor.author	Kekeç, T.
dc.contributor.author	Yeniterzi, Reyyan
dc.date.accessioned	2018-09-04T08:24:05Z
dc.date.available	2018-09-04T08:24:05Z
dc.date.issued	2018-05
dc.identifier.issn	0031-3203	en_US
dc.identifier.uri	http://hdl.handle.net/10679/5936
dc.identifier.uri	https://www.sciencedirect.com/science/article/pii/S0031320317305150
dc.description.abstract	Topic modeling is a powerful approach for modeling data represented as high-dimensional histograms. While the high dimensionality of such input data is extremely beneficial in unsupervised applications including language modeling and text data exploration, it introduces difficulties in cases where class information is available to boost up prediction performance. Feeding such input directly to a classifier suffers from the curse of dimensionality. Performing dimensionality reduction and classification disjointly, on the other hand, cannot enjoy optimal performance due to information loss in the gap between these two steps unaware of each other. Existing supervised topic models introduced as a remedy to such scenarios have thus far incorporated only linear classifiers in order to keep inference tractable, causing a dramatical sacrifice from expressive power. In this paper, we propose the first Bayesian construction to perform topic modeling and non-linear classification jointly. We use the well-known Latent Dirichlet Allocation (LDA) for topic modeling and sparse Gaussian processes for non-linear classification. We combine these two components by a latent variable encoding the empirical topic distribution of each document in the corpus. We achieve a novel variational inference scheme by adapting ideas from the newly emerging deep Gaussian processes into the realm of topic modeling. We demonstrate that our model outperforms other existing approaches such as: (i) disjoint LDA and non-linear classification, (ii) joint LDA and linear classification, (iii) joint non-LDA linear subspace modeling and linear classification, and (iv) non-linear classification without topic modeling, in three benchmark data sets from two real-world applications: text categorization and image tagging.	en_US
dc.description.sponsorship	Netherlands Organization for Scientific Research (NWO)
dc.language.iso	eng	en_US
dc.publisher	Elsevier	en_US
dc.relation.ispartof	Pattern Recognition
dc.rights	restrictedAccess
dc.title	Supervising topic models with Gaussian processes	en_US
dc.type	Article	en_US
dc.peerreviewed	yes	en_US
dc.publicationstatus	Published	en_US
dc.contributor.department	Özyeğin University
dc.contributor.authorID	(ORCID 0000-0001-6293-3656 & YÖK ID 258737) Kandemir, Melih
dc.contributor.authorID	(ORCID 0000-0002-8501-6209 & YÖK ID 258101) Yeniterzi, Reyyan
dc.contributor.ozuauthor	Kandemir, Melih
dc.contributor.ozuauthor	Yeniterzi, Reyyan
dc.identifier.volume	77	en_US
dc.identifier.startpage	226	en_US
dc.identifier.endpage	236	en_US
dc.identifier.wos	WOS:000426222800019
dc.identifier.doi	10.1016/j.patcog.2017.12.019	en_US
dc.subject.keywords	Latent Dirichlet allocation	en_US
dc.subject.keywords	Nonparametric Bayesian inference	en_US
dc.subject.keywords	Gaussian processes	en_US
dc.subject.keywords	Variational inference	en_US
dc.subject.keywords	Supervised topic models	en_US
dc.identifier.scopus	SCOPUS:2-s2.0-85044649648
dc.contributor.authorMale	1
dc.contributor.authorFemale	1
dc.relation.publicationcategory	Article - International Refereed Journal - Institutional Academic Staff