Publication:
Dropout regularization in hierarchical mixture of experts

dc.contributor.authorAlpaydın, Ahmet İbrahim Ethem
dc.contributor.departmentComputer Science
dc.contributor.ozuauthorALPAYDIN, Ahmet Ibrahim Ethem
dc.date.accessioned2022-09-07T12:24:00Z
dc.date.available2022-09-07T12:24:00Z
dc.date.issued2021-01-02
dc.description.abstractDropout is a very effective method in preventing overfitting and has become the go-to regularizer for multi-layer neural networks in recent years. Hierarchical mixture of experts is a hierarchically gated model that defines a soft decision tree where leaves correspond to experts and decision nodes correspond to gating models that softly choose between its children, and as such, the model defines a soft hierarchical partitioning of the input space. In this work, we propose a variant of dropout for hierarchical mixture of experts that is faithful to the tree hierarchy defined by the model, as opposed to having a flat, unitwise independent application of dropout as one has with multi-layer perceptrons. We show that on a synthetic regression data and on MNIST, CIFAR-10, and SSTB datasets, our proposed dropout mechanism prevents overfitting on trees with many levels improving generalization and providing smoother fits.en_US
dc.identifier.doi10.1016/j.neucom.2020.08.052en_US
dc.identifier.endpage156en_US
dc.identifier.issn0925-2312en_US
dc.identifier.scopus2-s2.0-85091258086
dc.identifier.startpage148en_US
dc.identifier.urihttp://hdl.handle.net/10679/7840
dc.identifier.urihttps://doi.org/10.1016/j.neucom.2020.08.052
dc.identifier.volume419en_US
dc.identifier.wos000590175500013
dc.language.isoengen_US
dc.peerreviewedyesen_US
dc.publisherElsevieren_US
dc.relation.ispartofNeurocomputing
dc.relation.publicationcategoryInternational Refereed Journal
dc.rightsrestrictedAccess
dc.subject.keywordsDropouten_US
dc.subject.keywordsHierarchical modelsen_US
dc.subject.keywordsMixture of expertsen_US
dc.subject.keywordsRegularizationen_US
dc.titleDropout regularization in hierarchical mixture of expertsen_US
dc.typearticleen_US
dspace.entity.typePublication
relation.isOrgUnitOfPublication85662e71-2a61-492a-b407-df4d38ab90d7
relation.isOrgUnitOfPublication.latestForDiscovery85662e71-2a61-492a-b407-df4d38ab90d7

Files

License bundle

Now showing 1 - 1 of 1
Placeholder
Name:
license.txt
Size:
1.45 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections