Show simple item record

dc.contributor.authorBudnik, M.
dc.contributor.authorBesacier, L.
dc.contributor.authorKhodabakhsh, Ali
dc.contributor.authorDemiroğlu, Cenk
dc.date.accessioned2016-07-29T05:25:57Z
dc.date.available2016-07-29T05:25:57Z
dc.date.issued2016
dc.identifier.issn1520-6149
dc.identifier.urihttp://hdl.handle.net/10679/4330
dc.identifier.urihttp://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=7472743
dc.descriptionDue to copyright restrictions, the access to the full text of this article is only available via subscription.
dc.description.abstractIn this paper, we present an approach for minimizing human effort in manual speaker annotation. Label propagation is used at each iteration of an active learning cycle. More precisely, a selection strategy for choosing the most suitable speech track to be labeled is proposed. Four different selection strategies are evaluated and all the tracks in a corresponding cluster are gathered using agglomerative clustering in order to propagate human annotations. To further reduce the manual labor required, an optical character recognition system is used to bootstrap annotations. At each step of the cycle, annotations are used to build speaker models. The quality of the generated speaker models is evaluated at each step using an i-vector based speaker identification system. The presented approach shows promising results on the REPERE corpus with a minimum amount of human effort for annotation.
dc.language.isoengen_US
dc.publisherIEEE
dc.relation.ispartof2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
dc.rightsrestrictedAccess
dc.titleOCR-aided person annotation and label propagation for speaker modeling in TV showsen_US
dc.typeConference paperen_US
dc.peerreviewedyes
dc.publicationstatuspublisheden_US
dc.contributor.departmentÖzyeğin University
dc.contributor.authorID(ORCID 0000-0002-6160-3169 & YÖK ID 144947) Demiroğlu, Cenk
dc.contributor.ozuauthorDemiroğlu, Cenk
dc.identifier.startpage5570
dc.identifier.endpage5574
dc.identifier.wosWOS:000388373405144
dc.identifier.doi10.1109/ICASSP.2016.7472743
dc.subject.keywordsActive learning
dc.subject.keywordsAnnotation propagation
dc.subject.keywordsClustering
dc.subject.keywordsSpeaker identification
dc.subject.keywordsOCR
dc.identifier.scopusSCOPUS:2-s2.0-84973301088
dc.contributor.ozugradstudentKhodabakhsh, Ali
dc.contributor.authorMale2
dc.relation.publicationcategoryConference Paper - International - Institutional Academic Staff and Graduate Student


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record


Share this page