Publication:
OCR-aided person annotation and label propagation for speaker modeling in TV shows

Type

conferenceObject

Access

restrictedAccess

Publication Status

published

Abstract

In this paper, we present an approach for minimizing human effort in manual speaker annotation. Label propagation is used at each iteration of an active learning cycle. More precisely, a selection strategy chooses the most suitable speech track to be labeled, and four such strategies are evaluated. All the tracks in the corresponding cluster are then gathered using agglomerative clustering so that the human annotation can be propagated to them. To further reduce the manual labor required, an optical character recognition system is used to bootstrap annotations. At each step of the cycle, the annotations are used to build speaker models, and the quality of these models is evaluated using an i-vector based speaker identification system. The presented approach shows promising results on the REPERE corpus with a minimal amount of human annotation effort.
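
The cycle described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not name the four selection strategies, the track features, or the clustering settings, so everything below is an assumption made for illustration. Tracks are represented as small feature vectors, clustering is a naive average-linkage procedure with a hypothetical distance threshold, the selection strategy is "largest unlabeled cluster first", and the human annotator is simulated by an `oracle` function. Speaker model training and i-vector evaluation are left out.

```python
import math


def agglomerative_clusters(points, threshold):
    """Naive average-linkage agglomerative clustering.

    Merges the two closest clusters until the smallest average
    distance exceeds `threshold`. Returns lists of track indices.
    """
    clusters = [[i] for i in range(len(points))]

    def avg_dist(a, b):
        total = sum(math.dist(points[i], points[j]) for i in a for j in b)
        return total / (len(a) * len(b))

    while len(clusters) > 1:
        d, x, y = min(
            (avg_dist(clusters[x], clusters[y]), x, y)
            for x in range(len(clusters))
            for y in range(x + 1, len(clusters))
        )
        if d > threshold:
            break
        clusters[x] += clusters[y]  # y > x, so deleting y keeps x valid
        del clusters[y]
    return clusters


def propagate_annotations(points, ocr_labels, oracle, threshold, budget):
    """Bootstrap from OCR labels, then query the annotator per cluster.

    ocr_labels: {track index: name} obtained without human effort.
    oracle:     callable simulating the human annotator (assumption).
    budget:     maximum number of human queries.
    """
    clusters = agglomerative_clusters(points, threshold)
    labels = dict(ocr_labels)
    pending = []
    for cluster in clusters:
        seeds = [labels[t] for t in cluster if t in labels]
        if seeds:
            # Propagate the OCR-derived name to the rest of the cluster.
            for t in cluster:
                labels.setdefault(t, seeds[0])
        else:
            pending.append(cluster)

    queries = 0
    # Hypothetical selection strategy: largest unlabeled cluster first.
    for cluster in sorted(pending, key=len, reverse=True):
        if queries == budget:
            break
        name = oracle(cluster[0])  # one human annotation
        queries += 1
        for t in cluster:          # propagated to the whole cluster
            labels[t] = name
    return labels, queries


# Toy data: six tracks, three speakers, one name read by OCR.
points = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (5.0, 5.0), (5.1, 5.0), (9.0, 0.0)]
truth = ["A", "A", "A", "B", "B", "C"]
labels, queries = propagate_annotations(
    points, {3: "B"}, lambda t: truth[t], threshold=1.0, budget=2
)
```

On this toy input, one OCR-derived name and two simulated human annotations are enough to label all six tracks, which is the kind of saving the abstract refers to.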

Date

2016

Publisher

IEEE

Description

Due to copyright restrictions, access to the full text of this article is available only via subscription.
