Publication:
Nearest neighbor approach in speaker adaptation for HMM-based speech synthesis

dc.contributor.authorMohammadi, Amir
dc.contributor.authorDemiroğlu, Cenk
dc.contributor.departmentElectrical & Electronics Engineering
dc.contributor.ozuauthorDEMİROĞLU, Cenk
dc.contributor.ozugradstudentMohammadi, Amir
dc.date.accessioned2014-11-25T06:50:06Z
dc.date.available2014-11-25T06:50:06Z
dc.date.issued2013
dc.descriptionDue to copyright restrictions, the access to the full text of this article is only available via subscription.en_US
dc.description.abstractStatistical speech synthesis (SSS) approach has become one of the most popular and successful methods in the speech synthesis field. Smooth speech transitions, without the spurious errors that are observed in unit selection systems, can be generated with the SSS approach. Another advantage is the ability to adapt to a target speaker with a couple of minutes of adaptation data. However, many applications, especially in consumer electronics, require adaptation with only a few adaptation utterances. Here, we propose a rapid adaptation technique that first attempt to select a reference model that is close to the target speaker given a distance measure. Then, as opposed to adapting to target speaker from an average model, as typically done in most systems, adaptation is performed from the new reference model. The proposed system significantly outperformed a state-of-the-art baseline system both in objective and subjective tests especially only when one utterance is available for adaptation.en_US
dc.identifier.doi10.1109/SIU.2013.6531576
dc.identifier.endpage4
dc.identifier.isbn978-1-4673-5561-2
dc.identifier.scopus2-s2.0-84880868506
dc.identifier.startpage1
dc.identifier.urihttp://hdl.handle.net/10679/669
dc.identifier.wos000325005300416
dc.language.isoengen_US
dc.peerreviewedyesen_US
dc.publicationstatuspublisheden_US
dc.publisherIEEEen_US
dc.relation.ispartofSignal Processing and Communications Applications Conference (SIU), 2013 21st
dc.relation.publicationcategoryInternational
dc.rightsrestrictedAccess
dc.subject.keywordsHidden Markov modelsen_US
dc.subject.keywordsSpeech synthesisen_US
dc.subject.keywordsStatistical analysisen_US
dc.titleNearest neighbor approach in speaker adaptation for HMM-based speech synthesisen_US
dc.typeconferenceObjecten_US
dspace.entity.typePublication
relation.isOrgUnitOfPublication7b58c5c4-dccc-40a3-aaf2-9b209113b763
relation.isOrgUnitOfPublication.latestForDiscovery7b58c5c4-dccc-40a3-aaf2-9b209113b763

Files

License bundle

Now showing 1 - 1 of 1
Placeholder
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections