Show simple item record

dc.contributor.authorSarfjoo, Seyyed Saeed
dc.contributor.authorDemiroğlu, Cenk
dc.contributor.authorKing, S.
dc.date.accessioned2017-05-04T13:11:19Z
dc.date.available2017-05-04T13:11:19Z
dc.date.issued2017-04
dc.identifier.issn2329-9304en_US
dc.identifier.urihttp://hdl.handle.net/10679/5048
dc.identifier.urihttp://ieeexplore.ieee.org/document/7851067/
dc.description.abstractCross-lingual speaker adaptation for speech synthesis has many applications, such as use in speech-to-speech translation systems. Here, we focus on cross-lingual adaptation for statistical speech synthesis systems using limited adaptation data. To that end, we propose two eigenvoice adaptation approaches exploiting a bilingual Turkish-English speech database that we collected. In one approach, eigenvoice weights extracted using Turkish adaptation data and Turkish voice models are transformed into the eigenvoice weights for the English voice models using linear regression. Weighting the samples depending on the distance of reference speakers to target speakers during linear regression was found to improve the performance. Moreover, importance weighting the elements of the eigenvectors during regression further improved the performance. The second approach proposed here is speaker-specific state-mapping, which performed significantly better than the baseline state-mapping algorithm both in objective and subjective tests. Performance of the proposed state mapping algorithm was further improved when it was used with the intralingual eigenvoice approach instead of the linear-regression based algorithms used in the baseline system.
dc.description.sponsorshipEuropean Commission ; TUBITAK
dc.language.isoengen_US
dc.publisherIEEEen_US
dc.relationinfo:turkey/grantAgreement/TUBITAK
dc.relation.ispartofIEEE/ACM Transactions on Audio, Speech, and Language Processingen_US
dc.rightsrestrictedAccess
dc.titleUsing eigenvoices and nearest-neighbors in HMM-based cross-lingual speaker adaptation with limited dataen_US
dc.typeArticleen_US
dc.peerreviewedyesen_US
dc.publicationstatuspublisheden_US
dc.contributor.departmentÖzyeğin University
dc.contributor.authorID(ORCID 0000-0002-6160-3169 & YÖK ID 144947) Demiroğlu, Cenk
dc.contributor.ozuauthorDemiroğlu, Cenk
dc.identifier.volume25en_US
dc.identifier.issue4en_US
dc.identifier.wosWOS:000398178600011
dc.identifier.doi10.1109/TASLP.2017.2667880
dc.subject.keywordsCross lingual speaker adaptationen_US
dc.subject.keywordsStatistical speech synthesisen_US
dc.subject.keywordsSpeaker adaptationen_US
dc.subject.keywordsNearest neighbouren_US
dc.identifier.scopusSCOPUS:2-s2.0-85017650551
dc.contributor.ozugradstudentSarfjoo, Seyyed Saeed
dc.contributor.authorMale2


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record


Share this page