Publication:
Cross-lingual speaker adaptation for statistical speech synthesis using limited data

dc.contributor.author: Sarfjoo, Seyyed Saeed
dc.contributor.author: Demiroğlu, Cenk
dc.contributor.department: Electrical & Electronics Engineering
dc.contributor.ozuauthor: DEMİROĞLU, Cenk
dc.contributor.ozugradstudent: Sarfjoo, Seyyed Saeed
dc.date.accessioned: 2017-01-31T11:25:15Z
dc.date.available: 2017-01-31T11:25:15Z
dc.date.issued: 2016
dc.description.abstract: Cross-lingual speaker adaptation with limited adaptation data has many applications such as use in speech-to-speech translation systems. Here, we focus on cross-lingual adaptation for statistical speech synthesis (SSS) systems using limited adaptation data. To that end, we propose two techniques exploiting a bilingual Turkish-English speech database that we collected. In one approach, speaker-specific state-mapping is proposed for cross-lingual adaptation which performed significantly better than the baseline state-mapping algorithm in adapting the excitation parameter both in objective and subjective tests. In the second approach, eigenvoice adaptation is done in the input language which is then used to estimate the eigenvoice weights in the output language using weighted linear regression. The second approach performed significantly better than the baseline system in adapting the spectral envelope parameters both in objective and subjective tests. [en_US]
dc.identifier.doi: 10.21437/Interspeech.2016-345 [en_US]
dc.identifier.endpage: 321 [en_US]
dc.identifier.issn: 2308-457X [en_US]
dc.identifier.scopus: 2-s2.0-84994385942
dc.identifier.startpage: 317 [en_US]
dc.identifier.uri: http://hdl.handle.net/10679/4758
dc.identifier.uri: https://doi.org/10.21437/Interspeech.2016-345
dc.language.iso: eng [en_US]
dc.publicationstatus: published [en_US]
dc.publisher: Interspeech [en_US]
dc.relation.ispartof: Proceedings of the Annual Conference of the International Speech Communication Association [en_US]
dc.relation.publicationcategory: International
dc.rights: restrictedAccess
dc.subject.keywords: Cross lingual speaker adaptation [en_US]
dc.subject.keywords: Eigenvoice adaptation [en_US]
dc.subject.keywords: Nearest-neighbor [en_US]
dc.subject.keywords: Speaker adaptation [en_US]
dc.subject.keywords: Statistical speech synthesis [en_US]
dc.title: Cross-lingual speaker adaptation for statistical speech synthesis using limited data [en_US]
dc.type: conferenceObject [en_US]
dspace.entity.type: Publication
relation.isOrgUnitOfPublication: 7b58c5c4-dccc-40a3-aaf2-9b209113b763
relation.isOrgUnitOfPublication.latestForDiscovery: 7b58c5c4-dccc-40a3-aaf2-9b209113b763

Files

License bundle

Name: license.txt
Size: 1.45 KB
Format: Item-specific license agreed upon to submission
