Analysis of speaker similarity in the statistical speech synthesis systems using a hybrid approach

Güner, Ekrem; Mohammadi, A.; Demiroğlu, Cenk

Publication:
Analysis of speaker similarity in the statistical speech synthesis systems using a hybrid approach

dc.contributor.author	Güner, Ekrem
dc.contributor.author	Mohammadi, A.
dc.contributor.author	Demiroğlu, Cenk
dc.contributor.department	Electrical & Electronics Engineering
dc.contributor.ozuauthor	DEMİROĞLU, Cenk
dc.contributor.ozugradstudent	Güner, Ekrem
dc.date.accessioned	2014-11-25T11:34:41Z
dc.date.available	2014-11-25T11:34:41Z
dc.date.issued	2012
dc.description	Due to copyright restrictions, the access to the full text of this article is only available via subscription.	en_US
dc.description.abstract	Statistical speech synthesis (SSS) approach has become one of the most popular and successful methods in the speech synthesis field. Smooth speech transitions, without the spurious errors that are observed in unit selection systems, can be generated with the SSS approach. However, a well-known issue with SSS is the lack of voice similarity to the target speaker. The issue arises both in speaker-dependent models and models that are adapted from average voices. Moreover, in speaker adaptation, similarity to the target speaker does not increase significantly after around one minute of adaptation data which potentially indicates inherent bottleneck(s) in the system. Here, we propose using the hybrid speech synthesis approach to understand the key factors behind the speaker similarity problem. To that end, we try to answer the following question: which segments and parameters of speech, if generated/synthesized better, would have a substantial improvement on speaker similarity? In this work, our hybrid methods are described and listening test results are presented and discussed.	en_US
dc.identifier.endpage	2059
dc.identifier.isbn	978-1-4673-1068-0
dc.identifier.scopus	2-s2.0-84869747260
dc.identifier.startpage	2055
dc.identifier.uri	http://hdl.handle.net/10679/676
dc.identifier.wos	000310623800413
dc.language.iso	eng	en_US
dc.peerreviewed	yes	en_US
dc.publicationstatus	published	en_US
dc.publisher	IEEE	en_US
dc.relation.ispartof	Signal Processing Conference (EUSIPCO), 2012 Proceedings of the 20th European
dc.relation.publicationcategory	International
dc.rights	restrictedAccess
dc.subject.keywords	Speaker recognition	en_US
dc.subject.keywords	Speech synthesis	en_US
dc.subject.keywords	Statistical analysis	en_US
dc.title	Analysis of speaker similarity in the statistical speech synthesis systems using a hybrid approach	en_US
dc.type	conferenceObject	en_US
dspace.entity.type	Publication
relation.isOrgUnitOfPublication	7b58c5c4-dccc-40a3-aaf2-9b209113b763
relation.isOrgUnitOfPublication.latestForDiscovery	7b58c5c4-dccc-40a3-aaf2-9b209113b763

Files

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Computer Science

Publication: Analysis of speaker similarity in the statistical speech synthesis systems using a hybrid approach

Files

License bundle

Collections

Publication:
Analysis of speaker similarity in the statistical speech synthesis systems using a hybrid approach