A small footprint hybrid statistical/unit selection text-to-speech synthesis system for agglutinative languages

Güner, Ekrem; Demiroğlu, Cenk

Publication:
A small footprint hybrid statistical/unit selection text-to-speech synthesis system for agglutinative languages

dc.contributor.author	Güner, Ekrem
dc.contributor.author	Demiroğlu, Cenk
dc.contributor.department	Electrical & Electronics Engineering
dc.contributor.ozuauthor	DEMİROĞLU, Cenk
dc.contributor.ozugradstudent	Güner, Ekrem
dc.date.accessioned	2014-11-25T09:13:20Z
dc.date.available	2014-11-25T09:13:20Z
dc.date.issued	2012
dc.description	Due to copyright restrictions, the access to the full text of this article is only available via subscription.	en_US
dc.description.abstract	Despite its success, unit selection based text-to-speech synthesis (TTS) has has some disadvantages such as sudden discontinuities in speech that distract the listeners. The HMM-based TTS (HTS) approach has been increasingly getting more attention from the TTS research community. One of the advantage is the lack of spurious errors that are observed in the unit selection scheme. Another advantage of the HTS system is the small memory footprint requirement which makes it attractive for embedded devices. Here, we propose a novel hybrid statistical unit selection TTS system for agglutinative languages that aims at improving the quality of the baseline HTS system while keeping the memory footprint small. The intelligibility and quality scores of the baseline system are comparable to the MOS scores of English reported in the Blizzard Challenge tests. Listeners preferred the hybrid system over the baseline system in the A/B preference tests.	en_US
dc.description.sponsorship	TÜBİTAK
dc.identifier.doi	10.1109/ICASSP.2012.6288927
dc.identifier.endpage	4540
dc.identifier.isbn	978-1-4673-0044-5
dc.identifier.scopus	2-s2.0-84867605441
dc.identifier.startpage	4537
dc.identifier.uri	http://hdl.handle.net/10679/674
dc.identifier.uri	https://doi.org/10.1109/ICASSP.2012.6288927
dc.identifier.wos	000312381404152
dc.language.iso	eng	en_US
dc.peerreviewed	yes	en_US
dc.publicationstatus	published	en_US
dc.publisher	IEEE	en_US
dc.relation	info:eu-repo/grantAgreement/TUBITAK/1001 - Araştırma/109E281	en_US
dc.relation.ispartof	Acoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
dc.relation.publicationcategory	International
dc.rights	restrictedAccess
dc.subject.keywords	Hidden Markov models	en_US
dc.subject.keywords	Natural language processing	en_US
dc.subject.keywords	Speech intelligibility	en_US
dc.subject.keywords	Speech synthesis	en_US
dc.subject.keywords	Statistical analysis	en_US
dc.title	A small footprint hybrid statistical/unit selection text-to-speech synthesis system for agglutinative languages	en_US
dc.type	conferenceObject	en_US
dspace.entity.type	Publication
relation.isOrgUnitOfPublication	7b58c5c4-dccc-40a3-aaf2-9b209113b763
relation.isOrgUnitOfPublication.latestForDiscovery	7b58c5c4-dccc-40a3-aaf2-9b209113b763

Files

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.71 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Computer Science

Publication: A small footprint hybrid statistical/unit selection text-to-speech synthesis system for agglutinative languages

Files

License bundle

Collections

Publication:
A small footprint hybrid statistical/unit selection text-to-speech synthesis system for agglutinative languages