Publication:
A small footprint hybrid statistical/unit selection text-to-speech synthesis system for agglutinative languages

dc.contributor.authorGüner, Ekrem
dc.contributor.authorDemiroğlu, Cenk
dc.contributor.departmentElectrical & Electronics Engineering
dc.contributor.ozuauthorDEMİROĞLU, Cenk
dc.contributor.ozugradstudentGüner, Ekrem
dc.date.accessioned2014-11-25T09:13:20Z
dc.date.available2014-11-25T09:13:20Z
dc.date.issued2012
dc.descriptionDue to copyright restrictions, the access to the full text of this article is only available via subscription.en_US
dc.description.abstractDespite its success, unit selection based text-to-speech synthesis (TTS) has has some disadvantages such as sudden discontinuities in speech that distract the listeners. The HMM-based TTS (HTS) approach has been increasingly getting more attention from the TTS research community. One of the advantage is the lack of spurious errors that are observed in the unit selection scheme. Another advantage of the HTS system is the small memory footprint requirement which makes it attractive for embedded devices. Here, we propose a novel hybrid statistical unit selection TTS system for agglutinative languages that aims at improving the quality of the baseline HTS system while keeping the memory footprint small. The intelligibility and quality scores of the baseline system are comparable to the MOS scores of English reported in the Blizzard Challenge tests. Listeners preferred the hybrid system over the baseline system in the A/B preference tests.en_US
dc.description.sponsorshipTÜBİTAK
dc.identifier.doi10.1109/ICASSP.2012.6288927
dc.identifier.endpage4540
dc.identifier.isbn978-1-4673-0044-5
dc.identifier.scopus2-s2.0-84867605441
dc.identifier.startpage4537
dc.identifier.urihttp://hdl.handle.net/10679/674
dc.identifier.urihttps://doi.org/10.1109/ICASSP.2012.6288927
dc.identifier.wos000312381404152
dc.language.isoengen_US
dc.peerreviewedyesen_US
dc.publicationstatuspublisheden_US
dc.publisherIEEEen_US
dc.relationinfo:eu-repo/grantAgreement/TUBITAK/1001 - Araştırma/109E281en_US
dc.relation.ispartofAcoustics, Speech and Signal Processing (ICASSP), 2012 IEEE International Conference on
dc.relation.publicationcategoryInternational
dc.rightsrestrictedAccess
dc.subject.keywordsHidden Markov modelsen_US
dc.subject.keywordsNatural language processingen_US
dc.subject.keywordsSpeech intelligibilityen_US
dc.subject.keywordsSpeech synthesisen_US
dc.subject.keywordsStatistical analysisen_US
dc.titleA small footprint hybrid statistical/unit selection text-to-speech synthesis system for agglutinative languagesen_US
dc.typeconferenceObjecten_US
dspace.entity.typePublication
relation.isOrgUnitOfPublication7b58c5c4-dccc-40a3-aaf2-9b209113b763
relation.isOrgUnitOfPublication.latestForDiscovery7b58c5c4-dccc-40a3-aaf2-9b209113b763

Files

License bundle

Now showing 1 - 1 of 1
Placeholder
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections