Show simple item record

dc.contributor.authorÖztürk, M. G.
dc.contributor.authorUlusoy, O.
dc.contributor.authorDemiroğlu, Cenk
dc.date.accessioned2020-08-27T14:39:00Z
dc.date.available2020-08-27T14:39:00Z
dc.date.issued2019
dc.identifier.isbn978-1-4799-8131-1
dc.identifier.issn1520-6149en_US
dc.identifier.urihttp://hdl.handle.net/10679/6846
dc.identifier.urihttps://ieeexplore.ieee.org/document/8683714
dc.description.abstractDeep neural networks (DNNs) have been successfully deployed for acoustic modelling in statistical parametric speech synthesis (SPSS) systems. Moreover, DNN-based postfilters (PF) have also been shown to outperform conventional postfilters that are widely used in SPSS systems for increasing the quality of synthesized speech. However, existing DNN-based postfilters are trained with speaker-dependent databases. Given that SPSS systems can rapidly adapt to new speakers from generic models, there is a need for DNN-based postfilters that can adapt to new speakers with minimal adaptation data. Here, we compare DNN-, RNN-, and CNN-based postfilters together with adversarial (GAN) training and cluster-based initialization (CI) for rapid adaptation. Results indicate that the feedforward (FF) DNN, together with GAN and CI, significantly outperforms the other recently proposed postfilters.en_US
dc.description.sponsorshipTÜBİTAK
dc.language.isoengen_US
dc.publisherIEEEen_US
dc.relationinfo:turkey/grantAgreement/TUBITAK/115E922
dc.relation.ispartofICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
dc.rightsrestrictedAccess
dc.titleDNN-based speaker-adaptive postfiltering with limited adaptation data for statistical speech synthesis systemsen_US
dc.typeConference paperen_US
dc.publicationstatusPublisheden_US
dc.contributor.departmentÖzyeğin University
dc.contributor.authorID(ORCID 0000-0002-6160-3169 & YÖK ID 144947) Demiroğlu, Cenk
dc.contributor.ozuauthorDemiroğlu, Cenk
dc.identifier.startpage7030en_US
dc.identifier.endpage7034en_US
dc.identifier.wosWOS:000482554007053
dc.identifier.doi10.1109/ICASSP.2019.8683714en_US
dc.subject.keywordsSpeaker adaptationen_US
dc.subject.keywordsSpeech synthesisen_US
dc.subject.keywordsPostfilteren_US
dc.subject.keywordsDeep learningen_US
dc.identifier.scopusSCOPUS:2-s2.0-85069005473
dc.contributor.authorMale1
dc.relation.publicationcategoryConference Paper - International - Institutional Academic Staff


Files in this item

FilesSizeFormatView

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record


Share this page