Search

Now showing items 1-10 of 27

Nearest neighbor approach in speaker adaptation for HMM-based speech synthesis

Mohammadi, Amir; Demiroğlu, Cenk (IEEE, 2013)

Statistical speech synthesis (SSS) approach has become one of the most popular and successful methods in the speech synthesis field. Smooth speech transitions, without the spurious errors that are observed in unit selection ...

SAS : A speaker verification spoofing database containing diverse attacks

Wu, Z.; Khodabakhsh, Ali; Demiroğlu, Cenk; Yamagishi, J.; Saito, D.; Toda, T.; King, S. (IEEE, 2015)

This paper presents the first version of a speaker verification spoofing and anti-spoofing database, named SAS corpus. The corpus includes nine spoofing techniques, two of which are speech synthesis, and seven are voice ...

Natural language features for detection of Alzheimer's disease in conversational speech

Khodabakhsh, Ali; Kuşçuoğlu, Serhan; Demiroğlu, Cenk (IEEE, 2014)

Automatic monitoring of the patients with Alzheimer's disease and diagnosis of the disease in early stages can have a significant impact on the society. Here, we investigate an automatic diagnosis approach through the use ...

Hybrid nearest-neighbor/cluster adaptive training for rapid speaker adaptation in statistical speech synthesis systems

Mohammadi, Amir; Demiroğlu, Cenk (International Speech Communication Association, 2013)

Statistical speech synthesis (SSS) approach has become one of the most popular methods in the speech synthesis field. An advantage of the SSS approach is the ability to adapt to a target speaker with a couple of minutes ...

Finding relevant features for statistical speech synthesis adaptation

Bruneau, P.; Parisot, O.; Mohammadi, Amir; Demiroğlu, Cenk; Ghoniem, M.; Tamisier, T. (European Language Resources Association, 2014-05)

Statistical speech synthesis (SSS) models typically lie in a very high-dimensional space. They can be used to allow speech synthesis on digital devices, using only few sentences of input by the user. However, the adaptation ...

DNN-based speaker-adaptive postfiltering with limited adaptation data for statistical speech synthesis systems

Öztürk, M. G.; Ulusoy, O.; Demiroğlu, Cenk (IEEE, 2019)

Deep neural networks (DNNs) have been successfully deployed for acoustic modelling in statistical parametric speech synthesis (SPSS) systems. Moreover, DNN-based postfilters (PF) have also been shown to outperform conventional ...

Eigenvoice speaker adaptation with minimal data for statistical speech synthesis systems using a MAP approach and nearest-neighbors

Mohammadi, Amir; Sarfjoo, Seyyed Saeed; Demiroğlu, Cenk (IEEE, 2014-12)

Statistical speech synthesis (SSS) systems have the ability to adapt to a target speaker with a couple of minutes of adaptation data. Developing adaptation algorithms to further reduce the number of adaptation utterances ...

Search

Nearest neighbor approach in speaker adaptation for HMM-based speech synthesis

SAS : A speaker verification spoofing database containing diverse attacks

Natural language features for detection of Alzheimer's disease in conversational speech

Hybrid nearest-neighbor/cluster adaptive training for rapid speaker adaptation in statistical speech synthesis systems

Finding relevant features for statistical speech synthesis adaptation

DNN-based speaker-adaptive postfiltering with limited adaptation data for statistical speech synthesis systems

Eigenvoice speaker adaptation with minimal data for statistical speech synthesis systems using a MAP approach and nearest-neighbors

Depression-level assessment from multi-lingual conversational speech data using acoustic and text features

A small footprint hybrid statistical/unit selection text-to-speech synthesis system for agglutinative languages

Analysis of speaker similarity in the statistical speech synthesis systems using a hybrid approach

Browse

My Account

Discover

Search

Filters

Nearest neighbor approach in speaker adaptation for HMM-based speech synthesis

SAS : A speaker verification spoofing database containing diverse attacks

Natural language features for detection of Alzheimer's disease in conversational speech

Hybrid nearest-neighbor/cluster adaptive training for rapid speaker adaptation in statistical speech synthesis systems

Finding relevant features for statistical speech synthesis adaptation

DNN-based speaker-adaptive postfiltering with limited adaptation data for statistical speech synthesis systems

Eigenvoice speaker adaptation with minimal data for statistical speech synthesis systems using a MAP approach and nearest-neighbors

Depression-level assessment from multi-lingual conversational speech data using acoustic and text features

A small footprint hybrid statistical/unit selection text-to-speech synthesis system for agglutinative languages

Analysis of speaker similarity in the statistical speech synthesis systems using a hybrid approach