Search
Now showing items 1-10 of 27
Nearest neighbor approach in speaker adaptation for HMM-based speech synthesis
(IEEE, 2013)
Statistical speech synthesis (SSS) approach has become one of the most popular and successful methods in the speech synthesis field. Smooth speech transitions, without the spurious errors that are observed in unit selection ...
SAS : A speaker verification spoofing database containing diverse attacks
(IEEE, 2015)
This paper presents the first version of a speaker verification spoofing and anti-spoofing database, named SAS corpus. The corpus includes nine spoofing techniques, two of which are speech synthesis, and seven are voice ...
Natural language features for detection of Alzheimer's disease in conversational speech
(IEEE, 2014)
Automatic monitoring of the patients with Alzheimer's disease and diagnosis of the disease in early stages can have a significant impact on the society. Here, we investigate an automatic diagnosis approach through the use ...
Hybrid nearest-neighbor/cluster adaptive training for rapid speaker adaptation in statistical speech synthesis systems
(International Speech Communication Association, 2013)
Statistical speech synthesis (SSS) approach has become one of the most popular methods in the speech synthesis field. An advantage of the SSS approach is the ability to adapt to a target speaker with a couple of minutes ...
Finding relevant features for statistical speech synthesis adaptation
(European Language Resources Association, 2014-05)
Statistical speech synthesis (SSS) models typically lie in a very high-dimensional space. They can be used to allow speech synthesis on digital devices, using only few sentences of input by the user. However, the adaptation ...
DNN-based speaker-adaptive postfiltering with limited adaptation data for statistical speech synthesis systems
(IEEE, 2019)
Deep neural networks (DNNs) have been successfully deployed for acoustic modelling in statistical parametric speech synthesis (SPSS) systems. Moreover, DNN-based postfilters (PF) have also been shown to outperform conventional ...
Eigenvoice speaker adaptation with minimal data for statistical speech synthesis systems using a MAP approach and nearest-neighbors
(IEEE, 2014-12)
Statistical speech synthesis (SSS) systems have the ability to adapt to a target speaker with a couple of minutes of adaptation data. Developing adaptation algorithms to further reduce the number of adaptation utterances ...
Depression-level assessment from multi-lingual conversational speech data using acoustic and text features
(Springer Nature, 2020-11-17)
Depression is a widespread mental health problem around the world with a significant burden on economies. Its early diagnosis and treatment are critical to reduce the costs and even save lives. One key aspect to achieve ...
A small footprint hybrid statistical/unit selection text-to-speech synthesis system for agglutinative languages
(IEEE, 2012)
Despite its success, unit selection based text-to-speech synthesis (TTS) has has some disadvantages such as sudden discontinuities in speech that distract the listeners. The HMM-based TTS (HTS) approach has been increasingly ...
Analysis of speaker similarity in the statistical speech synthesis systems using a hybrid approach
(IEEE, 2012)
Statistical speech synthesis (SSS) approach has become one of the most popular and successful methods in the speech synthesis field. Smooth speech transitions, without the spurious errors that are observed in unit selection ...
Share this page