Browsing Faculty of Engineering by Author "Demiroğlu, Cenk"
Now showing items 21-40 of 43
Hybrid nearest-neighbor/cluster adaptive training for rapid speaker adaptation in statistical speech synthesis systems
Mohammadi, Amir; Demiroğlu, Cenk (International Speech Communication Association, 2013) The statistical speech synthesis (SSS) approach has become one of the most popular methods in the speech synthesis field. An advantage of the SSS approach is the ability to adapt to a target speaker with a couple of minutes ...
A hybrid statistical/unit-selection text-to-speech synthesis system for morphologically rich languages
Güner, Ekrem (2013-06) The two most prominent examples of Text-to-Speech (TTS) systems are Unit Selection based TTS (UTTS) and the Hidden Markov Model (HMM) based TTS (HTTS). UTTS has been the dominant approach of the last decade while HTTS has been ...
Hybrid statistical/unit-selection Turkish speech synthesis using suffix units
Demiroğlu, Cenk; Güner, Ekrem (Springer International Publishing, 2016-12) Unit selection based text-to-speech synthesis (TTS) has been the dominant TTS approach of the last decade. Despite its success, the unit selection approach has its disadvantages. One of the most significant disadvantages is ...
Konuşmacı aradeğerlemeli SMM tabanlı metinden konuşma sentezleme sistemi [HMM-based text-to-speech synthesis system with speaker interpolation]
Orhan, Mustafa Cem; Demiroğlu, Cenk (IEEE, 2011) Hidden Markov Model (HMM) based text-to-speech (TTS) systems offer many advantages compared to the concatenative approach. One of those advantages is the ability to interpolate between different speakers to generate new ...
LIG at MediaEval 2015 multimodal person discovery in broadcast TV task
Budnik, M.; Safadi, B.; Besacier, L.; Quénot, G.; Khodabakhsh, Ali; Demiroğlu, Cenk (CEUR-WS, 2015) In this working notes paper the contribution of the LIG team (partnership between Univ. Grenoble Alpes and Ozyegin University) to the Multimodal Person Discovery in Broadcast TV task in MediaEval 2015 is presented. The ...
Multi-lingual depression-level assessment from conversational speech using acoustic and text features
Özkanca, Yasin Serdar; Demiroğlu, Cenk; Besirli, A.; Çelik, S. (International Speech Communication Association, 2018) Depression is a common mental health problem around the world with a large burden on economies, well-being, hence productivity, of individuals. Its early diagnosis and treatment are critical to reduce the costs and even ...
NatiQ: An end-to-end text-to-speech system for Arabic
Abdelali, A.; Durrani, N.; Demiroğlu, Cenk; Dalvi, F.; Mubarak, H.; Darwish, K. (Association for Computational Linguistics (ACL), 2022) NatiQ is an end-to-end text-to-speech system for Arabic. Our speech synthesizer uses an encoder-decoder architecture with attention. We used both tacotron-based models (tacotron-1 and tacotron-2) and the faster transformer ...
Natural language features for detection of Alzheimer's disease in conversational speech
Khodabakhsh, Ali; Kuşçuoğlu, Serhan; Demiroğlu, Cenk (IEEE, 2014) Automatic monitoring of patients with Alzheimer's disease and diagnosis of the disease in early stages can have a significant impact on society. Here, we investigate an automatic diagnosis approach through the use ...
Nearest neighbor approach in speaker adaptation for HMM-based speech synthesis
Mohammadi, Amir; Demiroğlu, Cenk (IEEE, 2013) The statistical speech synthesis (SSS) approach has become one of the most popular and successful methods in the speech synthesis field. Smooth speech transitions, without the spurious errors that are observed in unit selection ...
OCR-aided person annotation and label propagation for speaker modeling in TV shows
Budnik, M.; Besacier, L.; Khodabakhsh, Ali; Demiroğlu, Cenk (IEEE, 2016) In this paper, we present an approach for minimizing human effort in manual speaker annotation. Label propagation is used at each iteration of an active learning cycle. More precisely, a selection strategy for choosing the ...
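ÖZÜ konuşmacı doğrulama sisteminin çok sınıflı senaryoda NIST 2010 veritabanı ile başarımı [Performance of the ÖZÜ speaker verification system in a multi-class scenario with the NIST 2010 database]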
Yeşil, Fatih; Demiroğlu, Cenk (IEEE, 2011) The performance of speaker verification systems is typically measured based on their binary decision accuracy. However, in speaker verification applications where close to 100% accuracy is required, such as the systems ...
Parkinson’s disease diagnosis using machine learning and voice
Wroge, T. J.; Özkanca, Yasin Serdar; Demiroğlu, Cenk; Si, D.; Atkins, D. C.; Ghomi, R. H. (IEEE, 2018) Biomarkers derived from human voice can offer insight into neurological disorders, such as Parkinson's disease (PD), because of their underlying cognitive and neuromuscular function. PD is a progressive neurodegenerative ...
Performance of the OZU speaker verification systems with the NIST SRE 2010 data in a multi-class scenario
Yeşil, Fatih; Demiroğlu, Cenk (IEEE, 2012) The performance of speaker verification systems is typically based on a binary decision mechanism. However, in applications that require close to 100% accuracy, such as the call centers of financial companies, the binary decisions of existing systems ...
Postprocessing synthetic speech with a complex cepstrum vocoder for spoofing phase-based synthetic speech detectors
Demiroğlu, Cenk; Buyuk, O.; Khodabakhsh, Ali; Maia, R. (IEEE, 2017-06) State-of-the-art speaker verification systems are vulnerable to spoofing attacks. To address the issue, high-performance synthetic speech detectors (SSDs) for existing spoofing methods have been proposed. Phase-based SSDs ...
SAS: A speaker verification spoofing database containing diverse attacks
Wu, Z.; Khodabakhsh, Ali; Demiroğlu, Cenk; Yamagishi, J.; Saito, D.; Toda, T.; King, S. (IEEE, 2015) This paper presents the first version of a speaker verification spoofing and anti-spoofing database, named the SAS corpus. The corpus includes nine spoofing techniques, two of which are speech synthesis, and seven are voice ...
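Sesli yanıt sistemi çağrı akışında dilbilgisi tabanlı Türkçe konuşma tanıma sistemi tanıtımı [Introduction of a grammar-based Turkish speech recognition system in an interactive voice response call flow]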
Karagöz, Gün; Demiroğlu, Cenk (IEEE, 2012) This paper describes a grammar-based Turkish speech recognition system within the interactive voice response system used in call centers. In this work, a sample of the call center system of a telecommunications company ...
A small footprint hybrid statistical and unit selection text-to-speech synthesis system for Turkish
Güner, Ekrem; Demiroğlu, Cenk (Springer Science+Business Media, 2012) Unit selection based text-to-speech synthesis (TTS) can generate high quality speech. However, HMM-based text-to-speech (HTS) also has advantages such as the lack of spurious errors that are observed in the unit selection ...
A small footprint hybrid statistical/unit selection text-to-speech synthesis system for agglutinative languages
Güner, Ekrem; Demiroğlu, Cenk (IEEE, 2012) Despite its success, unit selection based text-to-speech synthesis (TTS) has some disadvantages such as sudden discontinuities in speech that distract the listeners. The HMM-based TTS (HTS) approach has been increasingly ...
Spoofing attacks to i-vector based voice verification systems using statistical speech synthesis with additive noise and countermeasure
Özbay, Mustafa Caner; Khodabakhsh, Ali; Mohammadi, Amir; Demiroğlu, Cenk (IEEE, 2016) Even though improvements in the speaker verification (SV) technology with i-vectors increased their real-life deployment, their vulnerability to spoofing attacks is a major concern. Here, we investigated the effectiveness ...
Spoofing voice verification systems with statistical speech synthesis using limited adaptation data
Khodabakhsh, Ali; Mohammadi, Amir; Demiroğlu, Cenk (Elsevier, 2017-03) State-of-the-art speaker verification systems are vulnerable to spoofing attacks using speech synthesis. To solve the issue, high-performance synthetic speech detectors (SSDs) for attack methods have been proposed recently. ...