Browsing Faculty of Engineering by Author "Demiroğlu, Cenk"

Now showing items 21-40 of 43

Hybrid nearest-neighbor/cluster adaptive training for rapid speaker adaptation in statistical speech synthesis systems

Mohammadi, Amir; Demiroğlu, Cenk (International Speech Communication Association, 2013)

Statistical speech synthesis (SSS) approach has become one of the most popular methods in the speech synthesis field. An advantage of the SSS approach is the ability to adapt to a target speaker with a couple of minutes ...
A hybrid statistical/unit-selection text-to-speech synthesis system for morphologically rich languages

Güner, Ekrem (2013-06)

Two most prominent examples of Text-to-Speech (TTS) systems are Unit Selection based TTS (UTTS) and the Hidden Markov Model (HMM) based TTS (HTTS). UTTS has been the dominant approach of the last decade while HTTS has been ...
Hybrid statistical/unit-selection Turkish speech synthesis using suffix units

Demiroğlu, Cenk; Güner, Ekrem (Springer International Publishing, 2016-12)

Unit selection based text-to-speech synthesis (TTS) has been the dominant TTS approach of the last decade. Despite its success, unit selection approach has its disadvantages. One of the most significant disadvantages is ...
Konuşmacı aradeğerlemeli SMM tabanlı metinden konuşma sentezleme si̇stemi

Orhan, Mustafa Cem; Demiroğlu, Cenk (IEEE, 2011)

Hidden Markov Model (HMM) based text-to-speech (TTS) systems offer many advantages compared to the concatenative approach. One of those advantages is the ability to interpolate between different speakers to generate new ...
LIG at MediaEval 2015 multimodal person discovery in broadcast TV task

Budnik, M.; Safadi, B.; Besacier, L.; Quénot, G.; Khodabakhsh, Ali; Demiroğlu, Cenk (CEUR-WS, 2015)

In this working notes paper the contribution of the LIG team (partnership between Univ. Grenoble Alpes and Ozyegin University) to the Multimodal Person Discovery in Broadcast TV task in MediaEval 2015 is presented. The ...
Multi-lingual depression-level assessment from conversational speech using acoustic and text features

Özkanca, Yasin Serdar; Demiroğlu, Cenk; Besirli, A.; Çelik, S. (International Speech Communication Association, 2018)

Depression is a common mental health problem around the world with a large burden on economies, well-being, hence productivity, of individuals. Its early diagnosis and treatment are critical to reduce the costs and even ...
NatiQ: An end-to-end text-to-speech system for arabic

Abdelali, A.; Durrani, N.; Demiroğlu, Cenk; Dalvi, F.; Mubarak, H.; Darwish, K. (Association for Computational Linguistics (ACL), 2022)

NatiQ is end-to-end text-to-speech system for Arabic. Our speech synthesizer uses an encoder-decoder architecture with attention. We used both tacotron-based models (tacotron-1 and tacotron-2) and the faster transformer ...
Natural language features for detection of Alzheimer's disease in conversational speech

Khodabakhsh, Ali; Kuşçuoğlu, Serhan; Demiroğlu, Cenk (IEEE, 2014)

Automatic monitoring of the patients with Alzheimer's disease and diagnosis of the disease in early stages can have a significant impact on the society. Here, we investigate an automatic diagnosis approach through the use ...
Nearest neighbor approach in speaker adaptation for HMM-based speech synthesis

Mohammadi, Amir; Demiroğlu, Cenk (IEEE, 2013)

Statistical speech synthesis (SSS) approach has become one of the most popular and successful methods in the speech synthesis field. Smooth speech transitions, without the spurious errors that are observed in unit selection ...
OCR-aided person annotation and label propagation for speaker modeling in TV shows

Budnik, M.; Besacier, L.; Khodabakhsh, Ali; Demiroğlu, Cenk (IEEE, 2016)

In this paper, we present an approach for minimizing human effort in manual speaker annotation. Label propagation is used at each iteration of an active learning cycle. More precisely, a selection strategy for choosing the ...
ÖZÜ konuşmacı doğrulama sisteminin çok sınıflı senaryoda NIST 2010 veritabanı ile başarımı

Yeşil, Fatih; Demiroğlu, Cenk (IEEE, 2011)

Performance of the speaker verification systems is typically measured based on their binary decision accuracy. However, in speaker verification applications where close to %100 accuracy is required, such as the systems ...
Parkinson’s disease diagnosis using machine learning and voice

Wroge, T. J.; Özkanca, Yasin Serdar; Demiroğlu, Cenk; Si, D.; Atkins, D. C.; Ghomi, R. H. (IEEE, 2018)

Biomarkers derived from human voice can offer in-sight into neurological disorders, such as Parkinson's disease (PD), because of their underlying cognitive and neuromuscular function. PD is a progressive neurodegenerative ...
Performance of the OZU speaker verification systems with the NIST SRE 2010 data in a multi-class scenario

Yeşil, Fatih; Demiroğlu, Cenk (IEEE, 2012)

Konuşmacı doğrulama sistemlerinin başarımı tipik olarak ikili karar mekanizmasına dayanır. Yine de finans şirketlerinin çağrı merkezi gibi 100%’ e yakın kesinlik gerektiren uygulamalarda var olan sistemlerin ikili kararlarına ...
Postprocessing synthetic speech with a complex cepstrum vocoder for spoofing phase-based synthetic speech detectors

Demiroğlu, Cenk; Buyuk, O.; Khodabakhsh, Ali; Maia, R. (IEEE, 2017-06)

State-of-the-art speaker verification systems are vulnerable to spoofing attacks. To address the issue, high-performance synthetic speech detectors (SSDs) for existing spoofing methods have been proposed. Phase-based SSDs ...
SAS : A speaker verification spoofing database containing diverse attacks

Wu, Z.; Khodabakhsh, Ali; Demiroğlu, Cenk; Yamagishi, J.; Saito, D.; Toda, T.; King, S. (IEEE, 2015)

This paper presents the first version of a speaker verification spoofing and anti-spoofing database, named SAS corpus. The corpus includes nine spoofing techniques, two of which are speech synthesis, and seven are voice ...
Sesli̇ yanıt si̇stemi̇ çaǧrı akışında di̇lbi̇lgi̇si̇ tabanlı Türkçe konuşma tanıma si̇stemi̇ tanıtımı

Karagöz, Gün; Demiroğlu, Cenk (IEEE, 2012)

Bu bildiride, çağrı merkezleri için kullanılan sesli yanıt sisteminde dilbilgisi-tabanlı Türkçe konuşma tanıma sistemi anlatılmaktadır. Yapılan çalışmada bir telekomünikasyon kurumunun çağrı merkezi sisteminin örneklemesi ...
A small footprint hybrid statistical and unit selection text-to-speech synthesis system for Turkish

Güner, Ekrem; Demiroğlu, Cenk (Springer Science+Business Media, 2012)

Unit selection based text-to-speech synthesis (TTS) can generate high quality speech. However, The HMM-based text-to-speech (HTS) has also advantages such as the lack of spurious errors that are observed in the unit selection ...
A small footprint hybrid statistical/unit selection text-to-speech synthesis system for agglutinative languages

Güner, Ekrem; Demiroğlu, Cenk (IEEE, 2012)

Despite its success, unit selection based text-to-speech synthesis (TTS) has has some disadvantages such as sudden discontinuities in speech that distract the listeners. The HMM-based TTS (HTS) approach has been increasingly ...
Spoofing attacks to i-vector based voice verification systems using statistical speech synthesis with additive noise and countermeasure

Özbay, Mustafa Caner; Khodabakhsh, Ali; Mohammadi, Amir; Demiroğlu, Cenk (IEEE, 2016)

Even though improvements in the speaker verification (SV) technology with i-vectors increased their real-life deployment, their vulnerability to spoofing attacks is a major concern. Here, we investigated the effectiveness ...
Spoofing voice verification systems with statistical speech synthesis using limited adaptation data

Khodabakhsh, Ali; Mohammadi, Amir; Demiroğlu, Cenk (Elsevier, 2017-03)

State-of-the-art speaker verification systems are vulnerable to spoofing attacks using speech synthesis. To solve the issue, high-performance synthetic speech detectors (SSDs) for attack methods have been proposed recently. ...

Share this page

Browsing Faculty of Engineering by Author "Demiroğlu, Cenk"

Hybrid nearest-neighbor/cluster adaptive training for rapid speaker adaptation in statistical speech synthesis systems

A hybrid statistical/unit-selection text-to-speech synthesis system for morphologically rich languages

Hybrid statistical/unit-selection Turkish speech synthesis using suffix units

Konuşmacı aradeğerlemeli SMM tabanlı metinden konuşma sentezleme si̇stemi

LIG at MediaEval 2015 multimodal person discovery in broadcast TV task

Multi-lingual depression-level assessment from conversational speech using acoustic and text features

NatiQ: An end-to-end text-to-speech system for arabic

Natural language features for detection of Alzheimer's disease in conversational speech

Nearest neighbor approach in speaker adaptation for HMM-based speech synthesis

OCR-aided person annotation and label propagation for speaker modeling in TV shows

ÖZÜ konuşmacı doğrulama sisteminin çok sınıflı senaryoda NIST 2010 veritabanı ile başarımı

Parkinson’s disease diagnosis using machine learning and voice

Performance of the OZU speaker verification systems with the NIST SRE 2010 data in a multi-class scenario

Postprocessing synthetic speech with a complex cepstrum vocoder for spoofing phase-based synthetic speech detectors

SAS : A speaker verification spoofing database containing diverse attacks

Sesli̇ yanıt si̇stemi̇ çaǧrı akışında di̇lbi̇lgi̇si̇ tabanlı Türkçe konuşma tanıma si̇stemi̇ tanıtımı

A small footprint hybrid statistical and unit selection text-to-speech synthesis system for Turkish

A small footprint hybrid statistical/unit selection text-to-speech synthesis system for agglutinative languages

Spoofing attacks to i-vector based voice verification systems using statistical speech synthesis with additive noise and countermeasure

Spoofing voice verification systems with statistical speech synthesis using limited adaptation data

Browse

My Account

Browsing Faculty of Engineering by Author "Demiroğlu, Cenk"

Hybrid nearest-neighbor/cluster adaptive training for rapid speaker adaptation in statistical speech synthesis systems ﻿

A hybrid statistical/unit-selection text-to-speech synthesis system for morphologically rich languages ﻿

Hybrid statistical/unit-selection Turkish speech synthesis using suffix units ﻿

Konuşmacı aradeğerlemeli SMM tabanlı metinden konuşma sentezleme si̇stemi ﻿

LIG at MediaEval 2015 multimodal person discovery in broadcast TV task ﻿

Multi-lingual depression-level assessment from conversational speech using acoustic and text features ﻿

NatiQ: An end-to-end text-to-speech system for arabic ﻿

Natural language features for detection of Alzheimer's disease in conversational speech ﻿

Nearest neighbor approach in speaker adaptation for HMM-based speech synthesis ﻿

OCR-aided person annotation and label propagation for speaker modeling in TV shows ﻿

ÖZÜ konuşmacı doğrulama sisteminin çok sınıflı senaryoda NIST 2010 veritabanı ile başarımı ﻿

Parkinson’s disease diagnosis using machine learning and voice ﻿

Performance of the OZU speaker verification systems with the NIST SRE 2010 data in a multi-class scenario ﻿

Postprocessing synthetic speech with a complex cepstrum vocoder for spoofing phase-based synthetic speech detectors ﻿

SAS : A speaker verification spoofing database containing diverse attacks ﻿

Sesli̇ yanıt si̇stemi̇ çaǧrı akışında di̇lbi̇lgi̇si̇ tabanlı Türkçe konuşma tanıma si̇stemi̇ tanıtımı ﻿

A small footprint hybrid statistical and unit selection text-to-speech synthesis system for Turkish ﻿

A small footprint hybrid statistical/unit selection text-to-speech synthesis system for agglutinative languages ﻿

Spoofing attacks to i-vector based voice verification systems using statistical speech synthesis with additive noise and countermeasure ﻿

Spoofing voice verification systems with statistical speech synthesis using limited adaptation data ﻿

Hybrid nearest-neighbor/cluster adaptive training for rapid speaker adaptation in statistical speech synthesis systems

A hybrid statistical/unit-selection text-to-speech synthesis system for morphologically rich languages

Hybrid statistical/unit-selection Turkish speech synthesis using suffix units

Konuşmacı aradeğerlemeli SMM tabanlı metinden konuşma sentezleme si̇stemi

LIG at MediaEval 2015 multimodal person discovery in broadcast TV task

Multi-lingual depression-level assessment from conversational speech using acoustic and text features

NatiQ: An end-to-end text-to-speech system for arabic

Natural language features for detection of Alzheimer's disease in conversational speech

Nearest neighbor approach in speaker adaptation for HMM-based speech synthesis

OCR-aided person annotation and label propagation for speaker modeling in TV shows

ÖZÜ konuşmacı doğrulama sisteminin çok sınıflı senaryoda NIST 2010 veritabanı ile başarımı

Parkinson’s disease diagnosis using machine learning and voice

Performance of the OZU speaker verification systems with the NIST SRE 2010 data in a multi-class scenario

Postprocessing synthetic speech with a complex cepstrum vocoder for spoofing phase-based synthetic speech detectors

SAS : A speaker verification spoofing database containing diverse attacks

Sesli̇ yanıt si̇stemi̇ çaǧrı akışında di̇lbi̇lgi̇si̇ tabanlı Türkçe konuşma tanıma si̇stemi̇ tanıtımı

A small footprint hybrid statistical and unit selection text-to-speech synthesis system for Turkish

A small footprint hybrid statistical/unit selection text-to-speech synthesis system for agglutinative languages

Spoofing attacks to i-vector based voice verification systems using statistical speech synthesis with additive noise and countermeasure

Spoofing voice verification systems with statistical speech synthesis using limited adaptation data