Improving automatic emotion recognition from speech signals

Bozkurt, E.; Erzin, E.; Eroğlu Erdem, Ç.; Erdem, Tanju

dc.contributor.author	Bozkurt, E.
dc.contributor.author	Erzin, E.
dc.contributor.author	Eroğlu Erdem, Ç.
dc.contributor.author	Erdem, Tanju
dc.contributor.department	Computer Science
dc.contributor.ozuauthor	ERDEM, Arif Tanju
dc.date.accessioned	2016-02-11T06:46:20Z
dc.date.available	2016-02-11T06:46:20Z
dc.date.issued	2009
dc.description.abstract	We present a speech-signal-driven emotion recognition system. Our system is trained and tested with the INTERSPEECH 2009 Emotion Challenge corpus, which includes spontaneous and emotionally rich recordings. The challenge includes classifier and feature sub-challenges with five-class and two-class classification problems. We investigate prosody-related, spectral and HMM-based features for the evaluation of emotion recognition with Gaussian mixture model (GMM) based classifiers. Spectral features consist of mel-frequency cepstral coefficients (MFCC), line spectral frequency (LSF) features and their derivatives, whereas prosody-related features consist of mean normalized values of pitch, first derivative of pitch and intensity. Unsupervised training of HMM structures is employed to define prosody-related temporal features for the emotion recognition problem. We also investigate data fusion of different features and decision fusion of different classifiers, which are not well studied in the emotion recognition framework. Experimental results of automatic emotion recognition with the INTERSPEECH 2009 Emotion Challenge corpus are presented.
dc.description.sponsorship	TÜBİTAK
dc.identifier.endpage	315
dc.identifier.isbn	978-1-61567-692-7
dc.identifier.scopus	2-s2.0-70450177656
dc.identifier.startpage	312
dc.identifier.uri	http://hdl.handle.net/10679/2007
dc.identifier.wos	000276842800076
dc.language.iso	eng
dc.publicationstatus	published
dc.publisher	International Speech Communication Association
dc.relation.ispartof	10th Annual Conference Of The International Speech Communication Association 2009 (INTERSPEECH 2009)
dc.relation.project	info:eu-repo/grantAgreement/TUBITAK/1001 - Araştırma/106E201
dc.relation.project	info:eu-repo/grantAgreement/TUBITAK/1001 - Araştırma/3070796
dc.relation.publicationcategory	International
dc.rights	restrictedAccess
dc.subject.keywords	Emotion recognition
dc.subject.keywords	Prosody modeling
dc.title	Improving automatic emotion recognition from speech signals
dc.type	conferenceObject
dc.type.subtype	Conference paper
dspace.entity.type	Publication
relation.isOrgUnitOfPublication	85662e71-2a61-492a-b407-df4d38ab90d7
relation.isOrgUnitOfPublication.latestForDiscovery	85662e71-2a61-492a-b407-df4d38ab90d7

Collections

Computer Science