Publication:
Multi-lingual depression-level assessment from conversational speech using acoustic and text features

dc.contributor.authorÖzkanca, Yasin Serdar
dc.contributor.authorDemiroğlu, Cenk
dc.contributor.authorBesirli, A.
dc.contributor.authorÇelik, S.
dc.contributor.departmentElectrical & Electronics Engineering
dc.contributor.ozuauthorDEMİROĞLU, Cenk
dc.contributor.ozugradstudentÖzkanca, Yasin Serdar
dc.date.accessioned2020-05-18T22:08:31Z
dc.date.available2020-05-18T22:08:31Z
dc.date.issued2018
dc.description.abstractDepression is a common mental health problem around the world with a large burden on economies, well-being, hence productivity, of individuals. Its early diagnosis and treatment are critical to reduce the costs and even save lives. One key aspect to achieve that goal is to use voice technologies and monitor depression remotely and relatively inexpensively using automated agents. Although there has been efforts to automatically assess depression levels from audiovisual features, use of transcriptions along with the acoustic features has emerged as a more recent research venue. Moreover, difficulty in data collection and the limited amounts of data available for research are also challenges that are hampering the success of the algorithms. One of the novel contributions in this paper is to exploit the databases from multiple languages for feature selection. Since a large number of features can be extracted from speech and given the small amounts of training data available, effective data selection is critical for success. Our proposed multi-lingual method was effective at selecting better features and significantly improved the depression assessment accuracy. We also use text-based features for assessment and propose a novel strategy to fuse the text- and speech-based classifiers which further boosted the performance.en_US
dc.description.versionPublisher version
dc.identifier.doi10.21437/Interspeech.2018-2169en_US
dc.identifier.endpage3402en_US
dc.identifier.isbn978-1-5108-7221-9
dc.identifier.issn2308-457Xen_US
dc.identifier.scopus2-s2.0-85055003235
dc.identifier.startpage3398en_US
dc.identifier.urihttp://hdl.handle.net/10679/6575
dc.identifier.urihttps://doi.org/10.21437/Interspeech.2018-2169
dc.identifier.wos000465363900709
dc.language.isoengen_US
dc.publicationstatusPublisheden_US
dc.publisherInternational Speech Communication Associationen_US
dc.relation.ispartofProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH
dc.relation.publicationcategoryInternational
dc.rightsopenAccess
dc.subject.keywordsDepression estimationen_US
dc.subject.keywordsAcoustic featuresen_US
dc.subject.keywordsFeature selectionen_US
dc.subject.keywordsMulti-lingual applicationsen_US
dc.titleMulti-lingual depression-level assessment from conversational speech using acoustic and text featuresen_US
dc.typeconferenceObjecten_US
dspace.entity.typePublication
relation.isOrgUnitOfPublication7b58c5c4-dccc-40a3-aaf2-9b209113b763
relation.isOrgUnitOfPublication.latestForDiscovery7b58c5c4-dccc-40a3-aaf2-9b209113b763

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Multi-lingual depression-level assessment from conversational speech using acoustic and text features.pdf
Size:
273.93 KB
Format:
Adobe Portable Document Format
Description:

License bundle

Now showing 1 - 1 of 1
Placeholder
Name:
license.txt
Size:
1.45 KB
Format:
Item-specific license agreed upon to submission
Description:

Collections