Hybrid BBO_PSO and higher order spectral features for emotion and stress recognition from natural speech
dc.authorid | 0000-0002-8929-3473 | en_US |
dc.authorid | 0000-0003-1840-9958 | en_US |
dc.authorid | 0000-0001-7466-0368 | en_US |
dc.contributor.author | Yogesh, C. K. | |
dc.contributor.author | Hariharan, Muthusamy | |
dc.contributor.author | Ngadiran, Ruzelita | |
dc.contributor.author | Adom, A. H. | |
dc.contributor.author | Yaacob, Sazali | |
dc.contributor.author | Polat, Kemal | |
dc.date.accessioned | 2021-06-23T19:45:48Z | |
dc.date.available | 2021-06-23T19:45:48Z | |
dc.date.issued | 2017 | |
dc.department | BAİBÜ, Mühendislik Fakültesi, Elektrik Elektronik Mühendisliği Bölümü | en_US |
dc.description.abstract | The aim of the present study is to select a set of higher order spectral features for emotion/stress recognition system. 50 Bispectral (28 features) and Bicoherence (22 features) based higher order spectral features were extracted from speech signal and its glottal waveform. These features were combined with Inter-Speech 2010 features to further improve the recognition rates. Feature subset selection (FSS) was carried out in this proposed work with the objective of maximizing emotion recognition rate for subject independent with minimum features. The FSS contains two stages: Multi-cluster feature selection was adopted in Stage 1 to reduce feature space and identify relevant feature subset from Interspeech 2010 features. In Stage 2, Biogeography based optimization (BBO), Particle swarm optimization (PSO) and proposed BBO_PSO Hybrid optimization were performed to further reduce the dimension of feature space and identify the most relevant feature subset, which has higher discrimination ability to distinguish different emotional states. The proposed method was tested in three different databases: Berlin emotional speech database (BES), Surrey audio-visual expressed emotion database (SAVEE) and Speech under simulated and actual stress (SUSAS) simulated domain. The proposed feature set was evaluated with subject independent (SI), subject dependent (SD), gender dependent male (GD-male), gender dependent female (GD-female), text independent pairwise speech (TIDPS), and text independent multi-style speech (TIDMSS) experiments by using SVM and ELM classifiers. From the results obtained, it is evident that the proposed method attained accuracies of 93.25% (SI), 100% (SD), 93.75% (GD-male), and 97.58% (GD-female) for BES; 62.38% (SI) and 76.19% (SD) for SAVEE; and 90.09% (TIDMSS), 97.04% (TIDPS - Angryvs. Neutral), 98.89% (TIDPS - Lombard vs. Neutral), 99.07% (TIDPS - Loud vs. Neutral) for SUSAS. (c) 2017 Elsevier B.V. All rights reserved. | en_US |
dc.identifier.doi | 10.1016/j.asoc.2017.03.013 | |
dc.identifier.endpage | 232 | en_US |
dc.identifier.issn | 1568-4946 | |
dc.identifier.issn | 1872-9681 | |
dc.identifier.scopus | 2-s2.0-85016239937 | en_US |
dc.identifier.scopusquality | Q1 | en_US |
dc.identifier.startpage | 217 | en_US |
dc.identifier.uri | https://doi.org/10.1016/j.asoc.2017.03.013 | |
dc.identifier.uri | https://hdl.handle.net/20.500.12491/9212 | |
dc.identifier.volume | 56 | en_US |
dc.identifier.wos | WOS:000402364000017 | en_US |
dc.identifier.wosquality | Q1 | en_US |
dc.indekslendigikaynak | Web of Science | en_US |
dc.indekslendigikaynak | Scopus | en_US |
dc.institutionauthor | Polat, Kemal | |
dc.language.iso | en | en_US |
dc.publisher | Elsevier | en_US |
dc.relation.ispartof | Applied Soft Computing | en_US |
dc.relation.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | en_US |
dc.rights | info:eu-repo/semantics/closedAccess | en_US |
dc.subject | Speech Signals | en_US |
dc.subject | Feature Extraction | en_US |
dc.subject | Feature Selection and Emotion Recognition | en_US |
dc.title | Hybrid BBO_PSO and higher order spectral features for emotion and stress recognition from natural speech | en_US |
dc.type | Article | en_US |
Dosyalar
Orijinal paket
1 - 1 / 1
Küçük Resim Yok
- İsim:
- c-k-yogesh.pdf
- Boyut:
- 5.31 MB
- Biçim:
- Adobe Portable Document Format
- Açıklama:
- Tam Metin/Full Text