Deep Learning for Emotional Speech Recognition

被引：0

作者：

Alhamada, M., I ^{[1
]}

Khalifa, O. O. ^{[1
]}

Abdalla, A. H.

机构：

[1] Int Islamic Univ Malaysia, Fac Engn, Elect & Comp Engn, Kuala Lumpur, Malaysia

来源：

PROCEEDINGS OF THE 7TH INTERNATIONAL CONFERENCE ON ELECTRONIC DEVICES, SYSTEMS AND APPLICATIONS (ICEDSA2020) | 2020年 / 2306卷

关键词：

D O I：

10.1063/5.0032381

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Emotion speech recognition is a developing field in machine learning. The main purpose of this field is to produce a convenient system that is able to effortlessly communicate and interact with humans. The reliability of the current speech emotion recognition systems is far from being achieved. However, this is a challenging task due to the gap between acoustic features and human emotions, which rely strongly on the discriminative acoustic features extracted for a given recognition task. The speech signals were process with information which is divided into two main categories, linguistic and paralinguistic; emotions belong to the latter tree. The aim of this work is to develop a system that can understand paralinguistic information for paramount better human-machine interactions. A different extracted features like MFCC as well as feature classifications methods like FIMM, GMM, LTSTM and ANN were used. In this paper, an improved architecture of CNN for speech emotion recognition were implemented. The main fmding that the proposed CNN model achieved 93.96% accuracy rate in detecting emotions.

引用

页数：10

共 50 条

[1] Deep Learning for Emotional Speech Recognition
Sanchez-Gutierrez, Maximo E.
Marcelo Albornoz, E.
Martinez-Licona, Fabiola
Leonardo Rufiner, H.
Goddard, John
[J]. PATTERN RECOGNITION, MCPR 2014, 2014, 8495 : 311 - +
[2] Deep Learning Analysis Models for Speech and Emotional Recognition
Wu, Jun
Zhu, Tianliang
Yu, Chengtian
Wang, Chunzhi
Zhou, Xianjing
Liu, Hu
[J]. 2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 1541 - 1545
[3] Emotional speech Recognition using CNN and Deep learning techniques
Hema, C.
Marquez, Fausto Pedro Garcia
[J]. APPLIED ACOUSTICS, 2023, 211
[4] Emotional Climate Recognition in Interactive Conversational Speech Using Deep Learning
Alhussein, Ghada
Alkhodari, Mohanad
Khandokher, Ahsan
Hadjileontiadis, Leontios J.
[J]. 2022 IEEE INTERNATIONAL CONFERENCE ON DIGITAL HEALTH (IEEE ICDH 2022), 2022, : 96 - 103
[5] Speech Emotion Recognition with Deep Learning
Harar, Pavol
Burget, Radim
Dutta, Malay Kishore
[J]. 2017 4TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2017, : 137 - 140
[6] Speech Recognition using Deep Learning
Lakkhanawannakun, Phoemporn
Noyunsan, Chaluemwut
[J]. 2019 34TH INTERNATIONAL TECHNICAL CONFERENCE ON CIRCUITS/SYSTEMS, COMPUTERS AND COMMUNICATIONS (ITC-CSCC 2019), 2019, : 514 - 517
[7] Emotional Speech Recognition Using Deep Neural Networks
Trinh Van, Loan
Dao Thi Le, Thuy
Le Xuan, Thanh
Castelli, Eric
[J]. SENSORS, 2022, 22 (04)
[8] An Analysis of Emotional Speech Recognition for Tamil Language Using Deep Learning Gate Recurrent Unit
Fernandes, Bennilo
Mannepalli, Kasiprasad
[J]. PERTANIKA JOURNAL OF SCIENCE AND TECHNOLOGY, 2021, 29 (03): : 1937 - 1961
[9] Learning Emotional Representations from Imbalanced Speech Data for Speech Emotion Recognition and Emotional Text-to-Speech
Wang, Shijun
Gudnason, Jon
Borth, Damian
[J]. INTERSPEECH 2023, 2023, : 351 - 355
[10] Korean speech recognition using deep learning
Lee, Suji
Han, Seokjin
Park, Sewon
Lee, Kyeongwon
Lee, Jaeyong
[J]. KOREAN JOURNAL OF APPLIED STATISTICS, 2019, 32 (02) : 213 - 227

← 1 2 3 4 5 →