Emotion Recognition in Speech with Deep Learning Architectures

被引：3

作者：

Erdal, Mehmet ^{[1
]}

Kaechele, Markus ^{[1
]}

Schwenker, Friedhelm ^{[1
]}

机构：

[1] Univ Ulm, Inst Neural Informat Proc, D-89081 Ulm, Germany

来源：

ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION | 2016年 / 9896卷

关键词：

D O I：

10.1007/978-3-319-46182-3_25

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep neural networks (DNNs) became very popular for learning abstract high-level representations from raw data. This lead to improvements in several classification tasks including emotion recognition in speech. Besides the use as feature learner a DNN can also be used as classifier. In any case it is a challenge to determine the number of hidden layers and neurons in each layer for such networks. In this work the architecture of a DNN is determined by a restricted grid-search with the aim to recognize emotion in human speech. Because speech signals are essentially time series the data will be transformed in an appropriate format to use it as input for deep feed forward neural networks without losing much time dependent information. Furthermore the Elman-Net will be examined. The results shows that by maintaining time dependent information in the data better classification accuracies can be achieved with deep architectures.

引用

页码：298 / 311

页数：14

共 50 条

[1] Evaluating deep learning architectures for Speech Emotion Recognition
Fayek, Haytham M.
Lech, Margaret
Cavedon, Lawrence
[J]. NEURAL NETWORKS, 2017, 92 : 60 - 68
[2] Analysis of Deep Learning Architectures for Cross-corpus Speech Emotion Recognition
Parry, Jack
Palaz, Dimitri
Clarke, Georgia
Lecomte, Pauline
Mead, Rebecca
Berger, Michael
Hofer, Gregor
[J]. INTERSPEECH 2019, 2019, : 1656 - 1660
[3] Speech Emotion Recognition with Deep Learning
Harar, Pavol
Burget, Radim
Dutta, Malay Kishore
[J]. 2017 4TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2017, : 137 - 140
[4] Speech Emotion Recognition Using Deep Learning
Alagusundari, N.
Anuradha, R.
[J]. ARTIFICIAL INTELLIGENCE: THEORY AND APPLICATIONS, VOL 1, AITA 2023, 2024, 843 : 313 - 325
[5] Speech Emotion Recognition Using Deep Learning
Ahmed, Waqar
Riaz, Sana
Iftikhar, Khunsa
Konur, Savas
[J]. ARTIFICIAL INTELLIGENCE XL, AI 2023, 2023, 14381 : 191 - 197
[6] Multimodal Emotion Recognition using Deep Learning Architectures
Ranganathan, Hiranmayi
Chakraborty, Shayok
Panchanathan, Sethuraman
[J]. 2016 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2016), 2016,
[7] Efficient Feature-Aware Hybrid Model of Deep Learning Architectures for Speech Emotion Recognition
Ezz-Eldin, Mai
Khalaf, Ashraf A. M.
Hamed, Hesham F. A.
Hussein, Aziza, I
[J]. IEEE ACCESS, 2021, 9 : 19999 - 20011
[8] On Comparison of Deep Learning Architectures for Distant Speech Recognition
Sustika, Rika
Yuliani, Asri R.
Zaenudin, Efendi
Pardede, Hilman F.
[J]. 2017 2ND INTERNATIONAL CONFERENCES ON INFORMATION TECHNOLOGY, INFORMATION SYSTEMS AND ELECTRICAL ENGINEERING (ICITISEE): OPPORTUNITIES AND CHALLENGES ON BIG DATA FUTURE INNOVATION, 2017, : 17 - 21
[9] Ensemble deep learning with HuBERT for speech emotion recognition
Yang, Janghoon
[J]. 2023 IEEE 17TH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING, ICSC, 2023, : 153 - 154
[10] Survey of Deep Representation Learning for Speech Emotion Recognition
Latif, Siddique
Rana, Rajib
Khalifa, Sara
Jurdak, Raja
Qadir, Junaid
Schuller, Bjorn
[J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (02) : 1634 - 1654

← 1 2 3 4 5 →