Deep Multilayer Perceptrons for Dimensional Speech Emotion Recognition

被引:0
|
作者
Atmaja, Bagus Tris [1 ,2 ]
Akagi, Masato [2 ]
机构
[1] Sepuluh Nopember Inst Technol, Surabaya, Indonesia
[2] Japan Adv Inst Sci & Technol, Nomi, Japan
关键词
Affective computing; speech emotion recognition; multilayer perceptrons; neural networks; dimensional emotion; DOMINANCE; AROUSAL;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Modern deep learning architectures are ordinarily performed in high performance computing facilities due to the large size of their input features and complexity of their models. This paper proposes traditional multilayer perceptrons (MLP) with deep layers and small input sizes to tackle this computation requirement limitation. This study compares a proposed deep MLP method to the more modern deep learning architectures with the same number of layers, batch size, and optimizer. The result shows that our proposed deep MLP outperformed modern deep learning architectures, i.e., LSTM and CNN, on the same number of layers and value of parameters. Both proposed and benchmark methods were optimized in the same way. The deep MLP exhibited the highest performance on both speaker-dependent and speaker-independent scenarios on IEMOCAP and MSP-IMPROV datasets.
引用
收藏
页码:325 / 331
页数:7
相关论文
共 50 条
  • [1] Singular Learning of Deep Multilayer Perceptrons for EEG-Based Emotion Recognition
    Guo, Weili
    Li, Guangyu
    Lu, Jianfeng
    Yang, Jian
    [J]. FRONTIERS IN COMPUTER SCIENCE, 2021, 3
  • [2] PERCEPTRONS AND MULTILAYER PERCEPTRONS IN SPEECH RECOGNITION - IMPROVEMENTS FROM TEMPORAL WARPING OF THE TRAINING MATERIAL
    KAMMERER, B
    KUPPER, W
    [J]. NEURAL NETWORKS FROM MODELS TO APPLICATIONS, 1989, : 531 - 540
  • [3] A Fast Learning Method for Multilayer Perceptrons in Automatic Speech Recognition Systems
    Cai, Chenghao
    Xu, Yanyan
    Ke, Dengfeng
    Su, Kaile
    [J]. JOURNAL OF ROBOTICS, 2015, 2015
  • [4] Propagation of Uncertainty through Multilayer Perceptrons for Robust Automatic Speech Recognition
    Astudillo, Ramon Fernandez
    da Silva Neto, Joao Paulo
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 468 - 471
  • [5] An approach for automatic speech recognition using multilayer perceptrons and acoustic segmentation
    Giurgiu, M
    Meciu, E
    [J]. COMBIO'96 - SUMMER WORKSHOP ON COMPUTATIONAL MODELLING, IMAGING AND VISUALIZATION IN BIOSCIENCES, 1996, 1996 (06): : 104 - 108
  • [6] Understanding the Representation and Computation of Multilayer Perceptrons: A Case Study in Speech Recognition
    Nagamine, Tasha
    Mesgarani, Nima
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [7] Speech Emotion Recognition with Deep Learning
    Harar, Pavol
    Burget, Radim
    Dutta, Malay Kishore
    [J]. 2017 4TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND INTEGRATED NETWORKS (SPIN), 2017, : 137 - 140
  • [8] Dimensional Speech Emotion Recognition Review
    Li, Hai-Feng
    Chen, Jing
    Ma, Lin
    Bo, Hong-Jian
    Xu, Cong
    Li, Hong-Wei
    [J]. Ruan Jian Xue Bao/Journal of Software, 2020, 31 (08): : 2465 - 2491
  • [9] Deep scattering network for speech emotion recognition
    Singh, Premjeet
    Saha, Goutam
    Sahidullah, Md
    [J]. 29TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2021), 2021, : 131 - 135
  • [10] Emotion Recognition in Speech with Deep Learning Architectures
    Erdal, Mehmet
    Kaechele, Markus
    Schwenker, Friedhelm
    [J]. ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, 2016, 9896 : 298 - 311