Speech emotion recognition based on Bi-directional LSTM architecture and deep belief networks

被引:14
|
作者
Senthilkumar, N. [1 ]
Karpakam, S. [2 ]
Devi, M. Gayathri [3 ]
Balakumaresan, R. [4 ]
Dhilipkumar, P. [5 ]
机构
[1] Dr NGP Inst Technol, Dept ECE, Coimbatore, India
[2] Sri Eshwar Coll Engn, Dept ECE, Coimbatore, India
[3] Muthayammal Engn Coll, Departmentof Biomed Engn, Rasipuram, India
[4] PSNA Coll Engn & Technol, Dept ECE, Dindigul, India
[5] Sri Ranganathar Inst Engn & Technol, Dept ECE, Coimbatore, India
关键词
CNN; Deep learning; Emotion; LSTM; Radial basis function; Sequence selection;
D O I
10.1016/j.matpr.2021.12.246
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Machine learning algorithms are often not able to recognize the speech emotion of the individuals. The Speech Emotion Recognition (SER) plays a major role in real-time applications that involve analyzing the speech emotions. It can be used in various scenarios such as emergency centers and human behavior assessments. In this work, we design the architecture for analyzing similarity in clusters, which is based on a key sequence selection procedure. A sequence of information is transformed into a spectrogram with the advantage of the STRFT algorithm. The subsequent result is a discriminative and salient feature extraction program. We have also added new features to the CNN to improve its recognition performance. Instead of the whole utterance, the key segments are processed separately to diminish the structure complexity. The proposed system is compared to different standard datasets for recognizing different kinds of objects. It is evaluated over different time periods and achieves better recognition accuracy. The proposed SER model is proven to be robust and reliable when compared with latest state-of-the-art methods. Copyright (c) 2022 Elsevier Ltd. All rights reserved. Selection and peer-review under responsibility of the scientific committee of the International Conference on Innovation and Application in Science and Technology.
引用
收藏
页码:2180 / 2184
页数:5
相关论文
共 50 条
  • [1] Speech emotion recognition based on bi-directional acoustic-articulatory conversion
    Li, Haifeng
    Zhang, Xueying
    Duan, Shufei
    Liang, Huizhi
    KNOWLEDGE-BASED SYSTEMS, 2024, 299
  • [2] Bi-directional lstm network speech-to-gesture generation using bi-directional lstm network
    Kaneko N.
    Takeuchi K.
    Hasegawa D.
    Shirakawa S.
    Sakuta H.
    Sumi K.
    Transactions of the Japanese Society for Artificial Intelligence, 2019, 34 (06):
  • [3] A Deep Learning Method Based Self-Attention and Bi-directional LSTM in Emotion Classification
    Fei, Rong
    Zhu, Yuanbo
    Yao, Quanzhu
    Xu, Qingzheng
    Hu, Bo
    JOURNAL OF INTERNET TECHNOLOGY, 2020, 21 (05): : 1447 - 1461
  • [4] Speech Emotion Recognition Based on Deep Belief Network
    Shi, Peng
    2018 IEEE 15TH INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL (ICNSC), 2018,
  • [5] Automatic Bus Stop Detection with Deep Neural Networks and Bi-directional LSTM
    Piriyataravet, Jitpinun
    Kumwilaisak, Wuttipong
    Chinrungrueng, Jatuporn
    2021 SECOND INTERNATIONAL SYMPOSIUM ON INSTRUMENTATION, CONTROL, ARTIFICIAL INTELLIGENCE, AND ROBOTICS (ICA-SYMP), 2021, : 48 - 51
  • [6] Speech-to-Gesture Generation: A Challenge in Deep Learning Approach with Bi-Directional LSTM
    Takeuchi, Kenta
    Hasegawa, Dai
    Shirakawa, Shinichi
    Kaneko, Naoshi
    Sakuta, Hiroshi
    Sumi, Kazuhiko
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON HUMAN AGENT INTERACTION (HAI'17), 2017, : 365 - 369
  • [7] Speech emotion recognition based on deep belief networks and wavelet packet cepstral coefficients
    Huang Y.
    Wu A.
    Zhang G.
    Li Y.
    1600, UK Simulation Society, Clifton Lane, Nottingham, NG11 8NS, United Kingdom (17): : 28.1 - 28.5
  • [8] A semantic enhanced topic model based on bi-directional LSTM networks
    Gao, Wang
    Yang, Zhi-Feng
    Wang, Hai
    Zhang, Fan
    Fang, Yuan
    Journal of Computers (Taiwan), 2019, 30 (06): : 60 - 72
  • [9] Action Recognition in Video Sequences using Deep Bi-Directional LSTM With CNN Features
    Ullah, Amin
    Ahmad, Jamil
    Muhammad, Khan
    Sajjad, Muhammad
    Baik, Sung Wook
    IEEE ACCESS, 2018, 6 : 1155 - 1166
  • [10] Intelligent irrigation scheduling scheme based on deep bi-directional LSTM technique
    R. Jenitha
    K. Rajesh
    International Journal of Environmental Science and Technology, 2024, 21 : 1905 - 1922