A Study on Speech Emotion Recognition Using a Deep Neural Network

被引:14
|
作者
Lee, Kyong Hee [1 ]
Choi, Hyun Kyun [1 ]
Jang, Byung Tae [1 ]
Kim, Do Hyun [1 ]
机构
[1] ETRI, Intelligent Robot Res Div, Daejeon, South Korea
关键词
speech; emotion; deep neural network; recognition; preprocessing;
D O I
10.1109/ictc46691.2019.8939830
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
When using voice signals as input to a deep learning network, there may be myriad features depending on the method and purpose of extracting the voice signal features. Therefore, extraction of appropriate features should be conducted. In this study, verbal features necessary for speech emotion recognition (SER) and preprocessing features for a deep neural network are described in detail. We implemented various preprocessing methods using voice features. Also, a Keras-based deep neural network using Python libraries was implemented. With these features, we could obtain a test accuracy of 68.5 % using the deep neural network (DNN). As a result, we confirmed that the proposed DNN improved an accuracy by 30.1 % compared to a support vector machine (SVM).
引用
收藏
页码:1162 / 1165
页数:4
相关论文
共 50 条
  • [1] Active Learning for Speech Emotion Recognition Using Deep Neural Network
    Abdelwahab, Mohammed
    Busso, Carlos
    [J]. 2019 8TH INTERNATIONAL CONFERENCE ON AFFECTIVE COMPUTING AND INTELLIGENT INTERACTION (ACII), 2019,
  • [2] Speech Emotion Recognition Based on Deep Neural Network
    Zhu, Zijiang
    Hu, Yi
    Li, Junshan
    Li, Jianjun
    Wang, Junhua
    [J]. BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2020, 126 : 154 - 154
  • [3] Speech Emotion Recognition Using Generative Adversarial Network and Deep Convolutional Neural Network
    Bhangale, Kishor
    Kothandaraman, Mohanaprasad
    [J]. CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2024, 43 (04) : 2341 - 2384
  • [4] Speech Emotion Recognition Using Generative Adversarial Network and Deep Convolutional Neural Network
    Kishor Bhangale
    Mohanaprasad Kothandaraman
    [J]. Circuits, Systems, and Signal Processing, 2024, 43 : 2341 - 2384
  • [5] Speech Emotion Recognition Using Deep Neural Network and Extreme Learning Machine
    Han, Kun
    Yu, Dong
    Tashev, Ivan
    [J]. 15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 223 - 227
  • [6] SPEECH EMOTION RECOGNITION USING DEEP NEURAL NETWORK CONSIDERING VERBAL AND NONVERBAL SPEECH SOUNDS
    Huang, Kun-Yi
    Wu, Chung-Hsien
    Hong, Qian-Bei
    Su, Ming-Hsiang
    Chen, Yi-Hsuan
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 5866 - 5870
  • [7] Transfer Learning of Deep Neural Network for Speech Emotion Recognition
    Huang, Ying
    Hu, Mingqing
    Yu, Xianguo
    Wang, Tao
    Yang, Chen
    [J]. PATTERN RECOGNITION (CCPR 2016), PT II, 2016, 663 : 721 - 729
  • [8] Improvement of Speech Emotion Recognition by Deep Convolutional Neural Network and Speech Features
    Mohanty, Aniruddha
    Cherukuri, Ravindranath C.
    Prusty, Alok Ranjan
    [J]. THIRD CONGRESS ON INTELLIGENT SYSTEMS, CIS 2022, VOL 1, 2023, 608 : 117 - 129
  • [9] Speech Emotion Recognition Using Deep Convolutional Neural Network and Simple Recurrent Unit
    Jiang, Pengxu
    Fu, Hongliang
    Tao, Huawei
    [J]. ENGINEERING LETTERS, 2019, 27 (04) : 901 - 906
  • [10] Study on emotion recognition and companion Chatbot using deep neural network
    Ming-Che Lee
    Shu-Yin Chiang
    Sheng-Cheng Yeh
    Ting-Feng Wen
    [J]. Multimedia Tools and Applications, 2020, 79 : 19629 - 19657