Speech Emotion Recognition using Convolution Neural Networks and Deep Stride Convolutional Neural Networks

Cited by: 16
Authors
Wani, Taiba Majid [1]
Gunawan, Teddy Surya [2, 3]
Qadri, Syed Asif Ahmad [1]
Mansor, Hasmah [1]
Kartiwi, Mira [4]
Ismail, Nanang [5]
Affiliations
[1] Int Islamic Univ Malaysia, Elect & Comp Eng Dept, Kuala Lumpur, Malaysia
[2] IIUM, ECE Dept, Kuala Lumpur, Malaysia
[3] Univ Potensi Utama, FTIK, Medan City, Indonesia
[4] Int Islamic Univ Malaysia, Informat Syst Dept, Kuala Lumpur, Malaysia
[5] UIN Sunan Gunung Djati, Dept Elect Engn, Bandung, Indonesia
Keywords
speech emotion recognition; spectrogram; strides; convolutional neural network (CNN); deep stride convolutional neural network (DSCNN)
DOI
10.1109/icwt50448.2020.9243622
Chinese Library Classification (CLC) number
TN [Electronic technology, communication technology]
Discipline classification code
0809
Abstract
A variety of techniques have been presented in the area of Speech Emotion Recognition (SER), where the main focus is to recognize the salient discriminants and useful features of speech signals. These features are then classified to recognize the specific emotion of a speaker. In recent times, deep learning techniques have emerged as a breakthrough in speech emotion recognition for detecting and classifying emotions. In this paper, we modify a recently developed convolutional neural network architecture, the Deep Stride Convolutional Neural Network (DSCNN), by using fewer convolutional layers to increase computational speed while still maintaining accuracy. In addition, we trained a state-of-the-art CNN model and the proposed DSCNN on spectrograms generated from the SAVEE speech emotion dataset. For the evaluation, four emotions (angry, happy, neutral, and sad) were considered. Evaluation results show that the proposed DSCNN architecture, with a prediction accuracy of 87.8%, outperforms the CNN, which reaches 79.4% accuracy.
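The keyword "strides" in this record can be illustrated with a short output-size calculation. The sketch below is not the authors' network: it assumes that a stride-based CNN downsamples its feature maps through the stride of the convolutions rather than through separate pooling layers, which the abstract itself does not state. The 128-pixel spectrogram side, the kernel sizes, and the function names `conv_output_size` and `stack_output_size` are hypothetical and chosen only for illustration.

```python
def conv_output_size(size, kernel, stride=1, padding=0):
    """Spatial output length of one convolution or pooling layer."""
    return (size + 2 * padding - kernel) // stride + 1


def stack_output_size(size, layers):
    """Apply (kernel, stride, padding) layers in order; return the final length."""
    for kernel, stride, padding in layers:
        size = conv_output_size(size, kernel, stride, padding)
    return size


# Hypothetical layer shapes, not taken from the paper.
# Three 3x3 convolutions that downsample with stride 2.
strided = [(3, 2, 1), (3, 2, 1), (3, 2, 1)]
# Three stride-1 3x3 convolutions, each followed by 2x2 pooling with stride 2.
pooled = [(3, 1, 1), (2, 2, 0)] * 3

# Hypothetical 128-pixel spectrogram side length.
print(stack_output_size(128, strided))
print(stack_output_size(128, pooled))
```

Under these assumptions both stacks reduce the input side by the same factor, but the strided stack does so with three layers instead of six, which is one plausible reading of how using strides and fewer layers could lower the computational cost.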
Pages: 6
Related papers
50 in total
  • [1] Speech emotion recognition with deep convolutional neural networks
    Issa, Dias
    Demirci, M. Fatih
    Yazici, Adnan
    [J]. BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2020, 59 (59)
  • [2] FSER: Deep Convolutional Neural Networks for Speech Emotion Recognition
    Dossou, Bonaventure F. P.
    Gbenou, Yeno K. S.
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 3526 - 3531
  • [3] SPEECH EMOTION RECOGNITION USING QUATERNION CONVOLUTIONAL NEURAL NETWORKS
    Muppidi, Aneesh
    Radfar, Martin
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6309 - 6313
  • [4] Speech Emotion Recognition using Convolutional and Recurrent Neural Networks
    Lim, Wootaek
    Jang, Daeyoung
    Lee, Taejin
    [J]. 2016 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2016,
  • [5] Deep Convolutional Neural Networks for Feature Extraction in Speech Emotion Recognition
    Heracleous, Panikos
    Mohammad, Yasser
    Yoneyama, Akio
    [J]. HUMAN-COMPUTER INTERACTION. RECOGNITION AND INTERACTION TECHNOLOGIES, HCI 2019, PT II, 2019, 11567 : 117 - 132
  • [6] Improvement on Speech Emotion Recognition Based on Deep Convolutional Neural Networks
    Niu, Yafeng
    Zou, Dongsheng
    Niu, Yadong
    He, Zhongshi
    Tan, Hua
    [J]. PROCEEDINGS OF 2018 INTERNATIONAL CONFERENCE ON COMPUTING AND ARTIFICIAL INTELLIGENCE (ICCAI 2018), 2018, : 13 - 18
  • [7] Speech Emotion Recognition Using Convolution Neural Networks and Multi-Head Convolutional Transformer
    Ullah, Rizwan
    Asif, Muhammad
    Shah, Wahab Ali
    Anjam, Fakhar
    Ullah, Ibrar
    Khurshaid, Tahir
    Wuttisittikulkij, Lunchakorn
    Shah, Shashi
    Ali, Syed Mansoor
    Alibakhshikenari, Mohammad
    [J]. SENSORS, 2023, 23 (13)
  • [8] Speech Emotion Recognition and Deep Learning: An Extensive Validation Using Convolutional Neural Networks
    Ri, Francesco Ardan Dal
    Ciardi, Fabio Cifariello
    Conci, Nicola
    [J]. IEEE ACCESS, 2023, 11 : 116638 - 116649
  • [9] Music emotion recognition using deep convolutional neural networks
    Li, Ting
    [J]. JOURNAL OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING, 2024, 24 (4-5) : 3063 - 3078
  • [10] Continuous Speech Emotion Recognition with Convolutional Neural Networks
    Vryzas, Nikolaos
    Vrysis, Lazaros
    Matsiola, Maria
    Kotsakis, Rigas
    Dimoulas, Charalampos
    Kalliris, George
    [J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2020, 68 (1-2) : 14 - 24