Emotions Classification from Speech with Deep Learning

被引:0
|
作者
Chowanda, Andry [1 ]
Muliono, Yohan [2 ]
机构
[1] Bina Nusantara Univ, Sch Comp Sci, Comp Sci Dept, Jakarta 11480, Indonesia
[2] Bina Nusantara Univ, Sch Comp Sci, Comp Sci Dept, Cyber Secur Program, Jakarta 11480, Indonesia
关键词
Emotions recognition; speech modality; temporal information; affective system; NEURAL-NETWORK;
D O I
10.14569/IJACSA.2022.0130490
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Emotions are the essential parts that convey meaning to the interlocutors during social interactions. Hence, recognising emotions is paramount in building a good and natural affective system that can naturally interact with the human interlocutors. However, recognising emotions from social interactions require temporal information in order to classify the emotions correctly. This research aims to propose an architecture that extracts temporal information using the Temporal model of Convolutional Neural Network (CNN) and combined with the Long Short Term Memory (LSTM) architecture from the Speech modality. Several combinations and settings of the architectures were explored and presented in the paper. The results show that the best classifier achieved by the model trained with four layers of CNN combined with one layer of Bidirectional LSTM. Furthermore, the model was trained with an augmented training dataset with seven times more data than the original training dataset. The best model resulted in 94.25%, 57.07%, 0.2577 and 1.1678 for training accuracy, validation accuracy, training loss and validation loss, respectively. Moreover, Neutral (Calm) and Happy are the easiest classes to be recognised, while Angry is the hardest to be classified.
引用
收藏
页码:777 / 781
页数:5
相关论文
共 50 条
  • [21] Deep learning for Depression Recognition from Speech
    Tian, Han
    Zhu, Zhang
    Jing, Xu
    MOBILE NETWORKS & APPLICATIONS, 2023, 29 (4): : 1212 - 1227
  • [22] Voice disorder classification using speech enhancement and deep learning models
    Chaiani, Mounira
    Selouani, Sid Ahmed
    Boudraa, Malika
    Yakoub, Mohammed Sidi
    BIOCYBERNETICS AND BIOMEDICAL ENGINEERING, 2022, 42 (02) : 463 - 480
  • [23] Implementation of Hybrid Deep Reinforcement Learning Technique for Speech Signal Classification
    Gayathri R.
    Rani K.S.S.
    Computer Systems Science and Engineering, 2023, 46 (01): : 43 - 56
  • [24] Classification of Spoken English Accents Using Deep Learning and Speech Analysis
    Al-Jumaili, Zaid
    Bassiouny, Tarek
    Alanezi, Ahmad
    Khan, Wasiq
    Al-Jumeily, Dhiya
    Hussain, Abir Jaafar
    INTELLIGENT COMPUTING METHODOLOGIES, PT III, 2022, 13395 : 277 - 287
  • [25] Deep Learning for Hate Speech Intensity Analysis: DistilBERT Classification Algorithm
    Riyadi, Slamet
    Masyhur, Ahmad Musthafa
    Andriyani, Annisa Divayu
    2024 IEEE SYMPOSIUM ON INDUSTRIAL ELECTRONICS AND APPLICATIONS, ISIEA 2024, 2024,
  • [26] Speech Based Multiple Emotion Classification Model Using Deep Learning
    Patneedi, Shakti Swaroop
    Kumari, Nandini
    ADVANCES IN COMPUTING AND DATA SCIENCES, PT I, 2021, 1440 : 648 - 659
  • [27] A Robust Deep Transfer Learning Model for Accurate Speech Emotion Classification
    Akinpelu, Samson
    Viriri, Serestina
    ADVANCES IN VISUAL COMPUTING, ISVC 2022, PT II, 2022, 13599 : 419 - 430
  • [28] Non-Audible Speech Classification Using Deep Learning Approaches
    Fernandes, Rommel
    Huang, Lei
    Vejarano, Gustavo
    2019 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2019), 2019, : 630 - 634
  • [29] Inner Speech Classification using EEG Signals: A Deep Learning Approach
    Van den Berg, Bram
    Van Donkelaar, Sander
    Alimardani, Maryam
    PROCEEDINGS OF THE 2021 IEEE INTERNATIONAL CONFERENCE ON HUMAN-MACHINE SYSTEMS (ICHMS), 2021, : 258 - 261
  • [30] EmoSense: Automatically Sensing Emotions From Speech By Multi-way Classification
    Reddy, V. Ramu
    Viraraghavan, Venkata Subramanian
    2018 40TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2018, : 4987 - 4990