Speech Emotion Recognition Using Machine Learning: A Comparative Analysis

被引:0
|
作者
Nath S. [1 ]
Shahi A.K. [1 ]
Martin T. [1 ]
Choudhury N. [1 ]
Mandal R. [1 ]
机构
[1] School of Technology, Assam Don Bosco University, Guwahati
关键词
Bi-LSTM; Fuzzy; IEMOCAP; RF; SAVEE; SER;
D O I
10.1007/s42979-024-02656-0
中图分类号
学科分类号
摘要
It is possible to identify emotions based on a person's speech. The field of research focusing on expressing emotions through voice is continuously evolving. This study utilizes the SAVEE and IEMOCAP datasets to explore Speech Emotion Recognition. The SAVEE dataset consists of seven emotions, while 4 out of 11 emotions are considered from the IEMOCAP dataset. The features ZCR, MFCC, F0, and RMS are extracted from the raw audio files, and their means are calculated which are fed as input for training the models. The study presents a comparative analysis of emotion detection on both datasets, employing the models RNN, LSTM, Bi-LSTM, RF, Rotation Forest, and Fuzzy. The RF and Bi-LSTM models achieve highest accuracies of 76 and 72%, respectively, on the SAVEE dataset, when compared to other trained models. The fuzzy and Rotation Forest models are implemented which can be improvised with further optimization techniques. Additionally, a diagnostic User Interface is developed for analyzing audio, loading datasets, extracting features, training models, and classifying human emotions from audio using the trained models. © The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd 2024.
引用
收藏
相关论文
共 50 条
  • [1] Emotion Recognition On Speech Signals Using Machine Learning
    Ghai, Mohan
    Lal, Shamit
    Duggal, Shivam
    Manik, Shrey
    [J]. PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON BIG DATA ANALYTICS AND COMPUTATIONAL INTELLIGENCE (ICBDAC), 2017, : 34 - 39
  • [2] Speech based Emotion Recognition using Machine Learning
    Deshmukh, Girija
    Gaonkar, Apurva
    Golwalkar, Gauri
    Kulkarni, Sukanya
    [J]. PROCEEDINGS OF THE 2019 3RD INTERNATIONAL CONFERENCE ON COMPUTING METHODOLOGIES AND COMMUNICATION (ICCMC 2019), 2019, : 812 - 817
  • [3] Distinctive Approach for Speech Emotion Recognition Using Machine Learning
    Singh, Yogyata
    Neetu
    Rani, Shikha
    [J]. MACHINE LEARNING, IMAGE PROCESSING, NETWORK SECURITY AND DATA SCIENCES, MIND 2022, PT I, 2022, 1762 : 39 - 51
  • [4] Speech emotion recognition using machine learning - A systematic review
    Madanian, Samaneh
    Chen, Talen
    Adeleye, Olayinka
    Templeton, John Michael
    Poellabauer, Christian
    Parry, Dave
    Schneidere, Sandra L.
    [J]. INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 20
  • [5] Speech emotion recognition of Hindi speech using statistical and machine learning techniques
    Agrawal, Akshat
    Jain, Anurag
    [J]. JOURNAL OF INTERDISCIPLINARY MATHEMATICS, 2020, 23 (01) : 311 - 319
  • [6] Machine Learning Approach for Emotion Recognition in Speech
    Gjoreski, Martin
    Gjoreski, Hristijan
    Kulakov, Andrea
    [J]. INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2014, 38 (04): : 377 - 383
  • [7] Comparative Analysis of Windows for Speech Emotion Recognition Using CNN
    Teixeira, Felipe L.
    Soares, Salviano Pinto
    Abreu, J. L. Pio
    Oliveira, Paulo M.
    Teixeira, Joao P.
    [J]. OPTIMIZATION, LEARNING ALGORITHMS AND APPLICATIONS, PT I, OL2A 2023, 2024, 1981 : 233 - 248
  • [8] Applying Machine Learning Techniques for Speech Emotion Recognition
    Tarunika, K.
    Pradeeba, R. B.
    Aruna, P.
    [J]. 2018 9TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2018,
  • [9] Speech emotion recognition for psychotherapy: an analysis of traditional machine learning and deep learning techniques
    Shah, Nidhi
    Sood, Kanika
    Arora, Jayraj
    [J]. 2023 IEEE 13TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE, CCWC, 2023, : 718 - 723
  • [10] Recognition of Emotion Intensities Using Machine Learning Algorithms: A Comparative Study
    Mehta, Dhwani
    Siddiqui, Mohammad Faridul Haque
    Javaid, Ahmad Y.
    [J]. SENSORS, 2019, 19 (08):