Revolutionizing Speech Emotion Recognition: A Novel Hilbert Curve Approach for Two-Dimensional Representation and Convolutional Neural Network Classification

Cited by: 0
Authors
Tyagi, Suryakant [1]
Szenasi, Sandor [2, 3]
Affiliations
[1] Obuda Univ, Doctoral Sch Appl Informat & Appl Math, H-1034 Budapest, Hungary
[2] Obuda Univ, John Von Neumann Fac Informat, Budapest, Hungary
[3] J Selye Univ, Fac Econ & Informat, Komarno, Slovakia
Keywords
Speech emotion recognition (SER); Hilbert curve; TESS; Gram angle fields; CyTex; FEATURES;
DOI
10.1007/978-3-031-59257-7_8
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Emotions are integral to human existence, influencing psychological wellbeing and permeating various aspects of daily life. Speech emotion recognition (SER) stands as a pivotal branch of emotion detection, focusing on decoding the acoustic nuances embedded in speech signals. This study delves into the landscape of SER, addressing challenges related to feature extraction and classifier development. Inspired by the Hilbert curve, a novel approach is proposed, converting one-dimensional time series data into informative two-dimensional images. A convolutional neural network extracts features from these images, and a fully connected network processes these features for sentiment classification. The study comprehensively evaluates this method across four diverse datasets, namely RAVDESS, TESS, SAVEE, and EmoDB. The proposed algorithm demonstrates promising results, showcasing potential advantages in emotion recognition tasks. Comparative analyses with existing methodologies, including Gram Angle Fields (GAF) and CyTex, affirm the feasibility and effectiveness of the proposed algorithm. The study contributes to advancing sentiment recognition by transforming time-series data into two-dimensional images, thereby opening new avenues in speech emotion recognition with improved accuracy and performance. The paper outlines the algorithms employed, details the methodology, presents experimental results, and concludes with reflections on findings and potential future directions.
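The abstract's central step, converting a one-dimensional time series into a two-dimensional image along a Hilbert curve, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `hilbert_d2xy` and `signal_to_hilbert_image`, the zero-padding, and the min-max normalization are assumptions made for illustration. The index-to-coordinate mapping is the standard iterative Hilbert curve conversion.

```python
import numpy as np


def hilbert_d2xy(order, d):
    """Map index d along a Hilbert curve to (x, y) on a 2**order x 2**order grid."""
    n = 1 << order
    x = y = 0
    t = d
    s = 1
    while s < n:
        rx = 1 & (t // 2)
        ry = 1 & (t ^ rx)
        # Rotate/flip the quadrant so sub-curves join end to end.
        if ry == 0:
            if rx == 1:
                x = s - 1 - x
                y = s - 1 - y
            x, y = y, x
        x += s * rx
        y += s * ry
        t //= 4
        s *= 2
    return x, y


def signal_to_hilbert_image(signal, order):
    """Place successive samples along a Hilbert curve (hypothetical helper).

    Assumptions for this sketch: the signal is truncated or zero-padded to
    4**order samples and min-max scaled to [0, 1] before placement.
    """
    n = 1 << order
    length = n * n
    samples = np.asarray(signal, dtype=np.float64)[:length]
    padded = np.zeros(length, dtype=np.float64)
    padded[: samples.size] = samples

    lo, hi = padded.min(), padded.max()
    if hi > lo:
        padded = (padded - lo) / (hi - lo)
    else:
        padded = np.zeros(length, dtype=np.float64)

    image = np.zeros((n, n), dtype=np.float64)
    for d in range(length):
        x, y = hilbert_d2xy(order, d)
        image[y, x] = padded[d]  # row index is y, column index is x
    return image


if __name__ == "__main__":
    # A synthetic sine wave stands in for a speech waveform.
    t = np.linspace(0.0, 1.0, 4096)
    img = signal_to_hilbert_image(np.sin(2 * np.pi * 5 * t), order=6)
    print(img.shape)
```

Because consecutive curve indices always land on neighbouring pixels, samples that are close in time stay close in the image, which is the locality property that makes such an image a plausible input for a convolutional network.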
Pages: 75-85
Page count: 11
Related papers
50 in total
  • [1] Optimizing Speech Emotion Recognition with Hilbert Curve and convolutional neural network
    Yang, Zijun
    Zhou, Shi
    Zhang, Lifeng
    Serikawa, Seiichi
    [J]. Cognitive Robotics, 2024, 4 : 30 - 41
  • [2] A novel convolutional neural network with gated recurrent unit for automated speech emotion recognition and classification
    Prakash, P. Ravi
    Anuradha, D.
    Iqbal, Javid
    Galety, Mohammad Gouse
    Singh, Ruby
    Neelakandan, S.
    [J]. JOURNAL OF CONTROL AND DECISION, 2023, 10 (01) : 54 - 63
  • [3] Multimodal speech emotion recognition and classification using convolutional neural network techniques
    A. Christy
    S. Vaithyasubramanian
    A. Jesudoss
    M. D. Anto Praveena
    [J]. International Journal of Speech Technology, 2020, 23 : 381 - 388
  • [4] Multimodal speech emotion recognition and classification using convolutional neural network techniques
    Christy, A.
    Vaithyasubramanian, S.
    Jesudoss, A.
    Praveena, M. D. Anto
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2020, 23 (02) : 381 - 388
  • [5] Design of a Convolutional Neural Network for Speech Emotion Recognition
    Lee, Kyong Hee
    Kim, Do Hyun
    [J]. 11TH INTERNATIONAL CONFERENCE ON ICT CONVERGENCE: DATA, NETWORK, AND AI IN THE AGE OF UNTACT (ICTC 2020), 2020, : 1332 - 1335
  • [6] CONVOLUTIONAL NEURAL NETWORK TECHNIQUES FOR SPEECH EMOTION RECOGNITION
    Parthasarathy, Srinivas
    Tashev, Ivan
    [J]. 2018 16TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2018, : 121 - 125
  • [7] Two-Dimensional Cepstrum Analysis Approach in Emotion Recognition from Speech
    Guoth, Igor
    Chmulik, Michal
    Polacky, Jozef
    Kuba, Michal
    [J]. 2016 39TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2016, : 335 - 339
  • [8] Speech Emotion Recognition based on Interactive Convolutional Neural Network
    Cheng, Huihui
    Tang, Xiaoyu
    [J]. 2020 IEEE 3RD INTERNATIONAL CONFERENCE ON INFORMATION COMMUNICATION AND SIGNAL PROCESSING (ICICSP 2020), 2020, : 163 - 167
  • [9] A NEW APPROACH FOR SPEECH EMOTION RECOGNITION USING SINGLE LAYERED CONVOLUTIONAL NEURAL NETWORK
    Mannan, J. Mannar
    Kumar, V. Vinoth
    Palaiahnakote, Shivakumara
    Khan, Surbhi Bhatia
    Almusharraf, Ahlam
    [J]. MALAYSIAN JOURNAL OF COMPUTER SCIENCE, 2024, 37 (01) : 89 - 106
  • [10] Effect on speech emotion classification of a feature selection approach using a convolutional neural network
    Amjad, Ammar
    Khan, Lal
    Chang, Hsien-Tsung
    [J]. PeerJ Computer Science, 2021, 7