Speech emotion recognition using Ramanujan Fourier Transform

被引:6
|
作者
Flower, T. Mary Little [1 ]
Jaya, T. [2 ]
机构
[1] St Xaviers Catholic Coll Engn, Dept Elect & Commun Engn, Chunkankadai, Tamil Nadu, India
[2] CSI Inst Technol, Dept Elect & Commun Engn, Thovalai, Tamil Nadu, India
关键词
Ramanujan Fourier Transform; SVM; KNN; Discriminant analysis; Machine learning; FEATURE-SELECTION METHOD; FEATURES;
D O I
10.1016/j.apacoust.2022.109133
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
A novel technique is presented for the analysis of Speech Emotion Recognition (SER) using Ramanujan Fourier Transform (RFT). The unique method involves numerically encoding the speech emotion data before applying the RFT. The RFT's foundation is the projection of the obtained numerical series onto a collection of fundamental functions made up of Ramanujan sums (RS). In RS components, SER data base such as Berlin, eNTERFACE, RAVDESS, SAVEE, EMOVO, EmoFilm, and Urdu are considered for testing the accuracy. This research work proposes on RFT feature based speech emotion classification. The speech emotion samples was analyzed by Ramanujan Fourier Transform and the statistical feature extraction was carried out, fed to the machine learning classifiers. The multiclass SVM based speech emotion classification was found to be proficient, when compared with the KNN and Linear Discriminant Analysis classifiers. The algorithms are evaluated on seven data bases and the results reveals that, multiclass SVM out performs other classifiers in terms of accuracy. The RFT as a stand-alone feature recognizes speech emotion with an accuracy of 83.08% for Berlin, 82.67% for eNTERFACE' 05, 81.79% for EmoFilm, 82.98% for RAVDESS, 82.99% for EMOVO, 84% for Urdu, and 83.75% for SAVEE databases using Multiclass SVM classifier. The outcome of this research work paves a way to the researchers in speech emotion analysis for real world applications. (c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Research on Speech Emotion Recognition Based on the Fractional Fourier Transform
    Huang, Lirong
    Shen, Xizhong
    [J]. ELECTRONICS, 2022, 11 (20)
  • [2] Speech Emotion Recognition Using Fourier Parameters
    Wang, Kunxia
    An, Ning
    Li, Bing Nan
    Zhang, Yanyong
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2015, 6 (01) : 69 - 75
  • [3] Emotion recognition using fourier transform and genetic programming
    Acharya, Divya
    Billimoria, Anosh
    Srivastava, Neishka
    Goel, Shivani
    Bhardwaj, Arpit
    [J]. APPLIED ACOUSTICS, 2020, 164
  • [4] Fractional Fourier transform features for speech recognition
    Sarikaya, R
    Gao, YQ
    Saon, G
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL I, PROCEEDINGS: SPEECH PROCESSING, 2004, : 529 - 532
  • [5] Emotion Recognition Based on Multiple Order Features Using Fractional Fourier Transform
    Ren, Bo
    Liu, Deyin
    Qi, Lin
    [J]. NINTH INTERNATIONAL CONFERENCE ON DIGITAL IMAGE PROCESSING (ICDIP 2017), 2017, 10420
  • [6] Emotion recognition from speech using wavelet packet transform and prosodic features
    Gupta, Manish
    Bharti, Shambhu Shankar
    Agarwal, Suneeta
    [J]. JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2018, 35 (02) : 1541 - 1553
  • [7] Speech emotion recognition using a combination of variational mode decomposition and Hilbert transform
    Mishra, Siba Prasad
    Warule, Pankaj
    Deb, Suman
    [J]. APPLIED ACOUSTICS, 2024, 222
  • [8] Continuous Wavelet Transform based Speech Emotion Recognition
    Shegokar, Pankaj
    Sircar, Pradip
    [J]. 2016 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION SYSTEMS (ICSPCS), 2016,
  • [9] Temporal Discrete Cosine Transform for Speech Emotion Recognition
    Popovic, Branislav
    Stankovic, Igor
    Ostrogonac, Stevan
    [J]. 2013 IEEE 4TH INTERNATIONAL CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM), 2013, : 87 - 90
  • [10] Discrete Fourier transform computation using prime Ramanujan numbers
    Bhatnagar, N
    [J]. PROCEEDINGS OF THE INDIAN ACADEMY OF SCIENCES-MATHEMATICAL SCIENCES, 1997, 107 (01): : 95 - 100