Research on Speech Emotion Recognition Based on the Fractional Fourier Transform

被引:10
|
作者
Huang, Lirong [1 ]
Shen, Xizhong [1 ]
机构
[1] Shanghai Inst Technol, Sch Elect & Elect Engn, Shanghai 201418, Peoples R China
关键词
speech emotion recognition; the fractional fourier transform; MFCC; LSTM; RAVDESS; ambiguity function; ORDER;
D O I
10.3390/electronics11203393
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Speech emotion recognition is an important part of human-computer interaction, and the use of computers to analyze emotions and extract speech emotion features that can achieve high recognition rates is an important step. We applied the Fractional Fourier Transform (FrFT), and then constructed it to extract MFCC and combined it with a deep learning method for speech emotion recognition. Since the performance of FrFT depends on the transform order p, we utilized an ambiguity function to determine the optimal order for each frame of speech. The MFCC was extracted under the optimal order of FrFT for each frame of speech. Finally, combining the deep learning network LSTM for speech emotion recognition. Our experiment was conducted on the RAVDESS, and detailed confusion matrices and accuracy were given for analysis. The MFCC extracted using FrFT was shown to have better performance than ordinal FT, and the proposed model achieved a weighting accuracy of 79.86%.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] The Research of Speech Emotion Recognition Based on Gaussian Mixture Model
    Zhang, Wanli
    Li, Guoxin
    Gao, Wei
    MECHANICAL COMPONENTS AND CONTROL ENGINEERING III, 2014, 668-669 : 1126 - +
  • [32] Research and Implementation of Speech Emotion Recognition Based on CGRU Model
    Zheng Y.
    Chen J.-N.
    Wu F.
    Fu B.
    Dongbei Daxue Xuebao/Journal of Northeastern University, 2020, 41 (12): : 1680 - 1685
  • [33] Temporal Discrete Cosine Transform for Speech Emotion Recognition
    Popovic, Branislav
    Stankovic, Igor
    Ostrogonac, Stevan
    2013 IEEE 4TH INTERNATIONAL CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM), 2013, : 87 - 90
  • [34] Research progress on discretization of fractional Fourier transform
    Ran Tao
    Feng Zhang
    Yue Wang
    Science in China Series F: Information Sciences, 2008, 51
  • [36] Research progress on discretization of fractional Fourier transform
    Tao, Ran
    Zhang, Feng
    Wang, Yue
    SCIENCE IN CHINA SERIES F-INFORMATION SCIENCES, 2008, 51 (07): : 859 - 880
  • [37] Research on DOA Estimation of Nonstationary Signal Based on Fractional Fourier Transform
    Zhu, Yunchao
    Yang, Kunde
    Li, Hui
    Wu, Feiyun
    Yang, Qiulong
    Xue, Runze
    2018 OCEANS - MTS/IEEE KOBE TECHNO-OCEANS (OTO), 2018,
  • [38] 3D palatal rugae recognition based on Fractional Fourier Transform
    Shangguan, Hong
    Yang, Tingyu
    Luo, Qiang
    Zhang, Xiong
    Li, Bing
    Wang, Shuai
    DIGITAL SIGNAL PROCESSING, 2023, 133
  • [39] Speech emotion recognition based on emotion perception
    Gang Liu
    Shifang Cai
    Ce Wang
    EURASIP Journal on Audio, Speech, and Music Processing, 2023
  • [40] Speech emotion recognition based on emotion perception
    Liu, Gang
    Cai, Shifang
    Wang, Ce
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2023, 2023 (01)