Multi-Taper Spectral Features for Emotion Recognition from Speech

被引:0
|
作者
Chapaneri, Santosh V. [1 ]
Jayaswal, Deepak D. [1 ]
机构
[1] Univ Mumbai, St Francis Inst Technol, Dept Elect & Telecommun Engn, Mumbai, Maharashtra, India
关键词
Emotion; Multi-taper; Pattern recognition; SVM; MFCC;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, the performance of multi-taper spectral estimate is investigated relative to conventional single taper estimate for the application of emotion recognition from speech signals. Typically, a single taper/window helps in reducing bias of the estimate, but due to its high variance, the resulting spectral features tend to give poor recognition performance. The weighted averages of the multi-tapered uncorrelated eigenspectra results in more discriminative spectral features, thus increasing the overall performance. We demonstrate that the application of six Multi-peak multi-tapers with support vector machine results in 81 % classification accuracy on seven emotions from Berlin emotion database considering only spectral features, compared to 72% using conventional Hamming window method.
引用
收藏
页码:1044 / 1049
页数:6
相关论文
共 50 条
  • [21] GMM supervector based SVM with spectral features for speech emotion recognition
    Hu, Hao
    Xu, Ming-Xing
    Wu, Wei
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 413 - +
  • [22] Urdu Speech Emotion Recognition using Speech Spectral Features and Deep Learning Techniques
    Taj, Soonh
    Shaikh, Ghulam Mujtaba
    Hassan, Saif
    Nimra
    [J]. 2023 4th International Conference on Computing, Mathematics and Engineering Technologies: Sustainable Technologies for Socio-Economic Development, iCoMET 2023, 2023,
  • [23] A Multi-Taper S-Transform Method for Spectral Estimation of Stationary Processes
    Huang, Zifeng
    Xu, You-Lin
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2021, 69 (69) : 1452 - 1467
  • [24] Spectral-norm risk rates for multi-taper estimation of Gaussian processes
    Romero, Jose Luis
    Speckbacher, Michael
    [J]. JOURNAL OF NONPARAMETRIC STATISTICS, 2022, 34 (02) : 448 - 464
  • [25] Spectral analysis of wave elevation time histories using multi-taper method
    Jeyaseelan, A. S.
    Balaji, R.
    [J]. OCEAN ENGINEERING, 2015, 105 : 242 - 246
  • [26] Amplitude Modulation Features for Emotion Recognition from Speech
    Alam, Md Jahangir
    Attabi, Yazid
    Dumouchel, Pierre
    Kenny, Patrick
    O'Shaughnessy, D.
    [J]. 14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2419 - 2423
  • [27] Evaluating intonational features for emotion recognition from speech
    Zervas, Panagiotis
    Mporas, Iosif
    Fakotakis, Nikos
    Kokkinakis, George
    [J]. INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2007, 16 (06) : 1001 - 1014
  • [28] Emotion recognition from speech using sub-syllabic and pitch synchronous spectral features
    Shashidhar G. Koolagudi
    Sreenivasa Rao Krothapalli
    [J]. International Journal of Speech Technology, 2012, 15 (4) : 495 - 511
  • [29] Emotion Recognition from Speech Utterances with Hybrid Spectral Features Using Machine Learning Algorithms
    Raghu, Kogila
    Sadanandam, Manchala
    [J]. TRAITEMENT DU SIGNAL, 2022, 39 (02) : 603 - 609
  • [30] Emotion recognition from speech using sub-syllabic and pitch synchronous spectral features
    Koolagudi, Shashidhar
    Krothapalli, Sreenivasa
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2012, 15 (04) : 495 - 511