F0, LPC, and MFCC Analysis for Emotion Recognition Based on Speech

Cited by: 2
Authors
Teixeira, Felipe L. [1 ,2 ,3 ]
Teixeira, Joao Paulo [2 ,3 ,4 ]
Soares, Salviano F. P. [1 ,5 ]
Pio Abreu, J. L. [6 ,7 ]
Affiliations
[1] Engn Dept UTAD, Sch Sci & Technol, P-5000801 Vila Real, Portugal
[2] Inst Politecn Braganca, Res Ctr Digitalizat & Intelligent Robot CEDRI, P-5300253 Braganca, Portugal
[3] Inst Politecn Braganca, Lab Sustentabilidade & Tecnol Regioes Montanha Su, P-5300253 Braganca, Portugal
[4] Inst Politecn Braganca, Appl Management Res Unit UNIAG, P-5300253 Braganca, Portugal
[5] Inst Elect & Informat Engn Aveiro IEETA, P-3810193 Aveiro, Portugal
[6] Hosp Univ Coimbra, P-3004561 Coimbra, Portugal
[7] Univ Coimbra, Fac Med, P-3000548 Coimbra, Portugal
Keywords
Emotional state; Speech; SVM; FEATURES; SELECTION; CLASSIFICATION;
DOI
10.1007/978-3-031-23236-7_27
Chinese Library Classification number
TP18 [Artificial Intelligence Theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this work, we investigated what is needed to build a database for recognising emotions through speech. We examined features that can support a good success rate in emotion recognition through speech and can be used to identify emotional states, as well as characteristics (symptoms) that can be associated with a specific emotional state. A Speech Emotion Recognition (SER) system was built with an SVM, and the binary analysis was compared with a multi-class analysis. The binary analysis achieved an accuracy of 87.5% and the multi-class analysis 42.6%. The parameters used were Fundamental Frequency (F0), Linear Predictive Coefficients (LPC), and Mel Frequency Cepstral Coefficients (MFCC). This modest accuracy was achieved using only the F0, LPC, and MFCC features.
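As an illustration of the first of the three features, the sketch below estimates F0 for a single frame by picking the peak of its autocorrelation. The abstract does not say how F0 was extracted, so the method, the function name `estimate_f0`, the search range of 75 to 400 Hz, and the synthetic test tone are all assumptions made for this example, not the authors' procedure.

```python
import math


def estimate_f0(frame, sample_rate, f_min=75.0, f_max=400.0):
    """Estimate the pitch of one frame (Hz) from its autocorrelation peak.

    Hypothetical helper for illustration; the search range is an assumption.
    """
    lag_min = int(sample_rate / f_max)                       # shortest period searched
    lag_max = min(int(sample_rate / f_min), len(frame) - 1)  # longest period searched
    best_lag, best_score = lag_min, float("-inf")
    for lag in range(lag_min, lag_max + 1):
        # Unnormalised autocorrelation at this lag.
        score = sum(frame[i] * frame[i + lag] for i in range(len(frame) - lag))
        if score > best_score:
            best_lag, best_score = lag, score
    return sample_rate / best_lag


# Synthetic 200 Hz tone standing in for a voiced speech frame (assumption).
sample_rate = 8000
tone = [math.sin(2 * math.pi * 200.0 * n / sample_rate) for n in range(800)]
f0 = estimate_f0(tone, sample_rate)
print(f"estimated F0: {f0:.1f} Hz")
```

In a full pipeline, per-frame values like this would typically be summarised (for example by mean and range) and combined with LPC and MFCC values into the feature vector passed to the SVM.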
Pages: 389-404
Page count: 16
Related papers
50 in total
  • [1] Robust F0 estimation based on complex LPC analysis for IRS filtered noisy speech
    Funaki, Keiichi
    Kinjo, Tatsuhiko
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2007, E90A (08) : 1579 - 1586
  • [2] Speech Emotion Recognition Based on Improved MFCC
    Wang, Yan
    Hu, Weiping
    [J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND APPLICATION ENGINEERING (CSAE2018), 2018,
  • [3] Speech Based Human Emotion Recognition Using MFCC
    Likitha, M. S.
    Gupta, Raksha R.
    Hasitha, K.
    Raju, A. Upendra
    [J]. 2017 2ND IEEE INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, SIGNAL PROCESSING AND NETWORKING (WISPNET), 2017, : 2257 - 2260
  • [4] An analysis on LPC, RASTA and MFCC techniques in Automatic Speech Recognition System
    Gupta, Kartiki
    Gupta, Divya
    [J]. 2016 6th International Conference - Cloud System and Big Data Engineering (Confluence), 2016, : 493 - 497
  • [5] F0 estimation of noisy speech based on complex speech analysis
    Kinjo, Tatsuhiko
    Funaki, Keiichi
    [J]. 2006 IEEE 12TH DIGITAL SIGNAL PROCESSING WORKSHOP & 4TH IEEE SIGNAL PROCESSING EDUCATION WORKSHOP, VOLS 1 AND 2, 2006, : 434 - 437
  • [6] ASERNet: Automatic speech emotion recognition system using MFCC-based LPC approach with deep learning CNN
    Jagadeeshwar, Kalyanapu
    Sreenivasarao, T.
    Pulicherla, Padmaja
    Satyanarayana, K. N. V.
    Lakshmi, K. Mohana
    Kumar, Pala Mahesh
    [J]. INTERNATIONAL JOURNAL OF MODELING SIMULATION AND SCIENTIFIC COMPUTING, 2023, 14 (04)
  • [7] Quantification of Segmentation and F0 Errors and Their Effect on Emotion Recognition
    Steidl, Stefan
    Batliner, Anton
    Noeth, Elinar
    Hornegger, Joachim
    [J]. TEXT, SPEECH AND DIALOGUE, PROCEEDINGS, 2008, 5246 : 525 - 534
  • [8] SEQUENCE-TO-SEQUENCE MODELLING OF F0 FOR SPEECH EMOTION CONVERSION
    Robinson, Carl
    Obin, Nicolas
    Roebel, Axel
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6830 - 6834
  • [9] Emotion Recognition in Speech Using MFCC and Classifiers
    Ajitha, G.
    Prashanth, Addagatla
    Radhika, Chelle
    Chaitanya, Kancharapu
    [J]. COMPUTATIONAL VISION AND BIO-INSPIRED COMPUTING ( ICCVBIC 2021), 2022, 1420 : 197 - 207
  • [10] Robust F0 Modeling for Mandarin Speech Recognition in Noise
    Qiang, Sheng
    Qian, Yao
    Soong, Frank K.
    Xu, Congfu
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1101 - +