F0, LPC, and MFCC Analysis for Emotion Recognition Based on Speech

Cited by: 2
Authors
Teixeira, Felipe L. [1 ,2 ,3 ]
Teixeira, Joao Paulo [2 ,3 ,4 ]
Soares, Salviano F. P. [1 ,5 ]
Pio Abreu, J. L. [6 ,7 ]
Affiliations
[1] Engn Dept UTAD, Sch Sci & Technol, P-5000801 Vila Real, Portugal
[2] Inst Politecn Braganca, Res Ctr Digitalizat & Intelligent Robot CEDRI, P-5300253 Braganca, Portugal
[3] Inst Politecn Braganca, Lab Sustentabilidade & Tecnol Regioes Montanha Su, P-5300253 Braganca, Portugal
[4] Inst Politecn Braganca, Appl Management Res Unit UNIAG, P-5300253 Braganca, Portugal
[5] Inst Elect & Informat Engn Aveiro IEETA, P-3810193 Aveiro, Portugal
[6] Hosp Univ Coimbra, P-3004561 Coimbra, Portugal
[7] Univ Coimbra, Fac Med, P-3000548 Coimbra, Portugal
Keywords
Emotional state; Speech; SVM; Features; Selection; Classification
DOI
10.1007/978-3-031-23236-7_27
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This work investigates what is needed to build a database for recognising emotions from speech. It examines features that can yield a good success rate in speech emotion recognition, as well as characteristics (symptoms) that can be associated with specific emotional states. A speech emotion recognition (SER) system was built with an SVM, and binary classification was compared with multi-class classification. The binary analysis achieved an accuracy of 87.5% and the multi-class analysis 42.6%. The features used were the fundamental frequency (F0), Linear Predictive Coefficients (LPC), and Mel Frequency Cepstral Coefficients (MFCC). These modest accuracies were obtained using only F0, LPC, and MFCC features.
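As an illustration of two of the features named in the abstract, the following is a minimal sketch of frame-level F0 estimation by autocorrelation peak picking and of LPC computation by the Levinson-Durbin recursion. It is not the authors' implementation: the function names, the 75-400 Hz search range, the LPC order, and the synthetic test tone are all assumptions made for the example, and MFCC extraction and the SVM classifier are left out.

```python
import math


def autocorrelation(frame, max_lag):
    """Biased autocorrelation r[0..max_lag] of one signal frame."""
    n = len(frame)
    return [
        sum(frame[i] * frame[i + lag] for i in range(n - lag))
        for lag in range(max_lag + 1)
    ]


def estimate_f0(frame, sample_rate, f0_min=75.0, f0_max=400.0):
    """Estimate F0 (Hz) as the lag with the highest autocorrelation.

    The search range f0_min..f0_max is an assumed typical speech range,
    not a value taken from the paper.
    """
    min_lag = int(sample_rate / f0_max)
    max_lag = int(sample_rate / f0_min)
    r = autocorrelation(frame, max_lag)
    best_lag = max(range(min_lag, max_lag + 1), key=lambda lag: r[lag])
    return sample_rate / best_lag


def lpc_coefficients(frame, order):
    """LPC polynomial [1, a1, ..., a_order] via Levinson-Durbin recursion."""
    r = autocorrelation(frame, order)
    a = [1.0] + [0.0] * order
    error = r[0]  # prediction error energy, shrinks at every step
    for i in range(1, order + 1):
        acc = r[i] + sum(a[j] * r[i - j] for j in range(1, i))
        k = -acc / error  # reflection coefficient for this order
        updated = a[:]
        for j in range(1, i):
            updated[j] = a[j] + k * a[i - j]
        updated[i] = k
        a = updated
        error *= 1.0 - k * k
    return a


# Hypothetical input: a 50 ms frame of a 200 Hz tone sampled at 8 kHz.
sample_rate = 8000
tone_hz = 200.0
frame = [math.sin(2 * math.pi * tone_hz * i / sample_rate) for i in range(400)]

f0 = estimate_f0(frame, sample_rate)
lpc = lpc_coefficients(frame, order=2)
print("F0 estimate (Hz):", f0)
print("LPC coefficients:", lpc)
```

In a full pipeline, per-frame values such as these would typically be summarised per utterance (for example by mean and standard deviation) and concatenated with MFCC statistics before being passed to a classifier; that aggregation step is likewise an assumption and is not described in the abstract.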
Pages: 389-404
Number of pages: 16
Related papers
50 in total
  • [21] F0 Estimation of Speech Using SRH Based on TV-CAR Speech Analysis
    Funaki, Keiichi
    Higa, Takehito
    [J]. IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2013, E96A (11) : 2187 - 2190
  • [22] Investigation of Prosodic F0 Layers in Hierarchical F0 Modeling for HMM-based Speech Synthesis
    Lei, Ming
    Wu, Yi-Jian
    Ling, Zhen-Hua
    Dai, Li-Rong
    [J]. 2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 613 - +
  • [23] JOINT ANALYSIS OF F0 AND SPEECH RATE WITH FUNCTIONAL DATA ANALYSIS
    Gubian, Michele
    Boves, Lou
    Cangemi, Francesco
    [J]. 2011 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2011, : 4972 - 4975
  • [24] THE EMOTIONAL RECOGNITION RESEARCH ON THE F0 EFFECTS OF ERP COMPONENTS OF SPEECH SIGNAL
    Chang, Jiang
    Zhang, Xue-Ying
    Zhang, Qi-Ping
    Sun, Ying
    Chen, Hong-Tao
    [J]. JOURNAL OF RESIDUALS SCIENCE & TECHNOLOGY, 2016, 13 (01) : 111 - 119
  • [25] F0 ESTIMATION USING SRH BASED ON TV-CAR SPEECH ANALYSIS
    Funaki, Keiichi
    Higa, Takehito
    [J]. 2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 2777 - 2781
  • [26] Speech Emotion Recognition using MFCC and Hybrid Neural Networks
    Badr, Youakim
    Mukherjee, Partha
    Thumati, Sindhu
    [J]. PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL INTELLIGENCE (IJCCI), 2021, : 366 - 373
  • [27] On Evaluation of the F0 estimation based on time-varying complex speech analysis
    Funaki, Keiichi
    [J]. 11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2, 2010, : 637 - 640
  • [28] Speech Emotion Recognition using MFCC features and LSTM network
    Kumbhar, Harshawardhan S.
    Bhandari, Sheetal U.
    [J]. 2019 5TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION, CONTROL AND AUTOMATION (ICCUBEA), 2019,
  • [29] Perturbative QCD analysis of neutral B-meson decays into σ σ, σ f0 and f0 f0
    Niu, Hua-Dian
    Li, Guo-Dong
    Ren, Jia-Le
    Liu, Xin
    [J]. EUROPEAN PHYSICAL JOURNAL C, 2022, 82 (02):
  • [30] Development of Speech Emotion Recognition Algorithm using MFCC and Prosody
    Koo, Hyejin
    Jeong, Soycong
    Yoon, Sungjae
    Kim, Wonjong
    [J]. 2020 INTERNATIONAL CONFERENCE ON ELECTRONICS, INFORMATION, AND COMMUNICATION (ICEIC), 2020,