Investigating Graph-based Features for Speech Emotion Recognition

被引:3
|
作者
Pentari, Anastasia [1 ]
Kafentzis, George [2 ]
Tsiknakis, Manolis [3 ,4 ]
机构
[1] Fdn Res & Technol Hellas, Computat BioMed Lab, Iraklion, Greece
[2] Univ Crete, Dept Comp Sci, Iraklion, Greece
[3] Hellen Mediterranean Univ, Biomed Informat & eHlth, Dept Elect & Comp Engn, Iraklion, Greece
[4] Inst Comp Sci, Iraklion, Greece
基金
欧盟地平线“2020”;
关键词
Affective Computing; Emotion Recognition; Speech Analysis; Visibility Graph Theory; Graph-based Features; FREQUENCY-ANALYSIS;
D O I
10.1109/BHI56158.2022.9926795
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
During the last decades, automatic speech emotion recognition (SER) has gained an increased interest by the research community. Specifically, SER aims to recognize the emotional state of a speaker directly from a speech recording. The most prominent approaches in the literature include feature extraction of speech signals in time and/or frequency domain that are successively applied as input into a classification scheme. In this paper, we propose to exploit graph theory and structures as alternative forms of speech representations. We suggest applying the so-called Visibility Graph (VG) theory to represent speech data using an adjacency matrix and extract well-known graph-based features from the latter. Finally, these features are fed into a Support Vector Machine (SVM) classifier in a leave-one-speaker-out, multi-class fashion. Our proposed feature set is compared with a well-known acoustic feature set named the Geneva Minimalistic Acoustic Parameter Set (GeMAPS). We test both approaches on two publicly available speech datasets: SAVEE and EMOVO. The experimental results show that the proposed graph-based features provide better results, namely a classification accuracy of 70% and 98%, respectively, yielding an increase by 29.2% and 60.6%, respectively, when compared to GeMAPS.
引用
下载
收藏
页数:5
相关论文
共 50 条
  • [31] Invisible emotion magnification algorithm (IEMA) for real-time micro-expression recognition with graph-based features
    Buhari, Adamu Muhammad
    Ooi, Chee-Pun
    Baskaran, Vishnu Monn
    Phan, Raphael C. W.
    Wong, KokSheik
    Tan, Wooi-Haw
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (07) : 9151 - 9176
  • [32] Invisible emotion magnification algorithm (IEMA) for real-time micro-expression recognition with graph-based features
    Adamu Muhammad Buhari
    Chee-Pun Ooi
    Vishnu Monn Baskaran
    Raphael CW Phan
    KokSheik Wong
    Wooi-Haw Tan
    Multimedia Tools and Applications, 2022, 81 : 9151 - 9176
  • [33] Enhancement of Medical Named Entity Recognition Using Graph-based Features
    Keretna, Sara
    Lim, Chee Peng
    Creighton, Doug
    2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, : 1895 - 1900
  • [34] SEMI-SUPERVISED SPEECH RECOGNITION VIA GRAPH-BASED TEMPORAL CLASSIFICATION
    Moritz, Niko
    Hori, Takaaki
    Le Roux, Jonathan
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6548 - 6552
  • [35] Speech Emotion Recognition Based on Transfer Emotion-Discriminative Features Subspace Learning
    Zhang, Kexin
    Liu, Yunxiang
    IEEE ACCESS, 2023, 11 : 56336 - 56343
  • [36] Integrating Language and Emotion Features for Multilingual Speech Emotion Recognition
    Heracleous, Panikos
    Mohammad, Yasser
    Yoneyama, Akio
    HUMAN-COMPUTER INTERACTION. MULTIMODAL AND NATURAL INTERACTION, HCI 2020, PT II, 2020, 12182 : 187 - 196
  • [37] Novel acoustic features for speech emotion recognition
    ROH Yong-Wan
    KIM Dong-Ju
    LEE Woo-Seok
    HONG Kwang-Seok
    Science China Technological Sciences, 2009, 52 (07) : 1838 - 1848
  • [38] Exploiting the potentialities of features for speech emotion recognition
    Li, Dongdong
    Zhou, Yijun
    Wang, Zhe
    Gao, Daqi
    INFORMATION SCIENCES, 2021, 548 : 328 - 343
  • [39] Learning Transferable Features for Speech Emotion Recognition
    Marczewski, Alison
    Veloso, Adriano
    Ziviani, Nivio
    PROCEEDINGS OF THE THEMATIC WORKSHOPS OF ACM MULTIMEDIA 2017 (THEMATIC WORKSHOPS'17), 2017, : 529 - 536
  • [40] Significance of Phonological Features in Speech Emotion Recognition
    Wei Wang
    Paul A. Watters
    Xinyi Cao
    Lingjie Shen
    Bo Li
    International Journal of Speech Technology, 2020, 23 : 633 - 642