SPATIO-TEMPORAL CONTEXT MODELLING FOR SPEECH EMOTION CLASSIFICATION

Authors
Jalal, Md Asif [1]
Moore, Roger K. [1]
Hain, Thomas [1]
Affiliations
[1] Univ Sheffield, Speech & Hearing Res Grp SPandH, Sheffield, S Yorkshire, England
Keywords
Emotion classification; SER; Deep Neural Networks; Convolutional Neural Network; Attention Network;
DOI
10.1109/asru46091.2019.9004037
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Speech emotion recognition (SER) is a requisite for emotional intelligence that affects the understanding of speech. One of the most crucial tasks is to obtain, from the speech signal, patterns that correlate maximally with the emotion classification task while remaining invariant to changes in frequency, time and other external distortions. Therefore, learning emotional contextual feature representations that are independent of speaker and environment is essential. In this paper, a novel spatiotemporal context modelling framework for robust SER is proposed to learn feature representations by using acoustic context expansion with high dimensional feature projection. The framework uses a deep convolutional neural network (CNN) and a self-attention network. The CNNs combine spatiotemporal features. The attention network produces high dimensional task-specific features and combines these features for context modelling, which altogether provides a state-of-the-art technique for classifying the extracted patterns for speech emotion. Speech emotion is a categorical perception representing discrete sensory events. The proposed approach is compared with a wide range of architectures on the RAVDESS and IEMOCAP corpora for 8-class and 4-class emotion classification tasks, and remarkable gains over state-of-the-art systems are obtained: 15% and 10% absolute, respectively.
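The abstract does not give the details of the attention network, so the following is only a minimal sketch of the general idea of combining frame-level features through self-attention. It assumes scaled dot-product self-attention with identity projections (no learned weights) followed by mean pooling over time. The CNN front end is omitted, and the function names and toy frame values are hypothetical, not taken from the paper.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(frames):
    """Scaled dot-product self-attention with Q = K = V = frames.

    Assumption: identity projections stand in for the learned
    query/key/value matrices a trained network would use.
    """
    d = len(frames[0])
    attended = []
    for q in frames:
        # Similarity of this frame to every frame in the utterance.
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in frames]
        weights = softmax(scores)
        # Each output frame is a weighted mix of all input frames.
        attended.append(
            [sum(w * v[j] for w, v in zip(weights, frames)) for j in range(d)]
        )
    return attended


def utterance_embedding(frames):
    """Mean-pool the attended frames into one utterance-level vector."""
    attended = self_attention(frames)
    n = len(attended)
    return [sum(col) / n for col in zip(*attended)]


# Toy frame-level features (hypothetical values, 3 frames x 2 dimensions).
frames = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
embedding = utterance_embedding(frames)
```

In a full system of the kind the abstract describes, the input frames would presumably be CNN feature maps rather than hand-written vectors, and the pooled vector would feed a classifier over the emotion classes.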
Pages: 853-859
Number of pages: 7