Multimodal Affect Classification at Various Temporal Lengths

Cited by: 20
Authors
Kim, Jonathan C. [1]
Clements, Mark A. [1]
Affiliations
[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
Funding
National Science Foundation (USA)
Keywords
Audio-visual emotion recognition; classifier fusion; speech analysis; emotion recognition; acoustic profiles; vocal expressions
DOI
10.1109/TAFFC.2015.2411273
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Earlier studies have shown that certain emotional characteristics are best observed at different analysis-frame lengths. When features of multiple modalities are extracted, it is reasonable to believe that different temporal lengths would better model the underlying characteristics that result from different emotions. In this study, we examine the use of such differing timescales in constructing emotion classifiers. A novel fusion method is introduced that utilizes the outputs of individual classifiers that are trained using multi-dimensional inputs with multiple temporal lengths. We used the IEMOCAP database, which contains audiovisual information of 10 subjects in dyadic interaction settings. The classification task was performed over three emotional dimensions: valence, activation, and dominance. The results demonstrate the utility of the multimodal-multitemporal approach. Statistically significant improvements in accuracy are seen in all three dimensions when compared with unimodal-unitemporal classifiers.
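The abstract does not specify the fusion rule, so the following is only a minimal sketch of one generic form of late fusion: averaging the class posteriors produced by separate classifiers, one per modality and analysis-window length. It is not the authors' method. The names `fuse_posteriors` and `predict`, the window lengths, the three-class setup, and all numbers are hypothetical and chosen purely for illustration.

```python
from typing import Dict, List, Optional, Tuple

# Key: (modality, analysis-window length in seconds). Both are illustrative assumptions.
StreamKey = Tuple[str, float]


def fuse_posteriors(
    posteriors: Dict[StreamKey, List[float]],
    weights: Optional[Dict[StreamKey, float]] = None,
) -> List[float]:
    """Weighted average of per-stream class posteriors (a simple late-fusion rule)."""
    keys = list(posteriors)
    if not keys:
        raise ValueError("at least one classifier output is required")
    n_classes = len(posteriors[keys[0]])
    if weights is None:
        # Assumption: every stream is trusted equally.
        weights = {k: 1.0 for k in keys}
    total = sum(weights[k] for k in keys)
    fused = [0.0] * n_classes
    for k in keys:
        p = posteriors[k]
        if len(p) != n_classes:
            raise ValueError("all streams must score the same set of classes")
        for c in range(n_classes):
            fused[c] += weights[k] * p[c] / total
    return fused


def predict(fused: List[float]) -> int:
    """Index of the class with the highest fused posterior."""
    return max(range(len(fused)), key=fused.__getitem__)


# Made-up posteriors over three hypothetical levels of one emotional dimension.
example = {
    ("audio", 0.5): [0.6, 0.3, 0.1],
    ("audio", 2.0): [0.2, 0.5, 0.3],
    ("video", 0.5): [0.1, 0.6, 0.3],
    ("video", 2.0): [0.3, 0.4, 0.3],
}
fused = fuse_posteriors(example)
label = predict(fused)
```

With equal weights, the short-window audio stream favors the first class on its own, but the fused decision follows the majority of streams. Passing a `weights` dictionary would let a caller favor streams that validate better, which is one of several ways such a scheme could be tuned.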
Pages: 371-384
Page count: 14