Multimodal Affect Classification at Various Temporal Lengths

被引:20
|
作者
Kim, Jonathan C. [1 ]
Clements, Mark A. [1 ]
机构
[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
基金
美国国家科学基金会;
关键词
Audio-visual emotion recognition; classifier fusion; speech analysis; EMOTION RECOGNITION; ACOUSTIC PROFILES; VOCAL EXPRESSIONS;
D O I
10.1109/TAFFC.2015.2411273
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Earlier studies have shown that certain emotional characteristics are best observed at different analysis-frame lengths. When features of multiple modalities are extracted, it is reasonable to believe that different temporal lengths would better model the underlying characteristics that result from different emotions. In this study, we examine the use of such differing timescales in constructing emotion classifiers. A novel fusion method is introduced that utilizes the outputs of individual classifiers that are trained using multi-dimensional inputs with multiple temporal lengths. We used the IEMOCAP database which contains audiovisual information of 10 subjects in dyadic interaction settings. The classification task was performed over three emotional dimensions: valence, activation, and dominance. The results demonstrate the utility of the multimodal-multitemporal approach. Statistically significant improvements in accuracy are seen for in all three dimensions when compared with unimodal-unitemporal classifiers.
引用
收藏
页码:371 / 384
页数:14
相关论文
共 50 条
  • [31] Multimodal temporal pattern mining
    Hong, PY
    Huang, TS
    16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL III, PROCEEDINGS, 2002, : 465 - 468
  • [32] Temporal Multimodal Multivariate Learning
    Park, Hyoshin
    Darko, Justice
    Deshpande, Niharika
    Pandey, Venktesh
    Su, Hui
    Ono, Masahiro
    Barkely, Dedrick
    Folsom, Larkin
    Posselt, Derek
    Chien, Steve
    PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 3722 - 3732
  • [33] Temporal mechanisms of multimodal binding
    Burr, David
    Silva, Ottavia
    Cicchini, Guido Marco
    Banks, Martin S.
    Morrone, Maria Concetta
    PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2009, 276 (1663) : 1761 - 1769
  • [34] Hybrid approach to clustering various lengths video
    Mashtalir S.V.
    Stolbovoi M.I.
    Yakovlev S.V.
    Journal of Automation and Information Sciences, 2019, 51 (04) : 26 - 35
  • [35] Ladder oligomers of various lengths containing metallomacrocycles
    Rack, M.
    Hanack, M.
    Angewandte Chemie (International Edition in English), 1994, 33 (15-16): : 1646 - 1648
  • [36] Multimodal Dynamics: Dynamical Fusion for Trustworthy Multimodal Classification
    Han, Zongbo
    Yang, Fan
    Huang, Junzhou
    Zhang, Changqing
    Yao, Jianhua
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 20675 - 20685
  • [37] Path lengths, correlations, and centrality in temporal networks
    Pan, Raj Kumar
    Saramaki, Jari
    PHYSICAL REVIEW E, 2011, 84 (01)
  • [38] MULTIMODAL AFFECT DETECTION OF CAR DRIVERS
    Rothkrantz, Leon J. M.
    Datcu, Dragos
    Absil, Neil
    NEURAL NETWORK WORLD, 2009, 19 (03) : 293 - 305
  • [39] Fusion Mappings for Multimodal Affect Recognition
    Kaechele, Markus
    Schels, Martin
    Thiam, Patrick
    Schwenker, Friedhelm
    2015 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), 2015, : 307 - 313
  • [40] ELECTROPHYSIOLOGICAL STUDIES OF VARIOUS GRAFT LENGTHS AND LESION LENGTHS IN REPAIR OF NERVE GAPS IN PRIMATES
    KIM, DH
    CONNOLLY, SE
    GILLESPIE, JT
    VOORHIES, RM
    KLINE, DG
    JOURNAL OF NEUROSURGERY, 1991, 75 (03) : 440 - 446