Multimodal Affect Classification at Various Temporal Lengths

被引：20

作者：

Kim, Jonathan C. ^{[1
]}

Clements, Mark A. ^{[1
]}

机构：

[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA

来源：

IEEE TRANSACTIONS ON AFFECTIVE COMPUTING | 2015年 / 6卷 / 04期

基金：

美国国家科学基金会;

关键词：

Audio-visual emotion recognition; classifier fusion; speech analysis; EMOTION RECOGNITION; ACOUSTIC PROFILES; VOCAL EXPRESSIONS;

D O I：

10.1109/TAFFC.2015.2411273

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Earlier studies have shown that certain emotional characteristics are best observed at different analysis-frame lengths. When features of multiple modalities are extracted, it is reasonable to believe that different temporal lengths would better model the underlying characteristics that result from different emotions. In this study, we examine the use of such differing timescales in constructing emotion classifiers. A novel fusion method is introduced that utilizes the outputs of individual classifiers that are trained using multi-dimensional inputs with multiple temporal lengths. We used the IEMOCAP database which contains audiovisual information of 10 subjects in dyadic interaction settings. The classification task was performed over three emotional dimensions: valence, activation, and dominance. The results demonstrate the utility of the multimodal-multitemporal approach. Statistically significant improvements in accuracy are seen for in all three dimensions when compared with unimodal-unitemporal classifiers.

引用

页码：371 / 384

页数：14

共 50 条

[31] Multimodal temporal pattern mining
Hong, PY
Huang, TS
16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL III, PROCEEDINGS, 2002, : 465 - 468
[32] Temporal Multimodal Multivariate Learning
Park, Hyoshin
Darko, Justice
Deshpande, Niharika
Pandey, Venktesh
Su, Hui
Ono, Masahiro
Barkely, Dedrick
Folsom, Larkin
Posselt, Derek
Chien, Steve
PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022, 2022, : 3722 - 3732
[33] Temporal mechanisms of multimodal binding
Burr, David
Silva, Ottavia
Cicchini, Guido Marco
Banks, Martin S.
Morrone, Maria Concetta
PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2009, 276 (1663) : 1761 - 1769
[34] Hybrid approach to clustering various lengths video
Mashtalir S.V.
Stolbovoi M.I.
Yakovlev S.V.
Journal of Automation and Information Sciences, 2019, 51 (04) : 26 - 35
[35] Ladder oligomers of various lengths containing metallomacrocycles
Rack, M.
Hanack, M.
Angewandte Chemie (International Edition in English), 1994, 33 (15-16): : 1646 - 1648
[36] Multimodal Dynamics: Dynamical Fusion for Trustworthy Multimodal Classification
Han, Zongbo
Yang, Fan
Huang, Junzhou
Zhang, Changqing
Yao, Jianhua
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 20675 - 20685
[37] Path lengths, correlations, and centrality in temporal networks
Pan, Raj Kumar
Saramaki, Jari
PHYSICAL REVIEW E, 2011, 84 (01)
[38] MULTIMODAL AFFECT DETECTION OF CAR DRIVERS
Rothkrantz, Leon J. M.
Datcu, Dragos
Absil, Neil
NEURAL NETWORK WORLD, 2009, 19 (03) : 293 - 305
[39] Fusion Mappings for Multimodal Affect Recognition
Kaechele, Markus
Schels, Martin
Thiam, Patrick
Schwenker, Friedhelm
2015 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI), 2015, : 307 - 313
[40] ELECTROPHYSIOLOGICAL STUDIES OF VARIOUS GRAFT LENGTHS AND LESION LENGTHS IN REPAIR OF NERVE GAPS IN PRIMATES
KIM, DH
CONNOLLY, SE
GILLESPIE, JT
VOORHIES, RM
KLINE, DG
JOURNAL OF NEUROSURGERY, 1991, 75 (03) : 440 - 446

← 1 2 3 4 5 →