Topic and Style-aware Transformer for Multimodal Emotion Recognition

被引：0

作者：

Qiu, Shuwen ^{[1
]}

Sekhar, Nitesh ^{[2
]}

Singhal, Prateek ^{[2
]}

机构：

[1] Univ Calif Los Angeles, Los Angeles, CA 90024 USA

[2] Amazon, Seattle, WA USA

来源：

FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023 | 2023年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Understanding emotion expressions in multi-modal signals is key for machines to have a better understanding of human communication. While language, visual and acoustic modalities can provide clues from different perspectives, the visual modality is shown to make minimal contribution to the performance in the emotion recognition field due to its high dimensionality. Therefore, we first leverage the strong multi-modality backbone VATT to project the visual signal to the common space with language and acoustic signals. Also, we propose content-oriented features Topic and Speaking style on top of it to approach the subjectivity issues. Experiments conducted on the benchmark dataset MOSEI show our model can outperform SOTA results and effectively incorporate visual signals and handle subjectivity issues by serving as content "normalization".

引用

页码：2074 / 2082

页数：9

共 50 条

[21] Learning Mutual Correlation in Multimodal Transformer for Speech Emotion Recognition
Wang, Yuhua
Shen, Guang
Xu, Yuezhu
Li, Jiahang
Zhao, Zhengdao
INTERSPEECH 2021, 2021, : 4518 - 4522
[22] KEY-SPARSE TRANSFORMER FOR MULTIMODAL SPEECH EMOTION RECOGNITION
Chen, Weidong
Xing, Xiaofeng
Xu, Xiangmin
Yang, Jichen
Pang, Jianxin
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6897 - 6901
[23] Fashion Style-Aware Embeddings for Clothing Image Retrieval
Naka, Rino
Katsurai, Marie
Yanagi, Keisuke
Goto, Ryosuke
PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2022, 2022, : 49 - 53
[24] Emotion Recognition from Multimodal Physiological Signals for Emotion Aware Healthcare Systems
Ayata, Deger
Yaslan, Yusuf
Kamasak, Mustafa E.
JOURNAL OF MEDICAL AND BIOLOGICAL ENGINEERING, 2020, 40 (02) : 149 - 157
[25] Emotion Recognition from Multimodal Physiological Signals for Emotion Aware Healthcare Systems
Değer Ayata
Yusuf Yaslan
Mustafa E. Kamasak
Journal of Medical and Biological Engineering, 2020, 40 : 149 - 157
[26] Style-Aware Image Recommendation for Social Media Marketing
Zhang, Yiwei
Yamasaki, Toshihiko
PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 3106 - 3114
[27] A Style-Aware Content Loss for Real-Time HD Style Transfer
Sanakoyeu, Artsiom
Kotovenko, Dmytro
Lang, Sabine
Ommer, Bjoern
COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 : 715 - 731
[28] Topic-Driven and Knowledge-Aware Transformer for Dialogue Emotion Detection
Zhu, Lixing
Pergola, Gabriele
Gui, Lin
Zhou, Deyu
He, Yulan
59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 1571 - 1582
[29] PROTOTYPE-TO-STYLE: Dialogue Generation With Style-Aware Editing on Retrieval Memory
Su, Yixuan
Wang, Yan
Cai, Deng
Baker, Simon
Korhonen, Anna
Collier, Nigel
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 2152 - 2161
[30] Hierarchical Style-Aware Domain Generalization for Remote Physiological Measurement
Wang, Jiyao
Lu, Hao
Wang, Ange
Chen, Yingcong
He, Dengbo
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (03) : 1635 - 1643

← 1 2 3 4 5 →