Topic and Style-aware Transformer for Multimodal Emotion Recognition

Cited by: 0
Authors
Qiu, Shuwen [1]
Sekhar, Nitesh [2]
Singhal, Prateek [2]
Affiliations
[1] Univ Calif Los Angeles, Los Angeles, CA 90024 USA
[2] Amazon, Seattle, WA USA
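Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract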
Understanding emotion expressions in multi-modal signals is key for machines to have a better understanding of human communication. While language, visual and acoustic modalities can provide clues from different perspectives, the visual modality is shown to make minimal contribution to the performance in the emotion recognition field due to its high dimensionality. Therefore, we first leverage the strong multi-modality backbone VATT to project the visual signal to the common space with language and acoustic signals. Also, we propose content-oriented features Topic and Speaking style on top of it to approach the subjectivity issues. Experiments conducted on the benchmark dataset MOSEI show our model can outperform SOTA results and effectively incorporate visual signals and handle subjectivity issues by serving as content "normalization".
Pages: 2074-2082
Page count: 9
Related papers
50 in total
  • [21] Learning Mutual Correlation in Multimodal Transformer for Speech Emotion Recognition
    Wang, Yuhua
    Shen, Guang
    Xu, Yuezhu
    Li, Jiahang
    Zhao, Zhengdao
    INTERSPEECH 2021, 2021, : 4518 - 4522
  • [22] KEY-SPARSE TRANSFORMER FOR MULTIMODAL SPEECH EMOTION RECOGNITION
    Chen, Weidong
    Xing, Xiaofeng
    Xu, Xiangmin
    Yang, Jichen
    Pang, Jianxin
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6897 - 6901
  • [23] Fashion Style-Aware Embeddings for Clothing Image Retrieval
    Naka, Rino
    Katsurai, Marie
    Yanagi, Keisuke
    Goto, Ryosuke
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2022, 2022, : 49 - 53
  • [24] Emotion Recognition from Multimodal Physiological Signals for Emotion Aware Healthcare Systems
    Ayata, Deger
    Yaslan, Yusuf
    Kamasak, Mustafa E.
    JOURNAL OF MEDICAL AND BIOLOGICAL ENGINEERING, 2020, 40 (02) : 149 - 157
  • [25] Emotion Recognition from Multimodal Physiological Signals for Emotion Aware Healthcare Systems
    Değer Ayata
    Yusuf Yaslan
    Mustafa E. Kamasak
    Journal of Medical and Biological Engineering, 2020, 40 : 149 - 157
  • [26] Style-Aware Image Recommendation for Social Media Marketing
    Zhang, Yiwei
    Yamasaki, Toshihiko
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 3106 - 3114
  • [27] A Style-Aware Content Loss for Real-Time HD Style Transfer
    Sanakoyeu, Artsiom
    Kotovenko, Dmytro
    Lang, Sabine
    Ommer, Bjoern
    COMPUTER VISION - ECCV 2018, PT VIII, 2018, 11212 : 715 - 731
  • [28] Topic-Driven and Knowledge-Aware Transformer for Dialogue Emotion Detection
    Zhu, Lixing
    Pergola, Gabriele
    Gui, Lin
    Zhou, Deyu
    He, Yulan
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021), 2021, : 1571 - 1582
  • [29] PROTOTYPE-TO-STYLE: Dialogue Generation With Style-Aware Editing on Retrieval Memory
    Su, Yixuan
    Wang, Yan
    Cai, Deng
    Baker, Simon
    Korhonen, Anna
    Collier, Nigel
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 : 2152 - 2161
  • [30] Hierarchical Style-Aware Domain Generalization for Remote Physiological Measurement
    Wang, Jiyao
    Lu, Hao
    Wang, Ange
    Chen, Yingcong
    He, Dengbo
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (03) : 1635 - 1643