Multi-level Multi-task representation learning with adaptive fusion for multimodal sentiment analysis

Authors
Chuanbo Zhu [1]
Min Chen [2]
Haomin Li [3]
Sheng Zhang [1]
Han Liang [1]
Chao Sun [1]
Yifan Liu [1]
Jincai Chen [1]
Affiliations
[1] Huazhong University of Science and Technology, Wuhan National Laboratory for Optoelectronics
[2] South China University of Technology, School of Computer Science and Engineering
[3] Pazhou Laboratory, School of Computer Science and Technology
[4] Huazhong University of Science and Technology, Key Laboratory of Information Storage System
[5] Ministry of Education of China
Keywords
Multimodal sentiment analysis; Multimodal adaptive fusion; Multi-level representation; Multi-task learning
DOI
10.1007/s00521-024-10678-1
Abstract
Multimodal sentiment analysis is an active task in multimodal intelligence that aims to infer a user's sentiment tendency from multimedia data. Each modality offers a distinct and necessary perspective on human sentiment, and together the modalities provide complementary and consensus information that no single modality supplies. However, heterogeneous multimedia data often contain inconsistent and conflicting sentiment semantics, which limits model performance. In this work, we propose a Multi-level Multi-task Representation Learning with Adaptive Fusion (MuReLAF) network to bridge the semantic gap among modalities. Specifically, we design a modality adaptive fusion block that adjusts modality contributions dynamically. In addition, we build a multi-level multimodal representation framework that uses a multi-task learning strategy to obtain modality-specific semantics, which carry complementary information, and modality-shared semantics, which carry consensus information. Extensive experiments on four publicly available datasets (MOSI, MOSEI, SIMS, and SIMSV2(s)) demonstrate that our model achieves superior or comparable performance to state-of-the-art models. The achieved accuracies are 86.28%, 86.07%, 84.46%, and 82.78%, respectively, improvements of 0.82%, 0.84%, 1.75%, and 1.83%. Further analyses also indicate the effectiveness of our model in sentiment analysis.
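The abstract does not say how the modality adaptive fusion block computes its weights. The sketch below is an illustration only and not the MuReLAF implementation: it assumes one common way to adjust modality contributions dynamically, namely scoring each modality's feature vector against a gate vector and normalizing the scores with a softmax. The function names, the gate vectors, and the toy feature values are all hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def adaptive_fusion(features, gate_weights):
    """Fuse per-modality feature vectors with input-dependent weights.

    features:     dict of modality name -> feature vector (equal lengths).
    gate_weights: dict of modality name -> gate vector. In a trained model
                  these would be learned; here they are hypothetical inputs.
    Returns the fused vector and the weight assigned to each modality.
    """
    names = sorted(features)
    # One scalar relevance score per modality: dot(feature, gate).
    scores = [
        sum(f * w for f, w in zip(features[n], gate_weights[n]))
        for n in names
    ]
    # Softmax turns the scores into weights that sum to 1.
    weights = softmax(scores)
    dim = len(features[names[0]])
    # Weighted sum of the modality features, dimension by dimension.
    fused = [
        sum(wt * features[n][i] for wt, n in zip(weights, names))
        for i in range(dim)
    ]
    return fused, dict(zip(names, weights))


# Toy example: the gate favors the text modality for this input.
features = {"text": [1.0, 0.0], "audio": [0.0, 1.0], "vision": [0.5, 0.5]}
gates = {"text": [1.0, 0.0], "audio": [0.0, 0.0], "vision": [0.0, 0.0]}
fused, weights = adaptive_fusion(features, gates)
print(weights)
print(fused)
```

Because the scores depend on the input features, the weights change from sample to sample, which is the sense of "dynamic" assumed here. The paper's actual block may differ in how scores are produced and at which representation level fusion is applied.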
Pages: 1491-1508
Page count: 17