Multi-level Multi-task representation learning with adaptive fusion for multimodal sentiment analysis

被引：0

作者：

Chuanbo Zhu ^{[1
]}

Min Chen ^{[2
]}

Haomin Li ^{[3
]}

Sheng Zhang ^{[1
]}

Han Liang ^{[1
]}

Chao Sun ^{[1
]}

Yifan Liu ^{[1
]}

Jincai Chen ^{[1
]}

机构：

[1] Huazhong University of Science and Technology,Wuhan National Laboratory for Optoelectronics

[2] South China University of Technology,School of Computer Science and Engineering

[3] Pazhou Laboratory,School of Computer Science and Technology

[4] Huazhong University of Science and Technology,Key Laboratory of Information Storage System

[5] Ministry of Education of China,undefined

来源：

Neural Computing and Applications | 2025年 / 37卷 / 3期

关键词：

Multimodal sentiment analysis; Multimodal adaptive fusion; Multi-level representation; Multi-task learning;

D O I：

10.1007/s00521-024-10678-1

中图分类号：

学科分类号：

摘要：

Multimodal sentiment analysis is an active task in multimodal intelligence, which aims to compute the user’s sentiment tendency from multimedia data. Generally, each modality is a specific and necessary perspective to express human sentiment, providing complementary and consensus information unavailable in a single modality. Nevertheless, the heterogeneous multimedia data often contain inconsistent and conflicting sentiment semantics that limits the model performance. In this work, we propose a Multi-level Multi-task Representation Learning with Adaptive Fusion (MuReLAF) network to bridge the semantic gap among different modalities. Specifically, we design a modality adaptive fusion block to adjust modality contributions dynamically. Besides, we build a multi-level multimodal representations framework to obtain modality-specific and modality-shared semantics by the multi-task learning strategy, where modality-specific semantics contain complementary information and modality-shared semantics include consensus information. Extensive experiments are conducted on four publicly available datasets: MOSI, MOSEI, SIMS, and SIMSV2(s), demonstrating that our model exhibits superior or comparable performance to state-of-the-art models. The achieved accuracies are 86.28%, 86.07%, 84.46%, and 82.78%, respectively, showcasing improvements of 0.82%, 0.84%, 1.75%, and 1.83%. Further analyses also indicate the effectiveness of our model in sentiment analysis.

引用

页码：1491 / 1508

页数：17

共 50 条

[21] Multi-level Multiple Attentions for Contextual Multimodal Sentiment Analysis
Poria, Soujanya
Cambria, Erik
Hazarika, Devamanyu
Mazumder, Navonil
Zadeh, Amir
Morency, Louis-Philippe
2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2017, : 1033 - 1038
[22] A Deep Multi-level Attentive Network for Multimodal Sentiment Analysis
Yadav, Ashima
Vishwakarma, Dinesh Kumar
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (01)
[23] Multi-Level Attention Map Network for Multimodal Sentiment Analysis
Xue, Xiaojun
Zhang, Chunxia
Niu, Zhendong
Wu, Xindong
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (05) : 5105 - 5118
[24] Multi-task aquatic toxicity prediction model based on multi-level features fusion
Yang, Xin
Sun, Jianqiang
Jin, Bingyu
Lu, Yuer
Cheng, Jinyan
Jiang, Jiaju
Zhao, Qi
Shuai, Jianwei
JOURNAL OF ADVANCED RESEARCH, 2025, 68 : 477 - 489
[25] Low-rank tensor fusion and self-supervised multi-task multimodal sentiment analysis
Miao, Xinmeng
Zhang, Xuguang
Zhang, Haoran
MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (23) : 63291 - 63308
[26] Multi-task Learning for Multi-modal Emotion Recognition and Sentiment Analysis
Akhtar, Md Shad
Chauhan, Dushyant Singh
Ghosal, Deepanway
Poria, Soujanya
Ekbal, Asif
Bhattacharyya, Pushpak
2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, 2019, : 370 - 379
[27] Multi-task & Multi-modal Sentiment Analysis Model Based on Aware Fusion
Wu S.
Ma J.
Data Analysis and Knowledge Discovery, 2023, 7 (10) : 74 - 84
[28] Multi-task Learning for Automated Essay Scoring with Sentiment Analysis
Muangkammuen, Panitan
Fukumoto, Fumiyo
AACL-IJCNLP 2020: THE 1ST CONFERENCE OF THE ASIA-PACIFIC CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 10TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2020, : 109 - 116
[29] Imbalanced Sentiment Classification with Multi-Task Learning
Wu, Fangzhao
Wu, Chuhan
Liu, Junxin
CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 1631 - 1634
[30] Active Multi-Task Representation Learning
Chen, Yifang
Du, Simon S.
Jamieson, Kevin
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,

← 1 2 3 4 5 →