Multi-task learning and mutual information maximization with crossmodal transformer for multimodal sentiment analysis

Citations: 1
Authors
Shi, Yang [1]
Cai, Jinglang [1]
Liao, Lei [1]
Affiliations
[1] Sichuan Normal Univ, Coll Phys & Elect Engn, Chengdu 610101, Peoples R China
Keywords
Multimodal sentiment analysis; Multi-task learning; Mutual information maximization; Crossmodal transformer
DOI
10.1007/s10844-024-00858-9
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The effectiveness of multimodal sentiment analysis (MSA) hinges on integrating information from diverse modalities, and the quality of modality fusion directly influences sentiment analysis accuracy. Prior methods often rely on intricate fusion strategies, which raise computational costs and can yield inaccurate multimodal representations because of distribution gaps and information redundancy across heterogeneous modalities. This paper centers on the backpropagation of loss and introduces a Transformer-based model called Multi-Task Learning and Mutual Information Maximization with Crossmodal Transformer (MMMT). To address inaccurate multimodal representation in MSA, MMMT combines mutual information maximization with a crossmodal Transformer to convey more modality-invariant information to the multimodal representation, exploiting commonalities across modalities. Notably, it uses multimodal labels for unimodal training, offering a fresh perspective on multi-task learning in MSA. Comparative experiments on the CMU-MOSI and CMU-MOSEI datasets demonstrate that MMMT improves model accuracy while reducing computational burden, making it suitable for resource-constrained applications and those requiring real-time performance. Additionally, ablation experiments validate the efficacy of multi-task learning and probe the specific impact of combining mutual information maximization with the Transformer in MSA.
Pages: 1 / 19
Page count: 19
Related papers
50 in total
  • [21] PURE: Personality-Coupled Multi-Task Learning Framework for Aspect-Based Multimodal Sentiment Analysis
    Zhang, Puning
    Fu, Miao
    Zhao, Rongjian
    Zhang, Hongbin
    Luo, Changchun
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2025, 37 (01) : 462 - 477
  • [22] A Multi-Task Learning Approach to Improve Sentiment Analysis with Explicit Recommendation
    Habimana, Olivier
    Li, Yuhua
    Li, Ruixuan
    Gu, Xiwu
    Peng, Yuqi
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020
  • [23] Sentiment Analysis and Sarcasm Detection using Deep Multi-Task Learning
    Yik Yang Tan
    Chee-Onn Chow
    Jeevan Kanesan
    Joon Huang Chuah
    YongLiang Lim
    Wireless Personal Communications, 2023, 129 : 2213 - 2237
  • [24] Sentiment Analysis and Sarcasm Detection using Deep Multi-Task Learning
    Tan, Yik Yang
    Chow, Chee-Onn
    Kanesan, Jeevan
    Chuah, Joon Huang
    Lim, YongLiang
    WIRELESS PERSONAL COMMUNICATIONS, 2023, 129 (03) : 2213 - 2237
  • [25] Paraphrase Bidirectional Transformer with Multi-Task Learning
    Ko, Bowon
    Choi, Ho-Jin
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2020), 2020 : 217 - 220
  • [26] Multi-Task Learning for Sentiment Analysis with Hard-Sharing and Task Recognition Mechanisms
    Zhang, Jian
    Yan, Ke
    Mo, Yuchang
    INFORMATION, 2021, 12 (05)
  • [27] Multimodal Representation Learning via Maximization of Local Mutual Information
    Liao, Ruizhi
    Moyer, Daniel
    Cha, Miriam
    Quigley, Keegan
    Berkowitz, Seth
    Horng, Steven
    Golland, Polina
    Wells, William M.
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2021, PT II, 2021, 12902 : 273 - 283
  • [28] Multi-Task Learning with Prior Information
    Zhang, Mengyuan
    Liu, Kai
    PROCEEDINGS OF THE 2023 SIAM INTERNATIONAL CONFERENCE ON DATA MINING, SDM, 2023 : 586 - 594
  • [29] Multi-task disagreement-reducing multimodal sentiment fusion network
    Wang, Zijun
    Jiang, Naicheng
    Chao, Xinyue
    Sun, Bin
    IMAGE AND VISION COMPUTING, 2024, 149
  • [30] A Multi-Task Learning Approach to Hate Speech Detection Leveraging Sentiment Analysis
    Plaza-Del-Arco, Flor Miriam
    Molina-Gonzalez, M. Dolores
    Urena-Lopez, L. Alfonso
    Martin-Valdivia, Maria Teresa
    IEEE ACCESS, 2021, 9 : 112478 - 112489