Uncertainty-Aware Gradient Modulation and Feature Masking for Multimodal Sentiment Analysis

被引:0
|
作者
Wu, Yuxian [1 ]
Wang, Chengji [1 ]
Li, Jingzhe [1 ]
Zhang, Wenjing [1 ]
Jiang, Xingpeng [1 ]
机构
[1] Cent China Normal Univ, Sch Comp Sci, Natl Language Resources Monitoring & Res Ctr Netw, Hubei Prov Key Lab Artificial Intelligence & Smar, Wuhan, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Multimodal sentiment analysis; Modal uncertainty; Gradient modulation; Feature masking;
D O I
10.1007/978-981-97-8795-1_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal Sentiment Analysis (MSA) aims to analyze the attitudes of speakers from video content. Previous methods focus on exploring consistent and cross-modal sentiment representations by multimodal interactions, they treat each modality equally. However, modalities are incomplete and uncertain, e.g., noise, semantic ambiguity. Modality with low uncertainty contributes more to the final loss, suppressing the optimization of modalities with high uncertainties. To address this problem, we propose a new Uncertainty-aware Gradient modulation and Feature masking model (UGF) for MSA, which aims to assist optimization of modalities with high uncertainty. We propose a novel modal uncertainty estimation method, which considers both the intra- and inter-modality consistency to estimate modal uncertainty. We improve the model by two aspects: First, we design a dynamic gradient modulation module (DGM) to amend the optimization process of each modality, it dynamically modulates the gradients of modality encoders according to their uncertainties. Second, we propose a uncertainty guided feature masking (UFM), it adaptively adds noise to the deterministic modality, making model pay more attention on uncertain modalities. We conducted extensive experiments on three popular datasets, e.g., MOSI, MOSEI and CH-SIMS. Experimental results show that our propose UGF achieves competitive results, the ablation studies demonstrate the effectiveness of the proposed components.
引用
收藏
页码:321 / 335
页数:15
相关论文
共 50 条
  • [21] Mutual Information-calibrated Conformal Feature Fusion for Uncertainty-Aware Multimodal 3D Object Detection at the Edge
    Stunts, Alex C.
    Erricolo, Danilo
    Ravi, Sathya
    Tulabandhula, Theja
    Trivedi, Amit Ranjan
    2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2024, 2024, : 2029 - 2035
  • [22] M3: MultiModal Masking applied to sentiment analysis
    Georgiou, Efthymios
    Paraskevopoulos, Georgios
    Potamianos, Alexandros
    INTERSPEECH 2021, 2021, : 2876 - 2880
  • [23] MAP: Multimodal Uncertainty-Aware Vision-Language Pre-training Model
    Ji, Yatai
    Wang, Junjie
    Gong, Yuan
    Zhang, Lin
    Zhu, Yanru
    Wang, Hongfa
    Zhang, Jiaxing
    Sakai, Tetsuya
    Yang, Yujiu
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 23262 - 23271
  • [24] Uncertainty-Aware Multimodal Trajectory Prediction via a Single Inference from a Single Model
    Suk, Ho
    Kim, Shiho
    SENSORS, 2025, 25 (01)
  • [25] Uncertainty-Aware Dynamic Reliability Analysis Framework for Complex Systems
    Kabir, Sohag
    Yazdi, Mohammad
    Aizpurua, Jose Ignacio
    Papadopoulos, Yiannis
    IEEE ACCESS, 2018, 6 : 29499 - 29515
  • [26] Aspect-aware semantic feature enhanced networks for multimodal aspect-based sentiment analysis
    Zeng, Biqing
    Xie, Liangqi
    Li, Ruizhe
    Yao, Yongtao
    Li, Ruiyuan
    Deng, Huimin
    JOURNAL OF SUPERCOMPUTING, 2025, 81 (01):
  • [27] ConFEDE: Contrastive Feature Decomposition for Multimodal Sentiment Analysis
    Yang, Jiuding
    Yu, Yakun
    Niu, Di
    Guo, Weidong
    Xu, Yu
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 7617 - 7630
  • [28] Length Uncertainty-Aware Graph Contrastive Fusion Network for multimodal physiological signal emotion recognition
    Li, Guangqiang
    Chen, Ning
    Zhu, Hongqing
    Li, Jing
    Xu, Zhangyong
    Zhu, Zhiying
    NEURAL NETWORKS, 2025, 187
  • [29] COLD Fusion: Calibrated and Ordinal Latent Distribution Fusion for Uncertainty-Aware Multimodal Emotion Recognition
    Tellamekala, Mani Kumar
    Amiriparian, Shahin
    Schuller, Bjorn W.
    Andre, Elisabeth
    Giesbrecht, Timo
    Valstar, Michel
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (02) : 805 - 822
  • [30] Uncertainty-Aware Contrastive Learning for Semi-Supervised Classification of Multimodal Remote Sensing Images
    Ding, Kexin
    Lu, Ting
    Li, Shutao
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 13