Uncertainty-Aware Gradient Modulation and Feature Masking for Multimodal Sentiment Analysis

被引:0
|
作者
Wu, Yuxian [1 ]
Wang, Chengji [1 ]
Li, Jingzhe [1 ]
Zhang, Wenjing [1 ]
Jiang, Xingpeng [1 ]
机构
[1] Cent China Normal Univ, Sch Comp Sci, Natl Language Resources Monitoring & Res Ctr Netw, Hubei Prov Key Lab Artificial Intelligence & Smar, Wuhan, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Multimodal sentiment analysis; Modal uncertainty; Gradient modulation; Feature masking;
D O I
10.1007/978-981-97-8795-1_22
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal Sentiment Analysis (MSA) aims to analyze the attitudes of speakers from video content. Previous methods focus on exploring consistent and cross-modal sentiment representations by multimodal interactions, they treat each modality equally. However, modalities are incomplete and uncertain, e.g., noise, semantic ambiguity. Modality with low uncertainty contributes more to the final loss, suppressing the optimization of modalities with high uncertainties. To address this problem, we propose a new Uncertainty-aware Gradient modulation and Feature masking model (UGF) for MSA, which aims to assist optimization of modalities with high uncertainty. We propose a novel modal uncertainty estimation method, which considers both the intra- and inter-modality consistency to estimate modal uncertainty. We improve the model by two aspects: First, we design a dynamic gradient modulation module (DGM) to amend the optimization process of each modality, it dynamically modulates the gradients of modality encoders according to their uncertainties. Second, we propose a uncertainty guided feature masking (UFM), it adaptively adds noise to the deterministic modality, making model pay more attention on uncertain modalities. We conducted extensive experiments on three popular datasets, e.g., MOSI, MOSEI and CH-SIMS. Experimental results show that our propose UGF achieves competitive results, the ablation studies demonstrate the effectiveness of the proposed components.
引用
收藏
页码:321 / 335
页数:15
相关论文
共 50 条
  • [41] MahaEmoSen: Towards Emotion-aware Multimodal Marathi Sentiment Analysis
    Chaudhari, Prasad
    Nandeshwar, Pankaj
    Bansal, Shubhi
    Kumar, Nagendra
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (09)
  • [42] Can Clicks Be Both Labels and Features? Unbiased Behavior Feature Collection and Uncertainty-aware Learning to Rank
    Yang, Tao
    Luo, Chen
    Lu, Hanging
    Gupta, Parth
    Yin, Bing
    Ai, Qingyao
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 6 - 17
  • [43] Uncertainty-Aware Deep Neural Representations for Visual Analysis of Vector Field Data
    Kumar, Atul
    Garg, Siddharth
    Dutta, Soumya
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2025, 31 (01) : 1343 - 1353
  • [44] Accurate, Uncertainty-Aware Classification of Molecular Chemical Motifs from Multimodal X-ray Absorption Spectroscopy
    Carbone, Matthew R.
    Maffettone, Phillip M.
    Qu, Xiaohui
    Yoo, Shinjae
    Lu, Deyu
    JOURNAL OF PHYSICAL CHEMISTRY A, 2024, 128 (10): : 1948 - 1957
  • [45] A multimodal feature learning approach for sentiment analysis of social network multimedia
    Baecchi, Claudio
    Uricchio, Tiberio
    Bertini, Marco
    Del Bimbo, Alberto
    MULTIMEDIA TOOLS AND APPLICATIONS, 2016, 75 (05) : 2507 - 2525
  • [46] Feature-guided Multimodal Sentiment Analysis towards Industry 4.0
    Yu, Bihui
    Wei, Jingxuan
    Yu, Bo
    Cai, Xingye
    Wang, Ke
    Sun, Huajun
    Bu, Liping
    Chen, Xiaowei
    COMPUTERS & ELECTRICAL ENGINEERING, 2022, 100
  • [47] Sentiment analysis based on text information enhancement and multimodal feature fusion
    Liu, Zijun
    Cai, Li
    Yang, Wenjie
    Liu, Junhui
    PATTERN RECOGNITION, 2024, 156
  • [48] A multimodal feature learning approach for sentiment analysis of social network multimedia
    Claudio Baecchi
    Tiberio Uricchio
    Marco Bertini
    Alberto Del Bimbo
    Multimedia Tools and Applications, 2016, 75 : 2507 - 2525
  • [49] Meta Noise Adaption Framework for Multimodal Sentiment Analysis With Feature Noise
    Yuan, Ziqi
    Zhang, Baozheng
    Xu, Hua
    Gao, Kai
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 7265 - 7277
  • [50] A short video sentiment analysis model based on multimodal feature fusion
    Shi, Hongyu
    SYSTEMS AND SOFT COMPUTING, 2024, 6