Scanning, attention, and reasoning multimodal content for sentiment analysis

Cited by: 5
Authors
Liu, Yun [1]
Li, Zhoujun [2]
Zhou, Ke [1]
Zhang, Leilei [1]
Li, Lang [1]
Tian, Peng [1]
Shen, Shixun [1]
Affiliations
[1] Moutai Inst, Dept Automat, Renhuai 564507, Guizhou Provinc, Peoples R China
[2] Beihang Univ, Sch Comp Sci & Engn, State Key Lab Software Dev Environm, Beijing 100191, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Multimodal sentiment analysis; Attention; Reasoning; Fusion
DOI
10.1016/j.knosys.2023.110467
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The rise of social networks has provided people with platforms to display their lives and emotions, often in multimodal forms such as images and descriptive texts. Capturing the emotions embedded in this multimodal content poses considerable research challenges and has practical value. Existing methods usually make sentiment predictions based on a single-round reasoning process with multimodal attention networks; however, this may be insufficient for tasks that require deep understanding and complex reasoning. To effectively comprehend multimodal content and predict the correct sentiment tendencies, we propose the Scanning, Attention, and Reasoning (SAR) model for multimodal sentiment analysis. Specifically, a perceptual scanning model is designed to roughly perceive the image and text content, as well as the intrinsic correlation between them. To deeply understand the complementary features between images and texts, an intensive attention model is proposed for cross-modal feature association learning. The multimodal joint features from the scanning and attention models are fused as the representation of a multimodal node in the social network. A heterogeneous reasoning model implemented with a graph neural network is constructed to capture the influence of network communication in social networks and make sentiment predictions. Extensive experiments conducted on three benchmark datasets confirm the effectiveness of our model and its superiority over state-of-the-art methods. (c) 2023 Elsevier B.V. All rights reserved.
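The data flow described in the abstract (coarse scanning, cross-modal attention, fusion into a node representation, then propagation over a graph) can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract gives no equations, so mean pooling stands in for the scanning model, plain scaled dot-product attention stands in for the intensive attention model, concatenation stands in for fusion, and a single round of neighbour averaging stands in for the heterogeneous graph neural network. All function names and the toy feature vectors are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def mean_pool(vectors):
    """Average a non-empty list of equal-length vectors."""
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(len(vectors[0]))]


def cross_attend(queries, keys_values):
    """Scaled dot-product attention: each query attends over the other modality."""
    d = len(queries[0])
    attended = []
    for q in queries:
        weights = softmax([dot(q, k) / math.sqrt(d) for k in keys_values])
        attended.append(
            [sum(w * v[i] for w, v in zip(weights, keys_values)) for i in range(d)]
        )
    return attended


def post_representation(text_feats, image_feats):
    """Assumed fusion: concatenate coarse ("scan") and attended features."""
    # Stand-in for the scanning model: pooled unimodal features.
    scan = mean_pool(text_feats) + mean_pool(image_feats)
    # Stand-in for the attention model: each modality attends to the other.
    attn = mean_pool(cross_attend(text_feats, image_feats)) + mean_pool(
        cross_attend(image_feats, text_feats)
    )
    return scan + attn  # list concatenation


def propagate(node_feats, edges):
    """One round of mean aggregation over each node and its neighbours."""
    neighbours = {name: [name] for name in node_feats}
    for a, b in edges:
        neighbours[a].append(b)
        neighbours[b].append(a)
    return {
        name: mean_pool([node_feats[m] for m in members])
        for name, members in neighbours.items()
    }


# Toy example: two posts, each with 2-dimensional text and image features.
posts = {
    "p1": post_representation([[1.0, 0.0], [0.0, 1.0]], [[1.0, 1.0]]),
    "p2": post_representation([[0.5, 0.5]], [[0.0, 1.0], [1.0, 0.0]]),
}
updated = propagate(posts, [("p1", "p2")])
```

In a full model, a classifier over the propagated node vectors would produce the sentiment label; the sketch stops at the node representations because the abstract does not specify the prediction head.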
Pages: 11