A Deep Multi-level Attentive Network for Multimodal Sentiment Analysis

被引:22
|
作者
Yadav, Ashima [1 ]
Vishwakarma, Dinesh Kumar [2 ]
机构
[1] Bennett Univ, Dept Comp Sci & Engn, Plot 8-11,Tech Zone 2, Greater Noida 201310, Uttar Pradesh, India
[2] Delhi Technol Univ, Dept Informat Technol, Bawana Rd, New Delhi 110042, India
关键词
Attention; deep learning; multimodal analysis; sentiment analysis; FUSION;
D O I
10.1145/3517139
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multimodal sentiment analysis has attracted increasing attention with broad application prospects. Most of the existing methods have focused on a single modality, which fails to handle social media data due to its multiple modalities. Moreover, in multimodal learning, most of the works have focused on simply combining the two modalities without exploring the complicated correlations between them. This resulted in dissatisfying performance for multimodal sentiment classification. Motivated by the status quo, we propose a Deep Multi-level Attentive network (DMLANet), which exploits the correlation between image and text modalities to improve multimodal learning. Specifically, we generate the bi-attentive visual map along the spatial and channel dimensions to magnify Convolutional neural network representation power. Then, we model the correlation between the image regions and semantics of the word by extracting the textual features related to the bi-attentive visual features by applying semantic attention. Finally, self-attention is employed to fetch the sentiment-rich multimodal features for the classification automatically. We conduct extensive evaluations on four real-world datasets, namely, MVSA-Single, MVSA-Multiple, Flickr, and Getty Images, which verify our method's superiority.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Multi-Level Attention Map Network for Multimodal Sentiment Analysis
    Xue, Xiaojun
    Zhang, Chunxia
    Niu, Zhendong
    Wu, Xindong
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (05) : 5105 - 5118
  • [2] A Multi-step Attention and Multi-level Structure Network for Multimodal Sentiment Analysis
    Zhang, Chuanlei
    Zhao, Hongwei
    Wang, Bo
    Wang, Wei
    Ke, Ting
    Li, Jianrong
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2022, PT I, 2022, 13551 : 723 - 735
  • [3] Multi-level fusion with deep neural networks for multimodal sentiment classification
    Zhang Guangwei
    Zhao Bing
    Li Ruifan
    [J]. The Journal of China Universities of Posts and Telecommunications, 2022, 29 (03) : 25 - 33
  • [4] Multi-level Multiple Attentions for Contextual Multimodal Sentiment Analysis
    Poria, Soujanya
    Cambria, Erik
    Hazarika, Devamanyu
    Mazumder, Navonil
    Zadeh, Amir
    Morency, Louis-Philippe
    [J]. 2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2017, : 1033 - 1038
  • [5] MULTI-CHANNEL ATTENTIVE GRAPH CONVOLUTIONAL NETWORK WITH SENTIMENT FUSION FOR MULTIMODAL SENTIMENT ANALYSIS
    Xiao, Luwei
    Wu, Xingjiao
    Wu, Wen
    Yang, Jing
    He, Liang
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 4578 - 4582
  • [6] Multi-Level Context Pyramid Network for Visual Sentiment Analysis
    Ou, Haochun
    Qing, Chunmei
    Xu, Xiangmin
    Jin, Jianxiu
    [J]. SENSORS, 2021, 21 (06) : 1 - 20
  • [7] Multi-level graph neural network for text sentiment analysis
    Liao, Wenxiong
    Zeng, Bi
    Liu, Jianqi
    Wei, Pengfei
    Cheng, Xiaochun
    Zhang, Weiwen
    [J]. COMPUTERS & ELECTRICAL ENGINEERING, 2021, 92
  • [8] Multi-level Deep Correlative Networks for Multi-modal Sentiment Analysis
    CAI Guoyong
    LYU Guangrui
    LIN Yuming
    WEN Yimin
    [J]. Chinese Journal of Electronics, 2020, 29 (06) : 1025 - 1038
  • [9] Multi-level textual-visual alignment and fusion network for multimodal aspect-based sentiment analysis
    Li, You
    Ding, Han
    Lin, Yuming
    Feng, Xinyu
    Chang, Liang
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2024, 57 (04)
  • [10] Multi-level textual-visual alignment and fusion network for multimodal aspect-based sentiment analysis
    You Li
    Han Ding
    Yuming Lin
    Xinyu Feng
    Liang Chang
    [J]. Artificial Intelligence Review, 57