Hierarchical Interactive Multimodal Transformer for Aspect-Based Multimodal Sentiment Analysis

Cited by: 26
Authors
Yu, Jianfei [1]
Chen, Kai [1]
Xia, Rui [1]
Affiliations
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
Keywords
Fine-grained opinion mining; aspect-based sentiment analysis; multimodal sentiment analysis; ATTENTION; NETWORK
DOI
10.1109/TAFFC.2022.3171091
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Aspect-based multimodal sentiment analysis (ABMSA) aims to determine the sentiment polarity of each aspect or entity mentioned in a multimodal post or review. Previous studies on ABMSA can be summarized into two subtasks: aspect-term based multimodal sentiment classification (ATMSC) and aspect-category based multimodal sentiment classification (ACMSC). However, these existing studies have three shortcomings: (1) ignoring the object-level semantics in images; (2) primarily focusing on aspect-text and aspect-image interactions; (3) failing to consider the semantic gap between text and image representations. To tackle these issues, we propose a general Hierarchical Interactive Multimodal Transformer (HIMT) model for ABMSA. Specifically, we extract salient features with semantic concepts from images via an object detection method, and then propose a hierarchical interaction module to first model the aspect-text and aspect-image interactions, followed by capturing the text-image interactions. Moreover, an auxiliary reconstruction module is devised to largely eliminate the semantic gap between text and image representations. Experimental results show that our HIMT model significantly outperforms state-of-the-art methods on two benchmarks for ATMSC and one benchmark for ACMSC.
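The two-stage ordering described in the abstract (aspect-text and aspect-image interactions first, text-image interactions second) can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes plain scaled dot-product attention with no learned projections, no multi-head structure, and no reconstruction module, and every function name here (`cross_attend`, `hierarchical_interaction`, `random_vectors`) is hypothetical. Random vectors stand in for the aspect tokens, text tokens, and detected-object features.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attend(queries, memory):
    """Scaled dot-product attention; `memory` serves as both keys and values."""
    dim = len(queries[0])
    attended = []
    for q in queries:
        scores = [sum(a * b for a, b in zip(q, m)) / math.sqrt(dim) for m in memory]
        weights = softmax(scores)
        attended.append(
            [sum(w * m[i] for w, m in zip(weights, memory)) for i in range(dim)]
        )
    return attended


def mean_pool(vectors):
    """Average a list of equal-length vectors into a single vector."""
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def hierarchical_interaction(aspect, text, objects):
    """Hypothetical two-stage interaction (an assumption, not the paper's code)."""
    # Stage 1: the aspect attends separately to the text and to the image objects.
    aspect_text = cross_attend(aspect, text)
    aspect_image = cross_attend(aspect, objects)
    # Stage 2: the two aspect-aware representations attend to each other.
    text_to_image = cross_attend(aspect_text, aspect_image)
    image_to_text = cross_attend(aspect_image, aspect_text)
    # Concatenate the pooled views into one fused feature vector.
    return mean_pool(text_to_image) + mean_pool(image_to_text)


def random_vectors(count, dim, rng):
    """Stand-in features; a real system would use encoder and detector outputs."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(dim)] for _ in range(count)]


rng = random.Random(0)
fused = hierarchical_interaction(
    aspect=random_vectors(2, 4, rng),   # 2 aspect tokens
    text=random_vectors(6, 4, rng),     # 6 text tokens
    objects=random_vectors(3, 4, rng),  # 3 detected objects
)
print(len(fused))
```

The fused vector has twice the feature dimension because the text-to-image and image-to-text views are concatenated; in a trained model this vector would typically feed a sentiment classifier, which is omitted here.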
Pages: 1966-1978
Number of pages: 13