Image-to-Text Conversion and Aspect-Oriented Filtration for Multimodal Aspect-Based Sentiment Analysis

被引:0
|
作者
Wang, Qianlong [1 ]
Xu, Hongling [1 ]
Wen, Zhiyuan [1 ]
Liang, Bin [1 ]
Yang, Min [2 ]
Qin, Bing [3 ]
Xu, Ruifeng [1 ,4 ,5 ]
机构
[1] Harbin Inst Technol Shenzhen, Sch Comp Sci & Technol, Shenzhen 518055, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[3] Harbin Inst Technol, Harbin 150001, Peoples R China
[4] Peng Cheng Lab, Shenzhen 518000, Peoples R China
[5] Guangdong Prov Key Lab Novel Secur Intelligence Te, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
Sentiment analysis; Visualization; Task analysis; Social networking (online); Filtration; Analytical models; Electronic mail; Aspect-Based sentiment analysis; multimodal sentiment analysis; natural language processing; pre-trained language model; CLASSIFICATION;
D O I
10.1109/TAFFC.2023.3333200
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal aspect-based sentiment analysis (MABSA) aims to determine the sentiment polarity of each aspect mentioned in the text based on multimodal content. Various approaches have been proposed to model multimodal sentiment features for each aspect via modal interactions. However, most existing approaches have two shortcomings: (1) The representation gap between textual and visual modalities may increase the risk of misalignment in modal interactions; (2) In some examples where the image is not related to the text, the visual information may not enrich the textual modality when learning aspect-based sentiment features. In such cases, blindly leveraging visual information may introduce noises in reasoning the aspect-based sentiment expressions. To tackle these shortcomings, we propose an end-to-end MABSA framework with image conversion and noise filtration. Specifically, to bridge the representation gap in different modalities, we attempt to translate images into the input space of a pre-trained language model (PLM). To this end, we develop an image-to-text conversion module that can convert an image to an implicit sequence of token embedding. Moreover, an aspect-oriented filtration module is devised to alleviate the noise in the implicit token embeddings, which consists of two attention operations. After filtering the noise, we leverage a PLM to encode the text, aspect, and image prompt derived from filtered implicit token embeddings as sentiment features to perform aspect-based sentiment prediction. Experimental results on two MABSA datasets show that our framework achieves state-of-the-art performance. Furthermore, extensive experimental analysis demonstrates the proposed framework has superior robustness and efficiency.
引用
下载
收藏
页码:1264 / 1278
页数:15
相关论文
共 50 条
  • [1] Lexical attention and aspect-oriented graph convolutional networks for aspect-based sentiment analysis
    Li, Wenwen
    Yin, Shiqun
    Pu, Ting
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (03) : 1643 - 1654
  • [2] Research on Multimodal Aspect-Based Sentiment Analysis Based on Image Caption and Multimodal Aspect Extraction
    Huang, Peng
    Tao, Jun
    Su, Tengrong
    Zhang, Xiaoqing
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 5415 - 5418
  • [3] Text-image semantic relevance identification for aspect-based multimodal sentiment analysis
    Zhang, Tianzhi
    Zhou, Gang
    Lu, Jicang
    Li, Zhibo
    Wu, Hao
    Liu, Shuo
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [4] A Survey on Multimodal Aspect-Based Sentiment Analysis
    Zhao, Hua
    Yang, Manyu
    Bai, Xueyang
    Liu, Han
    IEEE ACCESS, 2024, 12 : 12039 - 12052
  • [5] Hierarchical Interactive Multimodal Transformer for Aspect-Based Multimodal Sentiment Analysis
    Yu, Jianfei
    Chen, Kai
    Xia, Rui
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (03) : 1966 - 1978
  • [6] Optimal Target-Oriented Knowledge Transportation For Aspect-Based Multimodal Sentiment Analysis
    Zhang, Linhao
    Jin, Li
    Xu, Guangluan
    Li, Xiaoyu
    Sun, Xian
    Zhang, Zequn
    Zhang, Yanan
    Li, Qi
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2024,
  • [7] Text-Image Feature Fine-Grained Learning for Joint Multimodal Aspect-Based Sentiment Analysis
    Zhang, Tianzhi
    Zhou, Gang
    Zhang, Shuang
    Li, Shunhang
    Sun, Yepeng
    Pi, Qiankun
    Liu, Shuo
    Computers, Materials and Continua, 2025, 82 (01): : 279 - 305
  • [8] Multimodal Aspect-Based Sentiment Analysis with External Knowledge and Multi-granularity Image-Text FeaturesMultimodal Aspect-Based Sentiment Analysis with External...Z. Liu et al.
    Zhanghui Liu
    Jiali Lin
    Yuzhong Chen
    Yu Dong
    Neural Processing Letters, 57 (2)
  • [9] Aspect-based sentiment analysis using adaptive aspect-based lexicons
    Mowlaei, Mohammad Erfan
    Abadeh, Mohammad Saniee
    Keshavarz, Hamidreza
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 148
  • [10] Survey on aspect detection for aspect-based sentiment analysis
    Trusca, Maria Mihaela
    Frasincar, Flavius
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (05) : 3797 - 3846