Image-to-Text Conversion and Aspect-Oriented Filtration for Multimodal Aspect-Based Sentiment Analysis

被引:0
|
作者
Wang, Qianlong [1 ]
Xu, Hongling [1 ]
Wen, Zhiyuan [1 ]
Liang, Bin [1 ]
Yang, Min [2 ]
Qin, Bing [3 ]
Xu, Ruifeng [1 ,4 ,5 ]
机构
[1] Harbin Inst Technol Shenzhen, Sch Comp Sci & Technol, Shenzhen 518055, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen 518055, Peoples R China
[3] Harbin Inst Technol, Harbin 150001, Peoples R China
[4] Peng Cheng Lab, Shenzhen 518000, Peoples R China
[5] Guangdong Prov Key Lab Novel Secur Intelligence Te, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
Sentiment analysis; Visualization; Task analysis; Social networking (online); Filtration; Analytical models; Electronic mail; Aspect-Based sentiment analysis; multimodal sentiment analysis; natural language processing; pre-trained language model; CLASSIFICATION;
D O I
10.1109/TAFFC.2023.3333200
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multimodal aspect-based sentiment analysis (MABSA) aims to determine the sentiment polarity of each aspect mentioned in the text based on multimodal content. Various approaches have been proposed to model multimodal sentiment features for each aspect via modal interactions. However, most existing approaches have two shortcomings: (1) The representation gap between textual and visual modalities may increase the risk of misalignment in modal interactions; (2) In some examples where the image is not related to the text, the visual information may not enrich the textual modality when learning aspect-based sentiment features. In such cases, blindly leveraging visual information may introduce noises in reasoning the aspect-based sentiment expressions. To tackle these shortcomings, we propose an end-to-end MABSA framework with image conversion and noise filtration. Specifically, to bridge the representation gap in different modalities, we attempt to translate images into the input space of a pre-trained language model (PLM). To this end, we develop an image-to-text conversion module that can convert an image to an implicit sequence of token embedding. Moreover, an aspect-oriented filtration module is devised to alleviate the noise in the implicit token embeddings, which consists of two attention operations. After filtering the noise, we leverage a PLM to encode the text, aspect, and image prompt derived from filtered implicit token embeddings as sentiment features to perform aspect-based sentiment prediction. Experimental results on two MABSA datasets show that our framework achieves state-of-the-art performance. Furthermore, extensive experimental analysis demonstrates the proposed framework has superior robustness and efficiency.
引用
下载
收藏
页码:1264 / 1278
页数:15
相关论文
共 50 条
  • [11] Aspect-Level Sentiment Analysis through Aspect-Oriented Features
    Busst M.B.M.A.
    Anbananthen K.S.M.
    Kannan S.
    HighTech and Innovation Journal, 2024, 5 (01): : 109 - 128
  • [12] Survey on aspect detection for aspect-based sentiment analysis
    Maria Mihaela Truşcǎ
    Flavius Frasincar
    Artificial Intelligence Review, 2023, 56 : 3797 - 3846
  • [13] Aspect-Based Sentiment Analysis Using Aspect Map
    Noh, Yunseok
    Park, Seyoung
    Park, Seong-Bae
    APPLIED SCIENCES-BASEL, 2019, 9 (16):
  • [14] Combining transfer and ensemble learning models for image and text aspect-based sentiment analysis
    Amit Chauhan
    Rajni Mohana
    International Journal of System Assurance Engineering and Management, 2025, 16 (3) : 1001 - 1019
  • [15] Sentiment Difficulty in Aspect-Based Sentiment Analysis
    Chifu, Adrian-Gabriel
    Fournier, Sebastien
    MATHEMATICS, 2023, 11 (22)
  • [16] Aspect-aware semantic feature enhanced networks for multimodal aspect-based sentiment analysis
    Zeng, Biqing
    Xie, Liangqi
    Li, Ruizhe
    Yao, Yongtao
    Li, Ruiyuan
    Deng, Huimin
    Journal of Supercomputing, 2025, 81 (01):
  • [17] Aspect Is Not You Need: No-aspect Differential Sentiment Framework for Aspect-based Sentiment Analysis
    Cao, Jiahao
    Liu, Rui
    Peng, Huailiang
    Jiang, Lei
    Bai, Xu
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 1599 - 1609
  • [18] Aspect-Based Sentiment Analysis Model of Multimodal Collaborative Contrastive Learning
    Yu, Bengong
    Xing, Yu
    Zhang, Shuwen
    Data Analysis and Knowledge Discovery, 2024, 8 (11) : 22 - 32
  • [19] Visual Enhancement Capsule Network for Aspect-based Multimodal Sentiment Analysis
    Zhang, Yifei
    Zhang, Zhiqing
    Feng, Shi
    Wang, Daling
    APPLIED SCIENCES-BASEL, 2022, 12 (23):
  • [20] Target-Aspect-Sentiment Joint Detection for Aspect-Based Sentiment Analysis
    Wan, Hai
    Yang, Yufei
    Du, Jianfeng
    Liu, Yanan
    Qi, Kunxun
    Pan, Jeff Z.
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 9122 - 9129