Multi-modal sarcasm detection based on Multi-Channel Enhanced Fusion model

被引:7
|
作者
Fang, Hong [1 ]
Liang, Dahao [2 ]
Xiang, Weiyu [2 ]
机构
[1] Shanghai Polytech Univ, Sch Math Phys & Stat, Shanghai 201209, Peoples R China
[2] Shanghai Polytech Univ, Inst Artificial Intelligence, Sch Comp & Informat Engn, Shanghai 201209, Peoples R China
关键词
Multi-modal sarcasm detection; Attention mechanism; Feature fusion;
D O I
10.1016/j.neucom.2024.127440
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The voluminous quantity of data accessible on social media platforms offers insight into the sentiment disposition of individual users, where multi -modal sarcasm detection is often confounding. Existing sarcasm detection methods use different information fusion methods to combine information from different modalities but ignore hidden information within modalities and inconsistent information between modalities. Discovering the implicit information within the modalities and strengthening the information interaction between modalities is still an important challenge. In this paper, we propose a Multi -Channel Enhanced Fusion (MCEF) model for cross -modal sarcasm detection to maximize the information extraction between different modalities. Specifically, text extracted from images acts as a new modality in the front-end fusion models to augment the utilization of image semantic information. Then, we propose a novel bipolar semantic attention mechanism to uncover the inconsistencies among different modal features. Furthermore, a decision -level fusion strategy from a new perspective is devised based on four models to achieve multi -channel fusion, each with a distinct focus, to leverage their advantages and mitigate the limitations. Extensive experiments demonstrate that our model surpasses current state-of-the-art models in multi -modal sarcasm detection.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Modeling inter-modal incongruous sentiment expressions for multi-modal sarcasm detection
    Ou, Lisong
    Li, Zhixin
    NEUROCOMPUTING, 2025, 616
  • [22] Multi-Modal Sarcasm Detection via Cross-Modal Graph Convolutional Network
    Liang, Bin
    Lou, Chenwei
    Li, Xiang
    Yang, Min
    Gui, Lin
    He, Yulan
    Pei, Wenjie
    Xu, Ruifeng
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 1767 - 1777
  • [23] MSTGC: Multi-Channel Spatio-Temporal Graph Convolution Network for Multi-Modal Brain Networks Fusion
    Xu, Ruting
    Zhu, Qi
    Li, Shengrong
    Hou, Zhenghua
    Shao, Wei
    Zhang, Daoqiang
    IEEE TRANSACTIONS ON NEURAL SYSTEMS AND REHABILITATION ENGINEERING, 2023, 31 : 2359 - 2369
  • [24] Deep Regression via Multi-Channel Multi-Modal Learning for Pneumonia Screening
    Wang, Qiuli
    Yang, Dan
    Li, Zhihuan
    Zhang, Xiaohong
    Liu, Chen
    IEEE ACCESS, 2020, 8 : 78530 - 78541
  • [25] Disease Classification Model Based on Multi-Modal Feature Fusion
    Wan, Zhengyu
    Shao, Xinhui
    IEEE ACCESS, 2023, 11 : 27536 - 27545
  • [26] Modeling Multi-Task Joint Training of Aggregate Networks for Multi-Modal Sarcasm Detection
    Ou, Lisong
    Li, Zhixin
    PROCEEDINGS OF THE 4TH ANNUAL ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, ICMR 2024, 2024, : 833 - 841
  • [27] Multi-Modal Fusion for Enhanced Automatic Modulation Classification
    Li, Yingkai
    Wang, Shufei
    Zhang, Yibin
    Huang, Hao
    Wang, Yu
    Zhang, Qianyun
    Lin, Yun
    Gui, Guan
    2024 IEEE 99TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2024-SPRING, 2024,
  • [28] Multi-Modal Sarcasm Detection and Humor Classification in Code-Mixed Conversations
    Bedi, Manjot
    Kumar, Shivani
    Akhtar, Md Shad
    Chakraborty, Tanmoy
    IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 14 (02) : 1363 - 1375
  • [29] MMSD2.0: Towards a Reliable Multi-modal Sarcasm Detection System
    Qin, Libo
    Huang, Shijue
    Chen, Qiguang
    Cai, Chenran
    Zhang, Yudi
    Bin Liang
    Che, Wanxiang
    Xu, Ruifeng
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2023), 2023, : 10834 - 10845
  • [30] Modeling Intra and Inter-modality Incongruity for Multi-Modal Sarcasm Detection
    Pan, Hongliang
    Lin, Zheng
    Fu, Peng
    Qi, Yatao
    Wang, Weiping
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 1383 - 1392