Sarcasm is a common rhetorical device on social media platforms, in which individuals express emotions contrary to the literal meaning of their words. Capturing the incongruity in text is the critical factor in sarcasm detection. Although several studies have examined the incongruity within a single text, modeling the incongruity of contextual information remains underexplored. Inspired by the Multi-Head Attention mechanism of the Transformer, we propose a Multi-head Incongruity Aware Attention Network, which captures both target semantic incongruity and contextual semantic incongruity. Specifically, we design a multi-head self-match network to capture target semantic incongruity within a single text, and apply a multi-head co-match network to model contextual semantic incongruity. Furthermore, given the scarcity of sarcasm data and the correlation between sentiment analysis and sarcasm detection, we pre-train the language model on a large amount of sentiment analysis data, which enhances its ability to capture sentiment features in text. Experimental results demonstrate that our model achieves state-of-the-art performance on four benchmark datasets, with accuracy gains of 3.8% on Tweets Ghost, 1.1% on SARC Pol, and 1.9% on Ciron, and an F1-score gain of 0.3% on FigLang Twitter.
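To make the self-match idea concrete, below is a minimal NumPy sketch of multi-head self-attention over the token representations of a single text, the building block the abstract's self-match network is based on. The function name, the random stand-in projection matrices, and all dimensions are illustrative assumptions, not the paper's actual parameterization.

```python
import numpy as np

def multi_head_self_match(X, num_heads=2, seed=0):
    """Minimal multi-head self-match (self-attention) sketch.

    X: (seq_len, d_model) token representations of a single text.
    Each head lets every token attend to every other token, so
    intra-text semantic contrasts can be surfaced per head.
    """
    seq_len, d_model = X.shape
    assert d_model % num_heads == 0
    d_head = d_model // num_heads
    rng = np.random.default_rng(seed)
    # Hypothetical random projections stand in for learned W_Q, W_K, W_V.
    W_q, W_k, W_v = (rng.standard_normal((d_model, d_model)) for _ in range(3))
    Q, K, V = X @ W_q, X @ W_k, X @ W_v
    heads = []
    for h in range(num_heads):
        s = slice(h * d_head, (h + 1) * d_head)
        # Scaled dot-product scores between all token pairs.
        scores = Q[:, s] @ K[:, s].T / np.sqrt(d_head)   # (seq_len, seq_len)
        # Row-wise softmax turns scores into attention weights.
        weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
        weights /= weights.sum(axis=-1, keepdims=True)
        heads.append(weights @ V[:, s])
    # Concatenate the per-head outputs back to d_model.
    return np.concatenate(heads, axis=-1)                # (seq_len, d_model)

out = multi_head_self_match(np.random.default_rng(1).standard_normal((5, 8)))
```

A co-match variant would follow the same pattern but draw the keys and values from a second sequence (the context) rather than from the text itself.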