Entity-centric multi-domain transformer for improving generalization in fake news detection

Cited by: 5
Authors
Bazmi, Parisa [1]
Asadpour, Masoud [1]
Shakery, Azadeh [1, 2]
Maazallahi, Abbas [1]
Affiliations
[1] Univ Tehran, Coll Engn, Sch Elect & Comp Engn, Tehran, Iran
[2] Inst Res Fundamental Sci IPM, Sch Comp Sci, Tehran, Iran
Keywords
Cross-domain; Domain generalization; Entity abstraction; Fake news detection; Knowledge entities; Mixture-of-experts; Multi-domain
DOI
10.1016/j.ipm.2024.103807
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Fake news has become a significant concern in recent times, particularly during the COVID-19 pandemic, because spreading false information can pose serious public health risks. Although many models have been proposed to detect fake news, they are often designed for a single domain, which limits their ability to extend to newly emerging domains. Previous studies on multi-domain fake news detection have focused on models that perform well across multiple domains, but these models often fail to generalize to new, unseen domains, which limits their effectiveness. To overcome this limitation, we propose the Entity-centric Multi-domain Transformer (EMT) model. EMT uses the entities in the news as key components for learning domain-invariant and domain-specific news representations, which addresses the challenges of domain shift and incomplete domain labeling in multi-domain fake news detection. It incorporates entity background information from external knowledge sources to enhance fine-grained news domain representation. EMT consists of a Domain-Invariant (DI) encoder, a Domain-Specific (DS) encoder, and a Cross-Domain Transformer (CT) that facilitates the investigation of domain relationships and knowledge interaction with the input news, enabling effective generalization. We evaluate EMT's performance in multi-domain fake news detection across three settings: supervised multi-domain, zero-shot on a new unseen domain, and limited samples from a new domain. EMT demonstrates greater stability than state-of-the-art models when dealing with domain changes and varying training data. Specifically, in the zero-shot setting on new unseen domains, EMT achieves an F1 score of approximately 72%. The results highlight the effectiveness of EMT's entity-centric approach and its potential for real-world applications, as it adapts to various training settings and outperforms existing models in handling limited labeled data and adapting to previously unseen domains.
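For readers unfamiliar with the metric quoted in the abstract, the following is a minimal sketch of how an F1 score is computed for a binary fake/real classifier. It is not the authors' evaluation code: the function names `f1_score` and `macro_f1`, the label convention (1 = fake, 0 = real), and the toy labels are all assumptions made for illustration, and the abstract does not state which F1 averaging the paper reports.

```python
def f1_score(y_true, y_pred, positive=1):
    """F1 for one class: harmonic mean of precision and recall."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No true positives: precision and recall are 0 or undefined,
        # so F1 is reported as 0 by convention.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def macro_f1(y_true, y_pred, labels=(0, 1)):
    """Unweighted mean of the per-class F1 scores."""
    return sum(f1_score(y_true, y_pred, positive=c) for c in labels) / len(labels)


# Hypothetical toy labels (1 = fake, 0 = real); not data from the paper.
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 1, 0, 0, 0, 1, 1, 0]

print(f1_score(y_true, y_pred))  # F1 of the "fake" class
print(macro_f1(y_true, y_pred))  # average over both classes
```

In a zero-shot evaluation such as the one the abstract describes, `y_true` and `y_pred` would come from a domain that was held out of training entirely.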
Pages: 17