Entity-centric multi-domain transformer for improving generalization in fake news detection

被引:5
|
作者
Bazmi, Parisa [1 ]
Asadpour, Masoud [1 ]
Shakery, Azadeh [1 ,2 ]
Maazallahi, Abbas [1 ]
机构
[1] Univ Tehran, Coll Engn, Sch Elect & Comp Engn, Tehran, Iran
[2] Inst Res Fundamental Sci IPM, Sch Comp Sci, Tehran, Iran
关键词
Cross-domain; Domain generalization; Entity abstraction; Fake news detection; Knowledge entities; Mixture; -of; -experts; Multi-domain;
D O I
10.1016/j.ipm.2024.103807
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Fake news has become a significant concern in recent times, particularly during the COVID-19 pandemic, as spreading false information can pose significant public health risks. Although many models have been suggested to detect fake news, they are often limited in their ability to extend to new emerging domains since they are designed for a single domain. Previous studies on multidomain fake news detection have focused on developing models that can perform well on multiple domains, but they often lack the ability to generalize to new unseen domains, which limits their effectiveness. To overcome this limitation, in this paper, we propose the Entity-centric Multi-domain Transformer (EMT) model. EMT uses entities in the news as key components in learning domain-invariant and domain-specific news representations, which addresses the challenges of domain shift and incomplete domain labeling in multidomain fake news detection. It incorporates entity background information from external knowledge sources to enhance finegrained news domain representation. EMT consists of a Domain-Invariant (DI) encoder, a Domain-Specific (DS) encoder, and a Cross-Domain Transformer (CT) that facilitates investigation of domain relationships and knowledge interaction with input news, enabling effective generalization. We evaluate the EMT's performance in multi-domain fake news detection across three settings: supervised multi-domain, zero-shot setting on new unseen domain, and limited samples from new domain. EMT demonstrates greater stability than state-of-the-art models when dealing with domain changes and varying training data. Specifically, in the zero-shot setting on new unseen domains, EMT achieves a good F1 score of approximately 72 %. The results highlight the effectiveness of EMT's entity-centric approach and its potential for real-world applications, as it demonstrates the ability to adapt to various training settings and outperform existing models in handling limited label data and adapting to previously unseen domains.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Deepfake detection based on cross-domain local characteristic analysis with multi-domain transformer
    Amin, Muhammad Ahmad
    Hu, Yongjian
    Li, Chang-Tsun
    Liu, Beibei
    ALEXANDRIA ENGINEERING JOURNAL, 2024, 91 : 592 - 609
  • [42] Entity-Oriented Multi-Modal Alignment and Fusion Network for Fake News Detection
    Li, Peiguang
    Sun, Xian
    Yu, Hongfeng
    Tian, Yu
    Yao, Fanglong
    Xu, Guangluan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 24 : 3455 - 3468
  • [43] Improving Fake News Detection by Using an Entity-enhanced Framework to Fuse Diverse Multimodal Clues
    Qi, Peng
    Cao, Juan
    Li, Xirong
    Liu, Huan
    Sheng, Qiang
    Mi, Xiaoyue
    He, Qin
    Lv, Yongbiao
    Guo, Chenyang
    Yu, Yingchao
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1212 - 1220
  • [44] DWIE: An entity-centric dataset for multi-task document-level information extraction
    Zaporojets, Klim
    Deleu, Johannes
    Develder, Chris
    Demeester, Thomas
    INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (04)
  • [45] Positive Unlabeled Fake News Detection via Multi-Modal Masked Transformer Network
    Wang, Jinguang
    Qian, Shengsheng
    Hu, Jun
    Hong, Richang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 234 - 244
  • [46] Indonesia's Fake News Detection using Transformer Network
    Awalina, Aisyah
    Fawaid, Jibran
    Krisnabayu, Rifky Yunus
    Yudistira, Novanto
    PROCEEDINGS OF 2021 INTERNATIONAL CONFERENCE ON SUSTAINABLE INFORMATION ENGINEERING AND TECHNOLOGY, SIET 2021, 2021, : 247 - 251
  • [47] Multi-Source Domain Adaptation with Weak Supervision for Early Fake News Detection
    Li, Yichuan
    Lee, Kyumin
    Kordzadeh, Nima
    Faber, Brenton
    Fiddes, Cameron
    Chen, Elaine
    Shu, Kai
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 668 - 676
  • [48] Improving fake news detection with domain-adversarial and graph-attention neural network
    Yuan, Hua
    Zheng, Jie
    Ye, Qiongwei
    Qian, Yu
    Zhang, Yan
    DECISION SUPPORT SYSTEMS, 2021, 151
  • [49] Cross-Domain Failures of Fake News Detection
    Janicka, Maria
    Pszona, Maria
    Wawer, Aleksander
    COMPUTACION Y SISTEMAS, 2019, 23 (03): : 1089 - 1097
  • [50] LIMESODA: Dataset for Fake News Detection in Healthcare Domain
    Payoungkhamdee, Patomporn
    Porkaew, Peerachet
    Sinthunyathum, Atthasith
    Songphum, Phattharaphon
    Kawidam, Witsarut
    Loha-Udom, Wichayut
    Boonkwan, Prachya
    Sutantayawalee, Vipas
    16TH INTERNATIONAL JOINT SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND NATURAL LANGUAGE PROCESSING (ISAI-NLP 2021), 2021,