Bilingual COVID-19 Fake News Detection Based on LDA Topic Modeling and BERT Transformer

被引:1
|
作者
Omrani, Pouria [1 ,3 ]
Ebrahimian, Zahra [2 ,3 ]
Toosi, Ramin [2 ,3 ]
Akhaee, Mohammad Ali [2 ]
机构
[1] K N Toosi Univ Technol, Fac Elect Engn, Tehran, Iran
[2] Univ Tehran, Sch Elect & Comp Engn, Coll Engn, Tehran, Iran
[3] Adak Vira Iranian Rahjoo Co, Tehran, Iran
关键词
BERT Transformer; Topic Modeling; Fake News Detection; COVID-19;
D O I
10.1109/IPRIA59240.2023.10147179
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The spread of fake news has become more prevalent given the popularity of social media and the various news that circulates on it. As a result, it is crucial to discern between real and fake news. During the COVID-19 pandemic, there have been numerous tweets, posts, and news about this illness in social media and electronic media worldwide. This research presents a bilingual model combining Latent Dirichlet Allocation (LDA) topic modeling and the BERT transformer to detect COVID-19 fake news in both Persian and English. First, the dataset is prepared in Persian and English, and then the proposed method is used to detect COVID-19 fake news on the prepared dataset. Finally, the proposed model is evaluated using various metrics such as accuracy, precision, recall, and the f1-score. As a result of this approach, we achieve 92.18% accuracy, which shows that adding topic information to the pre-trained contextual representations given by the BERT network, significantly improves the solving of instances that are domain-specific. Also, the results show that our proposed approach outperforms previous state-of-the-art methods.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] A BERT-Based Semantic Enhanced Model for COVID-19 Fake News Detection
    Yin, Hui
    Liu, Xiao
    Wu, Yutao
    Aria, Hilya Mudrika
    Mohawesh, Rami
    [J]. WEB AND BIG DATA, PT I, APWEB-WAIM 2023, 2024, 14331 : 1 - 15
  • [2] BERT Model for Fake News Detection Based on Social Bot Activities in the COVID-19 Pandemic
    Heidari, Maryam
    Zad, Samira
    Hajibabaee, Parisa
    Malekzadeh, Masoud
    HekmatiAthar, SeyyedPooya
    Uzuner, Ozlem
    Jones, James H. Jr Jr
    [J]. 2021 IEEE 12TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2021, : 103 - 109
  • [3] Towards COVID-19 fake news detection using transformer-based models
    Alghamdi, Jawaher
    Lin, Yuqing
    Luo, Suhuai
    [J]. KNOWLEDGE-BASED SYSTEMS, 2023, 274
  • [4] Covid-19 Fake News Detection: A Survey
    Shushkevich, Elena
    Alexandrov, Mikhail
    Cardiff, John
    [J]. COMPUTACION Y SISTEMAS, 2021, 25 (04): : 783 - 792
  • [5] Fake Sentence Detection Based on Transfer Learning: Applying to Korean COVID-19 Fake News
    Lee, Jeong-Wook
    Kim, Jae-Hoon
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (13):
  • [6] Financial Topic Modeling Based on the BERT-LDA Embedding
    Zhou, Mei
    Kong, Ying
    Lin, Jianwu
    [J]. 2022 IEEE 20TH INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS (INDIN), 2022, : 495 - 500
  • [7] Detection of Fake News on COVID-19 on Web Search Engines
    Mazzeo, Valeria
    Rapisarda, Andrea
    Giuffrida, Giovanni
    [J]. FRONTIERS IN PHYSICS, 2021, 9
  • [8] COVID-19 Infodemic in Malaysia: Conceptualizing Fake News for Detection
    Lim, Chee Kuan
    Zainol, Zurinahni
    Omar, Bahiyah
    Ibrahim, Noor Farizah
    [J]. ADVANCES IN MULTIMEDIA, 2023, 2023
  • [9] Cross-lingual COVID-19 Fake News Detection
    Du, Jiangshu
    Dou, Yingtong
    Xia, Congying
    Cui, Limeng
    Ma, Jing
    Yu, Philip S.
    [J]. 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS ICDMW 2021, 2021, : 859 - 862
  • [10] Fake news in COVID-19: A perspective
    Carrion-Alvarez, Diego
    Tijerina-Salina, Perla X.
    [J]. HEALTH PROMOTION PERSPECTIVES, 2020, 10 (04): : 290 - 291