Developing a fake news identification model with advanced deep language transformers for Turkish COVID-19 misinformation data

被引:8
|
作者
Bozuyla, Mehmet [1 ]
Ozcift, Akin [2 ]
机构
[1] Pamukkale Univ, Fac Engn, Dept Elect Elect Engn, Denizli, Turkey
[2] Manisa Celal Bayar Univ, Hasan Ferdi Turgutlu Technol Fac, Dept Software Engn, Manisa, Turkey
关键词
  Infodemic; fake news; BerTURK; language transformers; machine learning; COVID-19; SCIENCE;
D O I
10.3906/elk-2106-55
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The massive use of social media causes rapid information dissemination that amplifies harmful messages such as fake news. Fake-news is misleading information presented as factual news that is generally used to manipulate public opinion. In particular, fake news related to COVID-19 is defined as 'infodemic' by World Health Organization. An infodemic is a misleading information that causes confusion which may harm health. There is a high volume of misinformation about COVID-19 that causes panic and high stress. Therefore, the importance of development of COVID-19 related fake news identification model is clear and it is particularly important for Turkish language from COVID-19 fake news identification point of view. In this article, we propose an advanced deep language transformer model to identify the truth of Turkish COVID-19 news from social media. For this aim, we first generated Turkish COVID-19 news from various sources as a benchmark dataset. Then we utilized five conventional machine learning algorithms (i.e. Naive Bayes, Random Forest, K-Nearest Neighbor, Support Vector Machine, Logistic Regression) on top of several language preprocessing tasks. As a next step, we used novel deep learning algorithms such as Long Short -Term Memory, Bi-directional Long-Short-Term-Memory, Convolutional Neural Networks, Gated Recurrent Unit and Bi-directional Gated Recurrent Unit. For further evaluation, we made use of deep learning based language transformers, i.e. Bi-directional Encoder Representations from Transformers and its variations, to improve efficiency of the proposed approach. From the obtained results, we observed that neural transformers, in particular Turkish dedicated transformer BerTURK, is able to identify COVID-19 fake news in 98.5% accuracy.
引用
下载
收藏
页码:908 / 926
页数:19
相关论文
共 50 条
  • [41] Exploring Content-Based and Meta-Data Analysis for Detecting Fake News Infodemic: A case study on COVID-19
    Ajao, Oluwaseun
    Garg, Ashish
    Da Costa-Abreu, Marjory
    2022 12TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION SYSTEMS (ICPRS), 2022,
  • [42] Identification and prediction of time-varying parameters of COVID-19 model: a data-driven deep learning approach
    Long, Jie
    Khaliq, A. Q. M.
    Furati, K. M.
    INTERNATIONAL JOURNAL OF COMPUTER MATHEMATICS, 2021, 98 (08) : 1617 - 1632
  • [43] Developing a Deep Neural Network model for COVID-19 diagnosis based on CT scan images
    Joloudari, Javad Hassannataj
    Azizi, Faezeh
    Nodehi, Issa
    Nematollahi, Mohammad Ali
    Kamrannejhad, Fateme
    Hassannatajjeloudari, Edris
    Alizadehsani, Roohallah
    Islam, Sheikh Mohammed Shariful
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2023, 20 (09) : 16236 - 16258
  • [44] Developing a COVID-19 Crisis Management Strategy Using News Media and Social Media in Big Data Analytics
    Park, Young-Eun
    SOCIAL SCIENCE COMPUTER REVIEW, 2022, 40 (06) : 1358 - 1375
  • [45] AraCovTexFinder: Leveraging the transformer-based language model for Arabic COVID-19 text identification
    Hossain, Md. Rajib
    Hoque, Mohammed Moshiul
    Siddique, Nazmul
    Dewan, Ali Akber
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 133
  • [46] Cross-SEAN: A cross-stitch semi-supervised neural attention model for COVID-19 fake news detection
    Paka, William Scott
    Bansal, Rachit
    Kaushik, Abhay
    Sengupta, Shubhashis
    Chakraborty, Tanmoy
    APPLIED SOFT COMPUTING, 2021, 107
  • [47] Semi-supervised self-training for COVID-19 misinformation detection: analyzing Twitter data and alternative news media on Norwegian Twitter
    Siri Frisli
    Journal of Computational Social Science, 2025, 8 (2):
  • [48] DeepCOVID-19: A model for identification of COVID-19 virus sequences with genomic signal processing and deep learning
    Adetiba, Emmanuel
    Abolarinwa, Joshua A.
    Adegoke, Anthony A.
    Taiwo, Tunmike B.
    Ajayi, Oluwaseun T.
    Abayomi, Abdultaofeek
    Adetiba, Joy N.
    Badejo, Joke A.
    COGENT ENGINEERING, 2022, 9 (01):
  • [49] A New Artificial Intelligent Based Deep Learning Model Using IOT For COVID-19 Identification
    Basha, Shaik Shakeer
    Khasim, Syed
    INTERNATIONAL JOURNAL OF EARLY CHILDHOOD SPECIAL EDUCATION, 2022, 14 (02) : 2630 - 2636
  • [50] Effective hybrid deep learning model for COVID-19 patterns identification using CT images
    Ibrahim, Dheyaa Ahmed
    Zebari, Dilovan Asaad
    Mohammed, Hussam J.
    Mohammed, Mazin Abed
    EXPERT SYSTEMS, 2022, 39 (10)