Automatic detection of fake tweets about the COVID-19 Vaccine in Portuguese

Cited by: 1
Authors
Geurgas, Rafael [1]
Tessler, Leandro R. [1]
Affiliations
[1] Univ Estadual Campinas, IFGW, BR-13083970 Campinas, SP, Brazil
Funding
São Paulo Research Foundation (FAPESP), Brazil
Keywords
Disinformation; COVID-19; Neural networks; Automatic classification
DOI
10.1007/s13278-024-01216-x
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The COVID-19 pandemic induced an unprecedented wave of disinformation on social media in Brazil. In particular, Twitter (currently X) was used to spread fake news about COVID-19 vaccines, which contributed to vaccine hesitancy. This article presents a BERT-based neural network for the automatic detection of fake tweets. The optimized architecture relies on BERTimbau, a BERT model pre-trained on Brazilian Portuguese, fine-tuned with three fully connected layers. All 2,857,908 tweets in Portuguese containing the word "vacina" (Portuguese for "vaccine") were collected over 7 months. A random subset of 16,731 tweets was manually classified as real or fake. Of these, 2,309 were discarded for not being about COVID-19 vaccines and 422 were discarded for containing irony. Of the remaining 14,000 tweets, 1,144 were labeled fake and 12,856 real. To balance the training dataset, the network was fine-tuned using the 1,144 curated fake tweets and a random sample of 2,000 real tweets. Optimal results were achieved by unfreezing the last four layers of BERTimbau. The best results obtained were a 77.1% F1-score and 76.9% accuracy. These results are already acceptable for practical applications and can be improved by increasing the size of the training dataset. A weighted F1-score of 96.3% was obtained by training the same neural network architecture, with the same hyperparameters, on a larger curated and balanced English-language training dataset.
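The dataset bookkeeping in the abstract can be checked with a short Python sketch. It is not the authors' code: the helper names, the integer placeholder IDs standing in for tweets, and the fixed random seed are assumptions made for illustration. Only the counts come from the abstract.

```python
import random

# Counts reported in the abstract.
LABELED = 16_731      # manually classified tweets
OFF_TOPIC = 2_309     # discarded: not about COVID-19 vaccines
IRONIC = 422          # discarded: contain irony
FAKE = 1_144          # curated tweets labeled fake
REAL = 12_856         # curated tweets labeled real
REAL_SAMPLE = 2_000   # real tweets drawn for fine-tuning


def remaining_after_discards(labeled, *discarded):
    """Number of tweets left once the discarded groups are removed."""
    return labeled - sum(discarded)


def build_training_set(fake_ids, real_ids, n_real, seed=0):
    """Keep every fake tweet and draw n_real real tweets without replacement.

    Returns shuffled (tweet_id, label) pairs, with label 1 = fake, 0 = real.
    The seed is a hypothetical choice; the abstract does not give one.
    """
    rng = random.Random(seed)
    sampled_real = rng.sample(real_ids, n_real)
    examples = [(i, 1) for i in fake_ids] + [(i, 0) for i in sampled_real]
    rng.shuffle(examples)
    return examples


curated = remaining_after_discards(LABELED, OFF_TOPIC, IRONIC)

# Placeholder integer IDs stand in for the actual tweets.
fake_ids = list(range(FAKE))
real_ids = list(range(FAKE, FAKE + REAL))

train = build_training_set(fake_ids, real_ids, REAL_SAMPLE)
fake_share_curated = FAKE / curated
fake_share_train = FAKE / len(train)

print(curated, len(train))
print(round(fake_share_curated, 3), round(fake_share_train, 3))
```

Under these counts, undersampling the real class raises the share of fake tweets from roughly 8% of the curated set to roughly 36% of the fine-tuning set, so the set is more balanced without being an exact 50/50 split.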
Pages: 10