Detection of fake news in a new corpus for the Spanish language

Cited by: 44
Authors
Posadas-Duran, Juan-Pablo [1]
Gomez-Adorno, Helena [2]
Sidorov, Grigori [3]
Moreno Escobar, Jesus Jaime [1]
Affiliations
[1] Inst Politecn Nacl, ESIME Zacatenco, Unidad Zacatenco, Escuela Super Ingn Meccan & Elect, Mexico City, DF, Mexico
[2] Univ Nacl Autonoma Mexico, IIMAS, Mexico City, DF, Mexico
[3] Inst Politecn Nacl, CIC, Mexico City, DF, Mexico
Keywords
Fake news; corpus; Spanish; resource; machine learning
DOI
10.3233/JIFS-179034
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We present a new resource to analyze and detect deceptive information that is present in a huge amount of news websites. Specifically, we compiled a corpus of news in the Spanish language extracted from several websites. The corpus is annotated with two labels (real and fake) for automatic fake news detection. Furthermore, the corpus also provides the category of the news, presenting a detailed analysis on vocabulary overlap among categories. Finally, we present a style-based fake news detection method. The obtained results show that the introduced corpus is an interesting resource for future research in this area.
Pages: 4869-4876
Page count: 8