Detection of fake news in a new corpus for the Spanish language

被引:44
|
作者
Posadas-Duran, Juan-Pablo [1 ]
Gomez-Adorno, Helena [2 ]
Sidorov, Grigori [3 ]
Moreno Escobar, Jesus Jaime [1 ]
机构
[1] Inst Politecn Nacl, ESIME Zacatenco, Unidad Zacatenco, Escuela Super Ingn Meccan & Elect, Mexico City, DF, Mexico
[2] Univ Nacl Autonoma Mexico, IIMAS, Mexico City, DF, Mexico
[3] Inst Politecn Nacl, CIC, Mexico City, DF, Mexico
关键词
Fake news; corpus; Spanish; resource; machine learning;
D O I
10.3233/JIFS-179034
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a new resource to analyze and detect deceptive information that is present in a huge amount of news websites. Specifically, we compiled a corpus of news in the Spanish language extracted from several websites. The corpus is annotated with two labels (real and fake) for automatic fake news detection. Furthermore, the corpus also provides the category of the news, presenting a detailed analysis on vocabulary overlap among categories. Finally, we present a style-based fake news detection method. The obtained results show that the introduced corpus is an interesting resource for future research in this area.
引用
收藏
页码:4869 / 4876
页数:8
相关论文
共 50 条
  • [1] FakeRecogna: A New Brazilian Corpus for Fake News Detection
    Garcia, Gabriel L.
    Afonso, Luis C. S.
    Papa, Joao P.
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2022, 2022, 13208 : 57 - 67
  • [2] A study on the Detection of Fake news in Spanish
    Galvez, Alba Maribel Sanchez
    Albores, Francisco Javier
    Gonzalez, Ricardo Alvarez
    Conde, Said Gonzalez
    Galvez, Sully Sanchez
    INTERNATIONAL JOURNAL OF COMBINATORIAL OPTIMIZATION PROBLEMS AND INFORMATICS, 2024, 15 (02): : 85 - 94
  • [3] The PolitiFact-Oslo Corpus: A New Dataset for Fake News Analysis and Detection
    Poldvere, Nele
    Uddin, Zia
    Thomas, Aleena
    INFORMATION, 2023, 14 (12)
  • [4] Contributions to the Study of Fake News in Portuguese: New Corpus and Automatic Detection Results
    Monteiro, Rafael A.
    Santos, Roney L. S.
    Pardo, Thiago A. S.
    de Almeida, Tiago A.
    Ruiz, Evandro E. S.
    Vale, Oto A.
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2018, 2018, 11122 : 324 - 334
  • [5] Language-Independent Fake News Detection: English, Portuguese, and Spanish Mutual Features
    Abonizio, Hugo Queiroz
    de Morais, Janaina Ignacio
    Tavares, Gabriel Marques
    Barbon Junior, Sylvio
    FUTURE INTERNET, 2020, 12 (05):
  • [6] Cross-Language Fake News Detection
    Chu S.K.W.
    Xie R.
    Wang Y.
    Data and Information Management, 2021, 5 (01) : 100 - 109
  • [7] A Review of Fake News Detection Techniques for Arabic Language
    Alotaibi, Taghreed
    Al-Dossari, Hmood
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (01) : 392 - 407
  • [8] Fake news article detection datasets for Hindi language
    Kumar, Sujit
    Shankhdhar, Anant
    Singal, Divyam
    Aggarwal, Bhuvan
    Malhotra, Ahaan Sameer
    Ranbir Singh, Sanasam
    LANGUAGE RESOURCES AND EVALUATION, 2024,
  • [9] A Survey on Natural Language Processing for Fake News Detection
    Oshikawa, Ray
    Qian, Jing
    Wang, William Yang
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6086 - 6093
  • [10] Fakepedia Corpus: A Flexible Fake News Corpus in Portuguese
    Charles, Anderson Cordeiro
    Ruback, Livia
    Oliveira, Jonice
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2022, 2022, 13208 : 37 - 45