AFND: Arabic fake news dataset for the detection and classification of articles credibility

被引:8
|
作者
Khalil, Ashwaq [1 ]
Jarrah, Moath [1 ]
Aldwairi, Monther [1 ,2 ]
Jaradat, Manar [3 ]
机构
[1] Jordan Univ Sci & Technol, Dept Comp Engn, POB 3030, Irbid 22110, Jordan
[2] Zayed Univ, Coll Technol Innovat, Abu Dhabi, U Arab Emirates
[3] Hashemite Univ, Dept Comp Engn, POB 330127, Zarqa 13133, Jordan
来源
DATA IN BRIEF | 2022年 / 42卷
关键词
Arabic news dataset; Arabic fake news; Article credibility; Weak labeling; Detection; Classification;
D O I
10.1016/j.dib.2022.108141
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The news credibility detection task has started to gain more attention recently due to the rapid increase of news on different social media platforms. This article provides a large, labeled, and diverse Arabic Fake News Dataset (AFND) that is collected from public Arabic news websites. This dataset enables the research community to use supervised and unsupervised machine learning algorithms to classify the credibility of Arabic news articles. AFND consists of 606912 public news articles that were scraped from 134 public news websites of 19 different Arab countries over a 6-month period using Python scripts. The Arabic fact-check platform, Misbar, is used manually to classify each public news source into credible, not credible, or undecided. Weak supervision is applied to label news articles with the same label as the public source. AFND is imbalanced in the number of articles in each class. Hence, it is useful for researchers who focus on finding solutions for imbalanced datasets. The dataset is available in JSON format and can be accessed from Mendeley Data repository. (C) 2022 The Author(s). Published by Elsevier Inc.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] Dataset for Arabic Fake News
    Assaf, Rasha
    Saheb, Mahmoud
    [J]. 2021 IEEE 15TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT2021), 2021,
  • [2] An Experimental Evaluation of Data Classification Models for Credibility Based Fake News Detection
    Ramkissoon, Amit Neil
    Mohammed, Shareeda
    [J]. 20TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2020), 2020, : 93 - 100
  • [3] Fake News Detection with Generated Comments for News Articles
    Yanagi, Yuta
    Orihara, Ryohei
    Sei, Yuichi
    Tahara, Yasuyuki
    Ohsuga, Akihiko
    [J]. 2020 IEEE 24TH INTERNATIONAL CONFERENCE ON INTELLIGENT ENGINEERING SYSTEMS (INES 2020), 2020, : 85 - 89
  • [4] Determining an Optimal Data Classification Model for Credibility-Based Fake News Detection
    Amit Neil Ramkissoon
    Shareeda Mohammed
    Wayne Goodridge
    [J]. The Review of Socionetwork Strategies, 2021, 15 : 347 - 380
  • [5] Determining an Optimal Data Classification Model for Credibility-Based Fake News Detection
    Ramkissoon, Amit Neil
    Mohammed, Shareeda
    Goodridge, Wayne
    [J]. REVIEW OF SOCIONETWORK STRATEGIES, 2021, 15 (02): : 347 - 380
  • [6] IFND: a benchmark dataset for fake news detection
    Dilip Kumar Sharma
    Sonal Garg
    [J]. Complex & Intelligent Systems, 2023, 9 : 2843 - 2863
  • [7] An Arabic Corpus of Fake News: Collection, Analysis and Classification
    Alkhair, Maysoon
    Meftouh, Karima
    Smaili, Kamel
    Othman, Nouha
    [J]. ARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, ICALP 2019, 2019, 1108 : 292 - 302
  • [8] IFND: a benchmark dataset for fake news detection
    Sharma, Dilip Kumar
    Garg, Sonal
    [J]. COMPLEX & INTELLIGENT SYSTEMS, 2023, 9 (03) : 2843 - 2863
  • [9] A Moroccan News Articles Dataset (MNAD) For Arabic Text Categorization
    Jbene, Mourad
    Tigani, Small
    Saadane, Rachid
    Chehri, Abdellah
    [J]. 2021 INTERNATIONAL CONFERENCE ON DECISION AID SCIENCES AND APPLICATION (DASA), 2021,
  • [10] A Fake News Detection and Credibility Ranking Platform for Portuguese Online News
    Lima, Ines Rito
    Pinto, Marcia
    Amorim, Ivone
    Marreiros, Goreti
    Ulisses, Alexandre
    [J]. INFORMATION SYSTEMS AND TECHNOLOGIES, WORLDCIST 2022, VOL 1, 2022, 468 : 531 - 541