Dataset for Arabic Fake News

Cited by: 5
Authors
Assaf, Rasha [1]
Saheb, Mahmoud [1]
Affiliations
[1] Palestine Polytech Univ, Hebron, Palestine
Keywords
Fabricated contents; Annotations; Cohen's Kappa
DOI
10.1109/AICT52784.2021.9620228
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The adoption of social media platforms allows misinformation to spread quickly, which can mislead the public. The ease of disseminating information over the internet enables users to create and share massive amounts of content, some of which is unreliable. Fake news has therefore become an important social issue for researchers to tackle. A few English fake news datasets have been published, and numerous machine learning approaches have been proposed for news reliability classification. However, reliable Arabic datasets for fake news detection remain limited. This is a data paper in which we present a new dataset of Arabic fake news. The data were collected from various sources, including PalKashif. The articles and news segments were labeled by two experts. The dataset contains about 500 news segments, and the inter-annotator agreement measured using Cohen's Kappa is 0.807. The dataset will be published for public use on GitHub.
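The agreement statistic named in the abstract, Cohen's Kappa, compares the observed agreement between two annotators with the agreement expected by chance: kappa = (p_o - p_e) / (1 - p_e). The following is a minimal sketch of that calculation, not the authors' code. The function name `cohens_kappa`, the label values, and the ten toy annotations are hypothetical and are not taken from the dataset.

```python
from collections import Counter


def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators who labeled the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("both annotators must label the same non-empty set of items")

    n = len(labels_a)

    # Observed agreement: fraction of items given the same label by both.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n

    # Chance agreement: for each category, the product of the two
    # annotators' marginal proportions, summed over all categories.
    counts_a = Counter(labels_a)
    counts_b = Counter(labels_b)
    categories = counts_a.keys() | counts_b.keys()
    expected = sum(counts_a[c] * counts_b[c] for c in categories) / (n * n)

    if expected == 1.0:
        # Both annotators used a single identical label for every item.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical toy annotations (NOT from the dataset described above).
annotator_1 = ["fake", "fake", "real", "real", "fake", "real", "real", "fake", "real", "real"]
annotator_2 = ["fake", "fake", "real", "real", "fake", "real", "fake", "fake", "real", "real"]

kappa = cohens_kappa(annotator_1, annotator_2)
print(f"Cohen's kappa: {kappa:.3f}")
```

In this toy example the annotators agree on 9 of 10 items and chance agreement is 0.5, so kappa comes out at about 0.8, close to the 0.807 reported for the dataset.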
Pages: 4