Detecting opinion spams and fake news using text classification

被引:188
|
作者
Ahmed, Hadeer [1 ]
Traore, Issa [1 ]
Saad, Sherif [2 ]
机构
[1] Univ Victoria, ECE Dept, Victoria, BC, Canada
[2] Univ Windsor, Sch Comp Sci, Windsor, ON, Canada
来源
SECURITY AND PRIVACY | 2018年 / 1卷 / 01期
关键词
fake content detection; online fake news; online fake reviews; online social network security; opinion spams; text classification;
D O I
10.1002/spy2.9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, deceptive content such as fake news and fake reviews, also known as opinion spams, have increasingly become a dangerous prospect for online users. Fake reviews have affected consumers and stores alike. Furthermore, the problem of fake news has gained attention in 2016, especially in the aftermath of the last U.S. presidential elections. Fake reviews and fake news are a closely related phenomenon as both consist of writing and spreading false information or beliefs. The opinion spam problem was formulated for the first time a few years ago, but it has quickly become a growing research area due to the abundance of user-generated content. It is now easy for anyone to either write fake reviews or write fake news on the web. The biggest challenge is the lack of an efficient way to tell the difference between a real review and a fake one; even humans are often unable to tell the difference. In this paper, we introduce a new n-gram model to detect automatically fake contents with a particular focus on fake reviews and fake news. We study and compare 2 different features extraction techniques and 6 machine learning classification techniques. Experimental evaluation using existing public datasets and a newly introduced fake news dataset indicate very encouraging and improved performances compared to the state-of-the-art methods.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Active Learning for Text Classification and Fake News Detection
    Sahan, Marko
    Smidl, Vaclav
    Marik, Radek
    [J]. 2021 INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND INTELLIGENT CONTROLS (ISCSIC 2021), 2021, : 87 - 94
  • [2] Deep Learning Algorithms for Detecting Fake News in Online Text
    Girgis, Sherry
    Amer, Eslam
    Gadallah, Mahmoud
    [J]. PROCEEDINGS OF 2018 13TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND SYSTEMS (ICCES), 2018, : 93 - 97
  • [3] Detecting opinion spams through supervised boosting approach
    Hazim, Mohamad
    Anuar, Nor Badrul
    Ab Razak, Mohd Faizal
    Abdullah, Nor Aniza
    [J]. PLOS ONE, 2018, 13 (06):
  • [4] Detecting tag spams for social bookmarking Websites using a text mining approach
    Yang, Hsin-Chang
    Lee, Chung-Hong
    [J]. INTERNATIONAL JOURNAL OF INFORMATION TECHNOLOGY & DECISION MAKING, 2014, 13 (02) : 387 - 406
  • [5] Detecting Fake News Using Machine Learning Algorithms
    Bharath, G.
    Manikanta, K. J.
    Prakash, G. Bhanu
    Sumathi, R.
    Chinnasamy, P.
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2021,
  • [6] Detecting Arabic Fake News Using Machine Learning
    Khalil, Ashwaq
    Jarrah, Moath
    Aldwairi, Monther
    Jararweh, Yaser
    [J]. 2021 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT DATA SCIENCE TECHNOLOGIES AND APPLICATIONS (IDSTA), 2021, : 171 - 177
  • [7] Detecting Fake News Articles
    Lin, Jun
    Tremblay-Taylor, Glenna
    Mou, Guanyi
    You, Di
    Lee, Kyumin
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 3021 - 3025
  • [8] Detection of Fake News Text Classification on COVID-19 Using Deep Learning Approaches
    Bangyal, Waqas Haider
    Qasim, Rukhma
    Rehman, Najeeb Ur
    Ahmad, Zeeshan
    Dar, Hafsa
    Rukhsar, Laiqa
    Aman, Zahra
    Ahmad, Jamil
    [J]. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2021, 2021
  • [9] Text Data Augmentation Techniques for Word Embeddings in Fake News Classification
    Kapusta, Jozef
    Drzik, David
    Steflovic, Kirsten
    Nagy, Kitti Szabo
    [J]. IEEE ACCESS, 2024, 12 : 31538 - 31550
  • [10] Transforming Fake News: Robust Generalisable News Classification Using Transformers
    Blackledge, Ciara
    Atapour-Abarghouei, Amir
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 3960 - 3968