Content Based Fake News Detection Using N-Gram Models

被引:24
|
作者
Wynne, Hnin Ei [1 ]
Wint, Zar Zar [1 ]
机构
[1] Mandalay Technol Univ, Dept Comp Engn & Informat Technol, Mandalay, Myanmar
关键词
Online fake news; Fake news detection; Word n-gram; Character n-grams;
D O I
10.1145/3366030.3366116
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Fake news is very popular these days because of the increasing popularity of social media. Detecting fake news is considered as one of the most dangerous types of deception because it is created with dishonest intention to misdirect the public. Many researchers proposed fake news detection systems considering many approaches; content, social-context, and propagation. When the news is detected fake or real, there is a limitation in the accuracy and understandability of language. In this paper, we propose the fake news detection system that considers the content of the online news articles. We investigate two machine learning algorithms with the use of word n-grams and character n-grams analysis. Experiments yield better results using character n-grams with Term-Frequency-Inverted Document Frequency (TF-IDF) and Gradient Boosting Classifier achieves an accuracy of 96%.
引用
收藏
页码:669 / 673
页数:5
相关论文
共 50 条
  • [1] Detection of Online Fake News Using N-Gram Analysis and Machine Learning Techniques
    Ahmed, Hadeer
    Traore, Issa
    Saad, Sherif
    [J]. INTELLIGENT, SECURE, AND DEPENDABLE SYSTEMS IN DISTRIBUTED AND CLOUD ENVIRONMENTS (ISDDC 2017), 2017, 10618 : 127 - 138
  • [2] N-Gram Based Sarcasm Detection for News and Social Media Text Using Hybrid Deep Learning Models
    Thaokar C.
    Rout J.K.
    Rout M.
    Ray N.K.
    [J]. SN Computer Science, 5 (1)
  • [3] Content Based Fake News Detection Using Knowledge Graphs
    Pan, Jeff Z.
    Pavlova, Siyana
    Li, Chenxi
    Li, Ningxi
    Li, Yangmei
    Liu, Jinshuo
    [J]. SEMANTIC WEB - ISWC 2018, PT I, 2018, 11136 : 669 - 683
  • [4] Bugram: Bug Detection with N-gram Language Models
    Wang, Song
    Chollak, Devin
    Movshovitz-Attias, Dana
    Tan, Lin
    [J]. 2016 31ST IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING (ASE), 2016, : 708 - 719
  • [5] N-gram Density based Malware Detection
    O'Kane, Philip
    Sezer, Sakir
    McLaughlin, Kieran
    [J]. 2014 WORLD SYMPOSIUM ON COMPUTER APPLICATIONS & RESEARCH (WSCAR), 2014,
  • [6] Alphabet Flatting as a variant of n-gram feature extraction method in ensemble classification of fake news
    Ksieniewicz, Pawel
    Zyblewski, Pawel
    Borek-Marciniec, Weronika
    Kozik, Rafal
    Choras, Michal
    Wozniak, Michal
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 120
  • [7] A quantitative approach for intrusions detection and prevention based on statistical n-gram models
    Boulaiche, Ammar
    Bouzayani, Hatem
    Adi, Kamel
    [J]. ANT 2012 AND MOBIWIS 2012, 2012, 10 : 450 - 457
  • [8] Audit file reduction using N-gram models
    Godínez, F
    Hutter, D
    Monroy, R
    [J]. FINANCIAL CRYPTOGRAPHY AND DATA SECURITY, 2005, 3570 : 336 - 340
  • [9] Profile based compression of n-gram language models
    Olsen, Jesper
    Oria, Daniela
    [J]. 2006 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-13, 2006, : 1041 - 1044
  • [10] HTTP attack detection using n-gram analysis
    Oza, Aditya
    Ross, Kevin
    Low, Richard M.
    Stamp, Mark
    [J]. COMPUTERS & SECURITY, 2014, 45 : 242 - 254