Fake or real? The computational detection of online deceptive text

被引:0
|
作者
Ball L. [1 ]
Elworthy J. [1 ]
机构
[1] Abertay University,
关键词
Applied artificial intelligence; Business analytics; Computational linguistics; Online fake reviews; Open source data; Text mining;
D O I
10.1057/jma.2014.15
中图分类号
学科分类号
摘要
Online repositories are providing business opportunities to gain feedback and opinions on products and services in the form of digital deposits. Such deposits are, in turn, capable of influencing the readers’ views and behaviours from the posting of misinformation intended to deceive or manipulate. Establishing the veracity of these digital deposits could thus bring key benefits to both online businesses and internet users. Although machine learning techniques are well established for classifying text in terms of their content, techniques to categorise them in terms of their veracity remain a challenge for the domain of feature set extraction and analysis. To date, text categorisation techniques for veracity have reported a wide and inconsistent range of accuracies between 57 and 90 per cent. This article evaluates the accuracy of detecting online deceptive text using a logistic regression classifier based on part of speech tags extracted from a corpus of known truthful and deceptive statements. An accuracy of 72 per cent is achieved by reducing 42 extracted part of speech tags to a feature vector of six using principle component analysis. The results compare favourably to other studies. Improvements are anticipated by training machine learning algorithms on more complex feature vectors by combining the key features identified in this study with others from disparate feature domains. © 2014 Macmillan Publishers Ltd.
引用
收藏
页码:187 / 201
页数:14
相关论文
共 50 条
  • [31] Detection of Online Fake News Using Blending Ensemble Learning
    Hansrajh, Arvin
    Adeliyi, Timothy T.
    Wing, Jeanette
    [J]. SCIENTIFIC PROGRAMMING, 2021, 2021
  • [32] Semantic Features-Based Discourse Analysis Using Deceptive and Real Text Reviews
    Alawadh, Husam M.
    Alabrah, Amerah
    Meraj, Talha
    Rauf, Hafiz Tayyab
    [J]. INFORMATION, 2023, 14 (01)
  • [33] Text-Convolutional Neural Networks for Fake News Detection in Tweets
    Birla Institute of Technology and Science, Pilani, India
    [J]. Adv. Intell. Sys. Comput., (81-90):
  • [34] Text Data Augmentation Techniques for Fake News Detection in the Romanian Language
    Bucos, Marian
    Tucudean, Georgiana
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (13):
  • [35] Fake or for real? A fake news workshop
    Hanz, Katherine
    Kingsland, Emily Sarah
    [J]. REFERENCE SERVICES REVIEW, 2020, 48 (01) : 91 - 112
  • [37] Computational Semantic Detection of Information Overlap in Text
    Taylor, Julia M.
    [J]. COGNITION IN FLUX, 2010, : 2170 - 2175
  • [38] Real or fake?
    Borman, S
    [J]. CHEMICAL & ENGINEERING NEWS, 2002, 80 (32) : 35 - 36
  • [39] Online Text Classification for Real Life Tweet Analysis
    Yar, Ersin
    Delibalta, Ibrahim
    Baruh, Lemi
    Kozat, Suleyman S.
    [J]. 2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 1609 - 1612
  • [40] Online Reasoning for Semantic Error Detection in Text
    [J]. Dou, Dejing (dou@cs.uoregon.edu), 1600, Springer Science and Business Media Deutschland GmbH (06):