Fake or real? The computational detection of online deceptive text

被引:0
|
作者
Ball L. [1 ]
Elworthy J. [1 ]
机构
[1] Abertay University,
关键词
Applied artificial intelligence; Business analytics; Computational linguistics; Online fake reviews; Open source data; Text mining;
D O I
10.1057/jma.2014.15
中图分类号
学科分类号
摘要
Online repositories are providing business opportunities to gain feedback and opinions on products and services in the form of digital deposits. Such deposits are, in turn, capable of influencing the readers’ views and behaviours from the posting of misinformation intended to deceive or manipulate. Establishing the veracity of these digital deposits could thus bring key benefits to both online businesses and internet users. Although machine learning techniques are well established for classifying text in terms of their content, techniques to categorise them in terms of their veracity remain a challenge for the domain of feature set extraction and analysis. To date, text categorisation techniques for veracity have reported a wide and inconsistent range of accuracies between 57 and 90 per cent. This article evaluates the accuracy of detecting online deceptive text using a logistic regression classifier based on part of speech tags extracted from a corpus of known truthful and deceptive statements. An accuracy of 72 per cent is achieved by reducing 42 extracted part of speech tags to a feature vector of six using principle component analysis. The results compare favourably to other studies. Improvements are anticipated by training machine learning algorithms on more complex feature vectors by combining the key features identified in this study with others from disparate feature domains. © 2014 Macmillan Publishers Ltd.
引用
收藏
页码:187 / 201
页数:14
相关论文
共 50 条
  • [21] Automated Fake News Detection Using Computational Forensic Linguistics
    Moura, Ricardo
    Sousa-Silva, Rui
    Cardoso, Henrique Lopes
    [J]. PROGRESS IN ARTIFICIAL INTELLIGENCE (EPIA 2021), 2021, 12981 : 788 - 800
  • [22] To Catch a Fake: Curbing Deceptive Yelp Ratings and Venues
    Rahman, Mahmudur
    Carbunar, Bogdan
    Ballesteros, Jaime
    Chau, Duen Horng
    [J]. STATISTICAL ANALYSIS AND DATA MINING, 2015, 8 (03) : 147 - 161
  • [23] Beyond deceptive news (fake news). Anatomy of misinformation
    Masip, Pere
    Ferrer Sapena, Antonia
    [J]. BID-TEXTOS UNIVERSITARIS DE BIBLIOTECONOMIA I DOCUMENTACIO, 2021, (46):
  • [24] The Role of Fake Review Detection in Managing Online Corporate Reputation
    Loke, R. E.
    Kisoen, Z.
    [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON DATA SCIENCE, TECHNOLOGY AND APPLICATIONS (DATA), 2022, : 245 - 256
  • [25] A framework for fake review detection in online consumer electronics retailers
    Barbado, Rodrigo
    Araque, Oscar
    Iglesias, Carlos A.
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2019, 56 (04) : 1234 - 1244
  • [26] DitDetector: Bimodal Learning based on Deceptive Image and Text for Macro Malware Detection
    Yan, Jia
    Jia, Xiangkun
    Su, Purui
    Wan, Ming
    Ying, Lingyun
    Wang, Zhanyi
    [J]. PROCEEDINGS OF THE 38TH ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE, ACSAC 2022, 2022, : 227 - 239
  • [27] Making Sense of Lies, Deceptive Propaganda, and Fake News
    Brennen, Bonnie
    [J]. JOURNAL OF MEDIA ETHICS, 2017, 32 (03) : 179 - 181
  • [28] Modeling the time to share fake and real news in online social networks
    Doe, Cooper
    Knezevic, Vladimir
    Zeng, Maya
    Spezzano, Francesca
    Babinkostova, Liljana
    [J]. INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2023, 18 (04) : 369 - 378
  • [29] SENTIMENT AWARE FAKE NEWS DETECTION ON ONLINE SOCIAL NETWORKS
    Ajao, Oluwaseun
    Bhowmik, Deepayan
    Zargari, Shahrzad
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 2507 - 2511
  • [30] Survey on Fake Accounts Detection Algorithms on Online Social Networks
    Shamseddine, Jad
    Malli, Mohammad
    Hazimeh, Hussein
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INNOVATIONS IN COMPUTING RESEARCH (ICR'22), 2022, 1431 : 375 - 380