A Weighted Stacking Ensemble Model With Sampling for Fake Reviews Detection

被引:1
|
作者
Singhal, Rahul [1 ]
Kashef, Rasha [2 ]
机构
[1] Jaypee Inst Informat Technol, Dept Comp Sci & Engn, Noida 201309, India
[2] Toronto Metropolitan Univ, Dept Elect Comp & Biomed Engn, Toronto, ON M5B 2K3, Canada
关键词
Feature extraction; Hidden Markov models; Data models; Computational modeling; Support vector machines; Machine learning; Convolutional neural networks; Ensemble learning; fake review; machine learning; sampling techniques; NEWS; CLASSIFIERS; STRENGTH; PRODUCT; SPAM;
D O I
10.1109/TCSS.2023.3268548
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Customers use reviews as a primary source of information to judge a product or service. Positive reviews help boost companies' reputations, increasing their revenue by attracting new clients, and increasing the purchasing order size. On the other hand, negative reviews significantly reduce sales, which might be the case due to competitive advantage. Organizations can use fake (i.e., misleading or fraudulent) reviews to generate fast profits by deceiving customers into buying their products. Recently, various methods to assess the legitimacy of reviews have been introduced using advances in machine learning. However, existing methods fall short of achieving highly accurate detection results for unbalanced classes. We aimed to create a spam review identification model using ensemble-based learning while balancing classes using sampling techniques. This article proposes a weighted stacking ensemble model with sampling (WSEM-S) for efficient fake reviews detection. We used n-gram models to effectively model language data for feature retrieval. The experimental results on three customer reviews datasets: YELPNYC, Deceptive Opinion Spam Corpus (DOSC) v1.4, and Deception datasets show that the proposed model outperforms the conventional machine learning techniques [Naive Bayes, logistic regression, K-nearest neighbor (KNN), random forest, extreme gradient boosting (XGBoost), and convolutional neural network (CNN)] as well as the state-of-the-art ensemble models.
引用
下载
收藏
页码:2578 / 2594
页数:17
相关论文
共 50 条
  • [31] A metadata-aware detection model for fake restaurant reviews based on multimodal fusion
    Yifei Jian
    Xinyu Chen
    Xiaoda Wang
    Ying Liu
    Xingshu Chen
    Xiao Lan
    Wenxian Wang
    Haizhou Wang
    Neural Computing and Applications, 2025, 37 (1) : 475 - 498
  • [32] Enhancing Motor Imagery Electroencephalography Classification with a Correlation-Optimized Weighted Stacking Ensemble Model
    Ahmadi, Hossein
    Mesin, Luca
    ELECTRONICS, 2024, 13 (06)
  • [33] Monthly Runoff Prediction Based on Stochastic Weighted Averaging-Improved Stacking Ensemble Model
    Fu, Kaixiang
    Sun, Xutong
    Chen, Kai
    Mo, Li
    Xiao, Wenjing
    Liu, Shuangquan
    Water (Switzerland), 2024, 16 (24)
  • [34] Detection of Fake Reviews: Analysis of Sellers' Manipulation Behavior
    Chen, Lirong
    Li, Wenli
    Chen, Hao
    Geng, Shidao
    SUSTAINABILITY, 2019, 11 (17)
  • [35] The Detection of Fake Reviews in Bestselling Books: Exploration and Findings
    Krishnan, Kavita
    Wan, Yun
    JOURNAL OF ELECTRONIC COMMERCE IN ORGANIZATIONS, 2021, 19 (04) : 64 - 79
  • [36] A Model-Driven Method for Quality Reviews Detection: An Ensemble Model of Feature Selection
    Wang, Hongwei
    Meng, Yuan
    Yin, Pei
    Hua, Jin
    FIFTEENTH WUHAN INTERNATIONAL CONFERENCE ON E-BUSINESS, 2016, : 573 - 581
  • [37] Towards Undeceived: Fake Reviews Detection Models Comparison
    Wu, Keming
    Poursardar, Faryaneh
    2022 IEEE 23RD INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION FOR DATA SCIENCE (IRI 2022), 2022, : 41 - 42
  • [38] Bengali fake reviews: A benchmark dataset and detection system
    Shahariar G.M.
    Shawon M.T.R.
    Shah F.M.
    Alam M.S.
    Mahbub M.S.
    Neurocomputing, 2024, 592
  • [39] Fake Reviews Detection using Supervised Machine Learning
    Elmogy, Ahmed M.
    Tariq, Usman
    Ibrahim, Atef
    Mohammed, Ammar
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (01) : 601 - 606
  • [40] An ensemble classification model for fake feedback detection using proposed labeled CloudArmor dataset
    Taneja, Harsh
    Kaur, Supreet
    COMPUTERS & ELECTRICAL ENGINEERING, 2021, 93