E-Commerce Fake Reviews Detection Using LSTM with Word2Vec Embedding

Cited by: 0
Authors
Raheem, Mafas [1]
Chong, Yi Chien [1]
Affiliations
[1] School of Computing, Asia Pacific University of Technology and Innovation, Kuala Lumpur, Malaysia
Keywords
Adversarial machine learning - Contrastive Learning - Deep learning - Embeddings - Natural language processing systems
DOI
10.20532/cit.2024.1005803
Abstract
Customer reviews inform potential buyers' decisions, but fake reviews in e-commerce can skew perceptions as customers may feel pressured to leave positive feedback. Detecting fake reviews in e-commerce platforms is a critical challenge, impacting online shopping and deceiving customers. Effective detection strategies, employing deep learning architectures and word embeddings, are essential to combat this issue. Specifically, the study presented in this paper employed a 1-layer Simple LSTM model, a 1D Convolutional model, and a combined CNN+LSTM model. These models were trained using different pre-trained word embeddings including Word2Vec, GloVe, FastText, and with Keras embeddings, to convert the text data into vector form. The models were evaluated based on accuracy and F1-score to provide a comprehensive measure of their performance. The results indicated that the Simple LSTM model with Word2Vec embeddings achieved an accuracy of nearly 91% and an F1-score of 0.9024, outperforming all other model-embedding combinations. The 1D convolutional model performed best without any embeddings, suggesting its ability to extract meaningful features from the raw text. The transformer-based models, BERT and DistilBERT, showed progressive learning but struggled with generalization, indicating the need for strategies such as early stopping, dropout, or regularization to prevent overfitting. Notably, the DistilBERT model consistently outperformed the LSTM model, achieving optimal performance with accuracy of 96% and an F1-score of 0.9639 using a batch size of 32 and a learning rate of 4.00E-05. ACM CCS (2012) Classification: Computing methodologies → Artificial intelligence → Natural language processing. © 2024, University of Zagreb Faculty of Electrical Engineering and Computing. All rights reserved.
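As a rough illustration of the two evaluation metrics named in the abstract, the sketch below computes accuracy and F1-score for a binary fake/genuine classifier. It is not the authors' code: the function names and the toy labels are hypothetical, and the label 1 is assumed to mean "fake review".

```python
def confusion_counts(y_true, y_pred, positive=1):
    """Count true/false positives and negatives for a binary task."""
    tp = fp = fn = tn = 0
    for truth, pred in zip(y_true, y_pred):
        if pred == positive and truth == positive:
            tp += 1
        elif pred == positive:
            fp += 1
        elif truth == positive:
            fn += 1
        else:
            tn += 1
    return tp, fp, fn, tn


def accuracy(y_true, y_pred, positive=1):
    """Fraction of predictions that match the true label."""
    tp, fp, fn, tn = confusion_counts(y_true, y_pred, positive)
    total = tp + fp + fn + tn
    return (tp + tn) / total if total else 0.0


def f1_score(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall: 2*TP / (2*TP + FP + FN)."""
    tp, fp, fn, _ = confusion_counts(y_true, y_pred, positive)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


# Toy labels (made up for illustration): 1 = fake review, 0 = genuine review.
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 1, 0, 0, 0, 1, 1, 0]

print("accuracy:", accuracy(y_true, y_pred))
print("F1-score:", f1_score(y_true, y_pred))
```

Reporting both numbers is useful because accuracy alone can look high on imbalanced data, while F1 reflects how well the positive (fake) class is recovered.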
Pages: 65-80