E-Commerce Fake Reviews Detection Using LSTM with Word2Vec Embedding

被引:0
|
作者
Raheem, Mafas [1 ]
Chong, Yi Chien [1 ]
机构
[1] School of Computing, Asia Pacific University of Technology and Innovation, Kuala Lumpur, Malaysia
关键词
Adversarial machine learning - Contrastive Learning - Deep learning - Embeddings - Natural language processing systems;
D O I
10.20532/cit.2024.1005803
中图分类号
学科分类号
摘要
Customer reviews inform potential buyers' decisions, but fake reviews in e-commerce can skew perceptions as customers may feel pressured to leave positive feedback. Detecting fake reviews in e-commerce platforms is a critical challenge, impacting online shopping and deceiving customers. Effective detection strategies, employing deep learning architectures and word embeddings, are essential to combat this issue. Specifically, the study presented in this paper employed a 1-layer Simple LSTM model, a 1D Convolutional model, and a combined CNN+LSTM model. These models were trained using different pre-trained word embeddings including Word2Vec, GloVe, FastText, and with Keras embeddings, to convert the text data into vector form. The models were evaluated based on accuracy and F1-score to provide a comprehensive measure of their performance. The results indicated that the Simple LSTM model with Word2Vec embeddings achieved an accuracy of nearly 91% and an F1-score of 0.9024, outperforming all other model-em-bedding combinations. The 1D convolutional model performed best without any embeddings, suggesting its ability to extract meaningful features from the raw text. The transformer-based models, BERT and DistilBERT, showed progressive learning but struggled with generalization, indicating the need for strategies such as early stopping, dropout, or regularization to prevent overfitting. Notably, the DistilBERT model consistently outperformed the LSTM model, achieving optimal performance with accuracy of 96% and an F1-score of 0.9639 using a batch size of 32 and a learning rate of 4.00E-05. ACM CCS (2012) Classification: Computing methodologies → Artificial intelligence → Natural language processing. © 2024, University of Zagreb Faculty of Electrical Engineering and Computing. All rights reserved.
引用
收藏
页码:65 / 80
相关论文
共 50 条
  • [21] Encrypted Malicious Traffic Detection Based on Word2Vec
    Ferriyan, Andrey
    Thamrin, Achmad Husni
    Takeda, Keiji
    Murai, Jun
    ELECTRONICS, 2022, 11 (05)
  • [22] KEYWORD EXTRACTION BASED ON WORD SYNONYMS USING WORD2VEC
    Ogul, Iskender Ulgen
    Ozcan, Caner
    Hakdagli, Ozlem
    2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
  • [23] Duplicate Short Text Detection Based on Word2vec
    Gao, Jin
    He, Yahao
    Zhang, Xiaoyan
    Xia, Yamei
    PROCEEDINGS OF 2017 8TH IEEE INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2017), 2017, : 33 - 37
  • [24] Multiscale cascaded domain-based approach for Arabic fake reviews detection in e-commerce platforms
    Qandos, Nour
    Hamad, Ghadir
    Alharbi, Maitha
    Alturki, Shatha
    Alharbi, Waad
    Albelaihi, Arwa A.
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (02)
  • [25] Image Caption Generation Using Scoring Based on Object Detection and Word2Vec
    Misawa, Tadanobu
    Morizumi, Nozomi
    Yamashita, Kazuya
    SENSORS AND MATERIALS, 2023, 35 (07) : 2195 - 2204
  • [26] 基于Word2Vec与LSTM病历文本分类研究
    王捷
    陈超
    周海权
    舒德胜
    黄豪
    现代计算机, 2023, 29 (17) : 41 - 44
  • [27] A Comparative Study of Sentiment Analysis Methods for Detecting Fake Reviews in E-Commerce
    Puttarattanamanee M.
    Boongasame L.
    Thammarak K.
    HighTech and Innovation Journal, 2023, 4 (02): : 349 - 363
  • [28] Chinese Sentiment Classification Using Extended Word2Vec
    张胜
    张鑫
    程佳军
    王晖
    Journal of Donghua University(English Edition), 2016, 33 (05) : 823 - 826
  • [29] Using Word2Vec Recommendation for Improved Purchase Prediction
    Esmeli, Ramazan
    Bader-El-Den, Mohamed
    Abdullahi, Hassana
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [30] A New Approach to Embedding Semantic Link Network with Word2Vec Binary Code
    Yuan, Yanhong
    Liu, Yao
    Huang, Qiaoli
    Huang, Zhixing
    2015 11TH INTERNATIONAL CONFERENCE ON SEMANTICS, KNOWLEDGE AND GRIDS (SKG), 2015, : 9 - 16