Document representation and feature combination for deceptive spam review detection

被引:96
|
作者
Li, Luyang [1 ]
Qin, Bing [1 ]
Ren, Wenjing [1 ]
Liu, Ting [1 ]
机构
[1] Harbin Inst Technol, Res Ctr Social Comp & Informat Retrieval, Harbin, Peoples R China
基金
中国国家自然科学基金;
关键词
Spam review detection; Opinion spam; Representation learning; PREDICTING DECEPTION;
D O I
10.1016/j.neucom.2016.10.080
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deceptive spam reviews of products or service are harmful for customers in decision making. Existing approaches to detect deceptive spam reviews are concerned in feature designing. Hand-crafted features can show some linguistic phenomena, however can hardly reveal the latent semantic meaning of the review. We present a neural network based model to learn the representation of reviews. The model makes a hard attention through the composition from sentence representation into document representation. Specifically, we compute the importance weights of each sentence and incorporate them into the composition process of document representation. In the mixed-domain detection experiment, the results verify the effectiveness of our model by comparing with other neural network based methods. As the feature selection is very important in this direction, we make a feature combination to enhance the performance. Then we get 86.1% F1 value which outperform the state-of-the-art method. In the cross-domain detection experiment, our method has better robustness. (C) 2017 Published by Elsevier B.V.
引用
收藏
页码:33 / 41
页数:9
相关论文
共 50 条
  • [21] A deceptive review detection framework: Combination of coarse and fine-grained features
    Cao, Ning
    Ji, Shujuan
    Chiu, Dickson K. W.
    He, Mingxiang
    Sun, Xiaohong
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2020, 156
  • [22] Opinion Spam Detection Using Feature Selection
    Patel, Rinki
    Thakkar, Priyank
    [J]. 2014 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS, 2014, : 560 - 564
  • [23] Dynamic Feature Selection for Spam Detection in Twitter
    Karakasli, M. Salih
    Aydin, Muhammed Ali
    Yarkan, Serhan
    Boyaci, Ali
    [J]. INTERNATIONAL TELECOMMUNICATIONS CONFERENCE, ITELCON 2017, 2019, 504 : 239 - 250
  • [24] Detection of review spam: A survey
    Heydari, Atefeh
    Tavakoli, Mohammad Ali
    Salim, Naomie
    Heydari, Zahra
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (07) : 3634 - 3642
  • [25] Target Detection Using Sparse Representation With Element and Construction Combination Feature
    Liu, Haicang
    Li, Shutao
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2015, 64 (02) : 290 - 298
  • [26] GAIM: Graph-aware Feature Interactional Model for Spam Movie Review Detection
    Zhang, Lei
    Song, Xueqiang
    Zhao, Xiaoming
    Fang, Yuwei
    Li, Dong
    Wang, Haizhou
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 621 - 628
  • [27] Finding Rotten Eggs: A Review Spam Detection Model using Diverse Feature Sets
    Akram, Abubakker Usman
    Khan, Hikmat Ullah
    Iqbal, Saqib
    Iqbal, Tassawar
    Munir, Ehsan Ullah
    Shafi, Muhammad
    [J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (10): : 5120 - 5142
  • [28] Co-attention Based Feature Fusion Network for Spam Review Detection on Douban
    Cai, Huanyu
    Yu, Ke
    Zhou, Yuhao
    Wu, Xiaofei
    [J]. NEURAL PROCESSING LETTERS, 2022, 54 (06) : 5251 - 5271
  • [29] Co-attention Based Feature Fusion Network for Spam Review Detection on Douban
    Huanyu Cai
    Ke Yu
    Yuhao Zhou
    Xiaofei Wu
    [J]. Neural Processing Letters, 2022, 54 : 5251 - 5271
  • [30] Detecting Deceptive Review Spam via Attention-Based Neural Networks
    Wang, Xuepeng
    Liu, Kang
    Zhao, Jun
    [J]. NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2017, 2018, 10619 : 866 - 876