Document representation and feature combination for deceptive spam review detection

被引:96
|
作者
Li, Luyang [1 ]
Qin, Bing [1 ]
Ren, Wenjing [1 ]
Liu, Ting [1 ]
机构
[1] Harbin Inst Technol, Res Ctr Social Comp & Informat Retrieval, Harbin, Peoples R China
基金
中国国家自然科学基金;
关键词
Spam review detection; Opinion spam; Representation learning; PREDICTING DECEPTION;
D O I
10.1016/j.neucom.2016.10.080
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deceptive spam reviews of products or service are harmful for customers in decision making. Existing approaches to detect deceptive spam reviews are concerned in feature designing. Hand-crafted features can show some linguistic phenomena, however can hardly reveal the latent semantic meaning of the review. We present a neural network based model to learn the representation of reviews. The model makes a hard attention through the composition from sentence representation into document representation. Specifically, we compute the importance weights of each sentence and incorporate them into the composition process of document representation. In the mixed-domain detection experiment, the results verify the effectiveness of our model by comparing with other neural network based methods. As the feature selection is very important in this direction, we make a feature combination to enhance the performance. Then we get 86.1% F1 value which outperform the state-of-the-art method. In the cross-domain detection experiment, our method has better robustness. (C) 2017 Published by Elsevier B.V.
引用
收藏
页码:33 / 41
页数:9
相关论文
共 50 条
  • [31] A consensus pattern of content feature and link feature for web spam detection
    Gao, Shuang
    Zhang, Huaxiang
    Liu, Li
    Fang, Xiaonan
    [J]. Zhang, H. (824223485@163.com), 1600, Binary Information Press (10): : 3759 - 3766
  • [32] RLOSD: Representation Learning based Opinion Spam Detection
    Sedighi, Zeinab
    Ebrahimpour-Komleh, Hossein
    Bagheri, Ayoub
    [J]. 2017 3RD IRANIAN CONFERENCE ON SIGNAL PROCESSING AND INTELLIGENT SYSTEMS (ICSPIS), 2017, : 74 - 80
  • [33] Semantic Representation Based on Deep Learning for Spam Detection
    Saidani, Nadjate
    Adi, Kamel
    Allili, Mohand Said
    [J]. FOUNDATIONS AND PRACTICE OF SECURITY, FPS 2019, 2020, 12056 : 72 - 81
  • [34] On feature extraction for spam e-mail detection
    Gunal, Serkan
    Ergin, Semih
    Gulmezoglu, M. Bilginer
    Gerek, O. Nezih
    [J]. MULTIMEDIA CONTENT REPRESENTATION, CLASSIFICATION AND SECURITY, 2006, 4105 : 635 - 642
  • [35] Genetic-based Feature Selection for Spam Detection
    Arani, Seyyed Hossein Seyyedi
    Mozaffari, Saeed
    [J]. 2013 21ST IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2013,
  • [36] Concentration Based Feature Construction Approach for Spam Detection
    Tan, Ying
    Deng, Chao
    Ruan, Guangchen
    [J]. IJCNN: 2009 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1- 6, 2009, : 510 - 515
  • [37] Towards Automated Comprehensive Feature Engineering for Spam Detection
    Kiwanuka, Fred N.
    Alqatawna, Ja'far
    Amin, Anang Hudaya Muhamad
    Paul, Sujni
    Faris, Hossam
    [J]. PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS SECURITY AND PRIVACY (ICISSP), 2019, : 429 - 437
  • [38] Research of Deceptive Review Detection Based on Target Product Identification and Metapath Feature Weight Calculation
    Yuan, Ling
    Li, Dan
    Wei, Shikang
    Wang, Mingli
    [J]. COMPLEXITY, 2018,
  • [39] Spam Detection Using Feature Selection and Parameters Optimization
    Lee, Sang Min
    Kim, Dong Seong
    Kim, Ji Ho
    Park, Jong Sou
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPLEX, INTELLIGENT AND SOFTWARE INTENSIVE SYSTEMS (CISIS 2010), 2010, : 883 - 888
  • [40] Deceptive consumer review detection: a survey
    Dushyanthi U. Vidanagama
    Thushari P. Silva
    Asoka S. Karunananda
    [J]. Artificial Intelligence Review, 2020, 53 : 1323 - 1352