Deep Semi-supervised Learning with Weight Map for Review Helpfulness Prediction

被引:2
|
作者
Yin, Hua [1 ]
Hu, Zhensheng [1 ]
Peng, Yahui [2 ]
Wang, Zhijian [1 ]
Xu, Guanglong [3 ]
Xu, Yanfang [4 ]
机构
[1] Guangdong Univ Finance & Econ, Informat Sch, Guangzhou 510320, Guangdong, Peoples R China
[2] Sun Yat Sen Univ, Sch Elect & Informat Technol, Guangzhou 510275, Guangdong, Peoples R China
[3] Guangdong Univ Finance & Econ, Sch Stat & Math, Guangzhou 510320, Guangdong, Peoples R China
[4] Guangdong Univ Finance & Econ, Sch Art & Design, Guangzhou 510320, Guangdong, Peoples R China
关键词
Semi-supervised learning; Review helpfulness; Pseudo label; Weight map; Labeling strategy; EMOTIONS;
D O I
10.2298/CSIS201228044Y
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Helpful online product reviews, which include massive information, have large impacts on customers' purchasing decisions. In most of e-commerce platforms, the helpfulness of reviews are decided by the votes from other customers. Making full use of these reviews with votes has enormous commercial value, especially in product recommendation. It drives researchers to study the technologies about how to evaluate the review helpfulness automatically. Although Deep Neural Network(DNN), learning from the historical reviews and labels, computed by the votes, has demonstrated effective results, it still has suffered insufficient labeled reviews problem. When the helpfulness of a large number of reviews is unknown for lack of votes, or some useful latest reviews with less votes are submerged by the past reviews, the accuracy of current DNN model decreases quickly. Therefore, we propose an end-to-end deep semi-supervised learning model with weight map, which makes full use of the unlabeled reviews. The training process in this model is divided into three stages:obtaining base classifier by less labeled reviews, iteratively applying weight map strategy on large unlabeled reviews to obtain pseudo-labeled reviews, training on above combined reviews to obtain the re-training classifier. Based on this novel model, we develop an algorithm and conduct a series of experiments, on Amazon Review Dataset, from the aspects of the baseline neural network selection and the strategies comparisons, including two labeling and three weighting strategies. The experimental results demonstrate the effectiveness of our method on utilizing the unlabeled data. And our findings show that the model adopted batch labeling strategy and non-linear weight mapping method has achieved the best performance.
引用
收藏
页码:1159 / 1174
页数:16
相关论文
共 50 条
  • [31] A review of semi-supervised learning for text classification
    José Marcio Duarte
    Lilian Berton
    Artificial Intelligence Review, 2023, 56 : 9401 - 9469
  • [32] Deep Learning for Industrial KPI Prediction: When Ensemble Learning Meets Semi-Supervised Data
    Sun, Qingqiang
    Ge, Zhiqiang
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (01) : 260 - 269
  • [33] A review of semi-supervised learning for text classification
    Duarte, Jose Marcio
    Berton, Lilian
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (09) : 9401 - 9469
  • [34] Metric learning by similarity network for deep semi-supervised learning
    Wu, Sanyou
    Feng, Xingdong
    Zhou, Fan
    DEVELOPMENTS OF ARTIFICIAL INTELLIGENCE TECHNOLOGIES IN COMPUTATION AND ROBOTICS, 2020, 12 : 995 - 1002
  • [35] Semi-supervised deep learning for hyperspectral image classification
    Kang, Xudong
    Zhuo, Binbin
    Duan, Puhong
    REMOTE SENSING LETTERS, 2019, 10 (04) : 353 - 362
  • [36] SEMI-SUPERVISED DEEP LEARNING FOR OBJECT TRACKING AND CLASSIFICATION
    Doulamis, Nikolaos
    Doulamis, Anastasios
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 848 - 852
  • [37] Deep semi-supervised learning for brain tumor classification
    Chenjie Ge
    Irene Yu-Hua Gu
    Asgeir Store Jakola
    Jie Yang
    BMC Medical Imaging, 20
  • [38] Semi-supervised Deep Learning for Network Anomaly Detection
    Sun, Yuanyuan
    Guo, Lili
    Li, Ye
    Xu, Lele
    Wang, Yongming
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2019, PT II, 2020, 11945 : 383 - 390
  • [39] Semi-Supervised Image Registration using Deep Learning
    Sedghi, Alireza
    Luo, Jie
    Mehrtash, Alireza
    Pieper, Steve
    Tempany, Clare M.
    Kapur, Tina
    Mousavi, Parvin
    Wells, William M., III
    MEDICAL IMAGING 2019: IMAGE-GUIDED PROCEDURES, ROBOTIC INTERVENTIONS, AND MODELING, 2019, 10951
  • [40] Semi-supervised Learning
    Adams, Niall
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 2009, 172 : 530 - 530