Product Bundle Identification using Semi-Supervised Learning

被引:12
|
作者
Tzaban, Hen [1 ]
Guy, Ido [2 ]
Greenstein-Messica, Asnat [1 ]
Dagan, Arnon [2 ]
Rokach, Lior [1 ]
Shapira, Bracha [1 ]
机构
[1] Ben Gurion Univ Negev, Beer Sheva, Israel
[2] eBay Res, Netanya, Israel
关键词
electronic commerce; ensemble learning; product bundling; self-training; semi-supervised learning; NOISE;
D O I
10.1145/3397271.3401128
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Many sellers on e-commerce platforms offer buyers product bundles, which package together two or more different items. The identification of such bundles is a necessary step to support a variety of related services, from recommendation to dynamic pricing. In this work, we present a comprehensive study of bundle identification on a large e-commerce website. Our analysis of bundle compared to non-bundle listed items reveals several key differentiating characteristics, spanning the listing's title, image, and attributes. Following, we experiment with a multi-modal classifier, which takes advantage of these characteristics as features. Our analysis also shows that a bundle indicator input by sellers tends to be highly noisy and carries only a weak signal. The bundle identification task therefore faces the challenge of having a small set of manually-labeled clean examples and a larger set of noisy-labeled examples, in conjunction with class imbalance due to the relative scarcity of bundles. Our experiments with basic supervised classifiers, using the manually-labeled and/or the noisy-labeled data for training, demonstrates only moderate performance. We therefore turn to a semi-supervised approach and propose GREED, a self-training ensemble-based algorithm with a greedy model selection. Our evaluation over two different meta-categories shows a superior performance of semi-supervised approaches for the bundle identification task, with GREED outperforming several semi-supervised alternatives. The combination of textual, image, and some metadata features is shown to yield the best performance, reaching an AUC of 0.89 and 0.92 for the two meta-categories, respectively.
引用
收藏
页码:791 / 800
页数:10
相关论文
共 50 条
  • [21] Using semi-supervised learning for question classification
    Tri, Nguyen Thanh
    Le, Nguyen Minh
    Shimazu, Akira
    COMPUTER PROCESSING OF ORIENTAL LANGUAGES, PROCEEDINGS: BEYOND THE ORIENT: THE RESEARCH CHALLENGES AHEAD, 2006, 4285 : 31 - +
  • [22] Semi-supervised Learning Using Siamese Networks
    Sahito, Attaullah
    Frank, Eibe
    Pfahringer, Bernhard
    AI 2019: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, 11919 : 586 - 597
  • [23] Safe Semi-Supervised Learning of Sum-Product Networks
    Trapp, Martin
    Madl, Tamas
    Peharz, Robert
    Pernkopf, Franz
    Trappl, Robert
    CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI2017), 2017,
  • [24] Identification of subject specific and functional consistent ROIs using semi-supervised learning
    Du, Yuhui
    Li, Hongming
    Wu, Hong
    Fan, Yong
    MEDICAL IMAGING 2012: IMAGE PROCESSING, 2012, 8314
  • [25] Seismic Horizon Identification Using Semi-Supervised Learning With Virtual Adversarial Training
    Wang, Fu
    Wu, Xinming
    Wang, Huazhong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [26] Semi-supervised learning for lithology identification using Laplacian support vector machine
    Li, Zerui
    Kang, Yu
    Feng, Deyong
    Wang, Xing-Mou
    Lv, Wenjun
    Chang, Ji
    Zheng, Wei Xing
    JOURNAL OF PETROLEUM SCIENCE AND ENGINEERING, 2020, 195
  • [27] Radio Frequency Fingerprinting Identification Using Semi-Supervised Learning with Meta Labels
    Tiantian Zhang
    Pinyi Ren
    Dongyang Xu
    Zhanyi Ren
    China Communications, 2023, 20 (12) : 78 - 95
  • [28] Radio Frequency Fingerprinting Identification Using Semi-Supervised Learning with Meta Labels
    Zhang, Tiantian
    Ren, Pinyi
    Xu, Dongyang
    Ren, Zhanyi
    CHINA COMMUNICATIONS, 2023, 20 (12) : 78 - 95
  • [29] Learning Semi-Supervised Representation Towards a Unified Optimization Framework for Semi-Supervised Learning
    Li, Chun-Guang
    Lin, Zhouchen
    Zhang, Honggang
    Guo, Jun
    2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2767 - 2775
  • [30] Semi-supervised learning by disagreement
    Zhou, Zhi-Hua
    Li, Ming
    KNOWLEDGE AND INFORMATION SYSTEMS, 2010, 24 (03) : 415 - 439