Tree-Based Feature Transformation for Purchase Behavior Prediction

被引:4
|
作者
Hou, Chunyan [1 ]
Chen, Chen [2 ]
Wang, Jinsong [1 ]
机构
[1] Tianjin Univ Technol, Sch Comp & Commun Engn, Tianjin, Peoples R China
[2] Nankai Univ, Coll Comp & Control Engn, Tianjin, Peoples R China
来源
基金
中国国家自然科学基金;
关键词
feature transformation; purchase behavior prediction; DIMENSIONALITY;
D O I
10.1587/transinf.2017EDL8210
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In the era of e-commerce, purchase behavior prediction is one of the most important issues to promote both online companies' sales and the consumers' experience. The previous researches usually use the feature engineering and ensemble machine learning algorithms for the prediction. The performance really depends on designed features and the scalability of algorithms because the large-scale data and a lot of categorical features lead to huge samples and the high-dimensional feature. In this study, we explore an alternative to use tree-based Feature Transformation (FT) and simple machine learning algorithms (e.g. Logistic Regression). Random Forest (RF) and Gradient Boosting decision tree (GB) are used for FT. Then, the simple algorithm, rather than ensemble algorithms, is used to predict purchase behavior based on transformed features. Tree-based FT regards the leaves of trees as transformed features, and can learn high-order interactions among original features. Compared with RF, if GB is used for FT, simple algorithms are enough to achieve better performance.
引用
收藏
页码:1441 / 1444
页数:4
相关论文
共 50 条
  • [21] Feature Bundles and their Effect on the Performance of Tree-based Evolutionary Classification and Feature Selection Algorithms
    Neshatian, Kourosh
    Varn, Lucianne
    2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 1612 - 1619
  • [22] Statistical tree-based feature vector for content-based image retrieval
    Aghav-Palwe, Sushila
    Mishra, Dhirendra
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2020, 21 (04) : 556 - 563
  • [23] Graphs from Features: Tree-Based Graph Layout for Feature Analysis
    Minghim, Rosane
    Huancapaza, Liz
    Artur, Erasmo
    Telles, Guilherme P.
    Belizario, Ivar V.
    ALGORITHMS, 2020, 13 (11)
  • [24] Decision tree-based Feature Ranking in Concept Drifting Data Streams
    Pereira Karax, Jean Antonio
    Malucelli, Andreia
    Barddal, Jean Paul
    SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING, 2019, : 590 - 592
  • [25] An Aggregated Decision Tree-Based Learner for Renewable Integration Prediction
    Lu, Tianguang
    Ai, Qian
    Lee, Wei-Jen
    Wang, Zhe
    He, Hongying
    2018 IEEE INDUSTRY APPLICATIONS SOCIETY ANNUAL MEETING (IAS), 2018,
  • [26] Feature Scoring using Tree-Based Ensembles for Evolving Data Streams
    Gomes, Heitor Murilo
    de Mello, Rodrigo Fernandes
    Pfahringer, Bernhard
    Bifet, Albert
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2019, : 761 - 769
  • [27] Performance evaluation of feature selection and tree-based algorithms for traffic classification
    Aouedi, Ons
    Piamrat, Kandaraj
    Parrein, Benoit
    2021 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2021,
  • [28] Tree-Based Morse Regions: A Topological Approach to Local Feature Detection
    Xu, Yongchao
    Monasse, Pascal
    Geraud, Thierry
    Najman, Laurent
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (12) : 5612 - 5625
  • [29] Utilizing Hierarchies in Tree-Based Online Structured Output Prediction
    Osojnik, Aljaz
    Panov, Pance
    Dzeroski, Saso
    DISCOVERY SCIENCE (DS 2019), 2019, 11828 : 87 - 95
  • [30] Heart Disease Prediction Model Using Tree-based Methods
    Li, Yanran
    Liu, Yitong
    Luo, Jin
    Sun, Xiao
    2ND INTERNATIONAL CONFERENCE ON APPLIED MATHEMATICS, MODELLING, AND INTELLIGENT COMPUTING (CAMMIC 2022), 2022, 12259