SketchBoost: Fast Gradient Boosted Decision Tree for Multioutput Problems

被引:0
|
作者
Iosipoi, Leonid [1 ,2 ]
Vakhrushev, Anton [1 ]
机构
[1] Sber AI Lab, Moscow, Russia
[2] HSE Univ, Moscow, Russia
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Gradient Boosted Decision Tree (GBDT) is a widely-used machine learning algorithm that has been shown to achieve state-of-the-art results on many standard data science problems. We are interested in its application to multioutput problems when the output is highly multidimensional. Although there are highly effective GBDT implementations, their scalability to such problems is still unsatisfactory. In this paper, we propose novel methods aiming to accelerate the training process of GBDT in the multioutput scenario. The idea behind these methods lies in the approximate computation of a scoring function used to find the best split of decision trees. These methods are implemented in SketchBoost, which itself is integrated into our easily customizable Python-based GPU implementation of GBDT called Py-Boost. Our numerical study demonstrates that SketchBoost speeds up the training process of GBDT by up to over 40 times while achieving comparable or even better performance.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Efficient Gradient Boosted Decision Tree Training on GPUs
    Wen, Zeyi
    He, Bingsheng
    Ramamohanarao, Kotagiri
    Lu, Shengliang
    Shi, Jiashuai
    [J]. 2018 32ND IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM (IPDPS), 2018, : 234 - 243
  • [2] Gradient Boosted Decision Tree Algorithms for Medicare Fraud Detection
    Hancock J.T.
    Khoshgoftaar T.M.
    [J]. SN Computer Science, 2021, 2 (4)
  • [3] Comparison of Decision Tree Classification Methods and Gradient Boosted Trees
    Dikananda, Arif Rinaldi
    Jumini, Sri
    Tarihoran, Nafan
    Christinawati, Santy
    Trimastuti, Wahyu
    Rahim, Robbi
    [J]. TEM JOURNAL-TECHNOLOGY EDUCATION MANAGEMENT INFORMATICS, 2022, 11 (01): : 316 - 322
  • [4] An Extension of Gradient Boosted Decision Tree incorporating Statistical Tests
    Sakata, Ryuji
    Ohama, Iku
    Taniguchi, Tadahiro
    [J]. 2018 18TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2018, : 964 - 969
  • [5] Scalable hardware architecture for fast gradient boosted tree training
    Sadasue, Tamon
    Tanaka, Takuya
    Kasahara, Ryosuke
    Darmawan, Arief
    Isshiki, Tsuyoshi
    [J]. IPSJ Transactions on System LSI Design Methodology, 2021, 14 : 11 - 20
  • [6] Gradient Boosted Decision Tree based Classification for Recognizing Human Behavior
    Priyadarshini, R. K.
    Banu, Bazila A.
    Nagamani, T.
    [J]. PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING & COMMUNICATION ENGINEERING (ICACCE-2019), 2019,
  • [7] A Statistical Approach to Predict Flight Delay Using Gradient Boosted Decision Tree
    Manna, Suvojit
    Biswas, Sanket
    Kundu, Riyanka
    Rakshit, Somnath
    Gupta, Priti
    Barman, Subhas
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN DATA SCIENCE (ICCIDS), 2017,
  • [8] Meta-Gradient Boosted Decision Tree Model for Weight and Target Learning
    Ustinovskiy, Yury
    Fedorova, Valentina
    Gusev, Gleb
    Serdyukov, Pavel
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [9] A gradient boosted decision tree-based sentiment classification of twitter data
    Neelakandan, S.
    Paulraj, D.
    [J]. INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2020, 18 (04)
  • [10] Empirical Measurement of Performance Maintenance of Gradient Boosted Decision Tree Models for Malware Detection
    Galen, Colin
    Steele, Robert
    [J]. 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION (IEEE ICAIIC 2021), 2021, : 193 - 198