Efficient Gradient Boosted Decision Tree Training on GPUs

被引:27
|
作者
Wen, Zeyi [1 ]
He, Bingsheng [1 ]
Ramamohanarao, Kotagiri [2 ]
Lu, Shengliang [1 ]
Shi, Jiashuai [1 ,3 ]
机构
[1] Natl Univ Singapore, Singapore, Singapore
[2] Univ Melbourne, Melbourne, Vic, Australia
[3] South China Univ Technol, Guangzhou, Guangdong, Peoples R China
关键词
CONSTRUCTION;
D O I
10.1109/IPDPS.2018.00033
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present a novel parallel implementation for training Gradient Boosting Decision Trees (GBDTs) on Graphics Processing Units (GPUs). Thanks to the wide use of the open sourced XGBoost library, GBDTs have become very popular in recent years and won many awards in machine learning and data mining competitions. Although GPUs have demonstrated their success in accelerating many machine learning applications, there are a series of key challenges of developing a GPU-based GBDT algorithm, including irregular memory accesses, many small sorting operations and varying data parallel granularities in tree construction. To tackle these challenges on GPUs, we propose various novel techniques (including Run-length Encoding compression and thread/ block workload dynamic allocation, and reusing intermediate training results for efficient gradient computation). Our experimental results show that our algorithm named GPU-GBDT is often 10 to 20 times faster than the sequential version of XGBoost, and achieves 1.5 to 2 times speedup over a 40 threaded XGBoost running on a relatively high-end workstation of 20 CPU cores. Moreover, GPU-GBDT outperforms its CPU counterpart by 2 to 3 times in terms of performance-price ratio.
引用
收藏
页码:234 / 243
页数:10
相关论文
共 50 条
  • [11] Scalable Full Hardware Logic Architecture for Gradient Boosted Tree Training
    Sadasue, Tamon
    Isshiki, Tsuyoshi
    [J]. 28TH IEEE INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2020, : 234 - 234
  • [12] A Statistical Approach to Predict Flight Delay Using Gradient Boosted Decision Tree
    Manna, Suvojit
    Biswas, Sanket
    Kundu, Riyanka
    Rakshit, Somnath
    Gupta, Priti
    Barman, Subhas
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN DATA SCIENCE (ICCIDS), 2017,
  • [13] Meta-Gradient Boosted Decision Tree Model for Weight and Target Learning
    Ustinovskiy, Yury
    Fedorova, Valentina
    Gusev, Gleb
    Serdyukov, Pavel
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
  • [14] A gradient boosted decision tree-based sentiment classification of twitter data
    Neelakandan, S.
    Paulraj, D.
    [J]. INTERNATIONAL JOURNAL OF WAVELETS MULTIRESOLUTION AND INFORMATION PROCESSING, 2020, 18 (04)
  • [15] LightGBM: A Highly Efficient Gradient Boosting Decision Tree
    Ke, Guolin
    Meng, Qi
    Finley, Thomas
    Wang, Taifeng
    Chen, Wei
    Ma, Weidong
    Ye, Qiwei
    Liu, Tie-Yan
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [16] Flutter analysis of cold tandem rolling mills based on gradient boosted decision tree
    Zhou X.
    Hao Y.
    Cong W.
    Wei Z.
    Wen G.
    [J]. Zhendong yu Chongji/Journal of Vibration and Shock, 2021, 40 (13): : 154 - 158
  • [17] Empirical Measurement of Performance Maintenance of Gradient Boosted Decision Tree Models for Malware Detection
    Galen, Colin
    Steele, Robert
    [J]. 3RD INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION (IEEE ICAIIC 2021), 2021, : 193 - 198
  • [18] A Gradient Boosted Decision Tree-Based Influencer Prediction in Social Network Analysis
    Subramani, Neelakandan
    Easwaramoorthy, Sathishkumar Veerappampalayam
    Mohan, Prakash
    Subramanian, Malliga
    Sambath, Velmurugan
    [J]. BIG DATA AND COGNITIVE COMPUTING, 2023, 7 (01)
  • [19] Predicting Parkinson disease related genes based on PyFeat and gradient boosted decision tree
    Helmy, Marwa
    Eldaydamony, Eman
    Mekky, Nagham
    Elmogy, Mohammed
    Soliman, Hassan
    [J]. SCIENTIFIC REPORTS, 2022, 12 (01):
  • [20] Gradient Boosted Decision Tree to Model Ustekinumab Trough Levels in Crohn's Disease
    Saleh, Adam A.
    Miroballi, Natalia
    Stading, Rachel
    Glassner, Kerri
    Abraham, Bincy
    [J]. AMERICAN JOURNAL OF GASTROENTEROLOGY, 2022, 117 (10): : S601 - S602