Efficient Gradient Boosted Decision Tree Training on GPUs

被引:27
|
作者
Wen, Zeyi [1 ]
He, Bingsheng [1 ]
Ramamohanarao, Kotagiri [2 ]
Lu, Shengliang [1 ]
Shi, Jiashuai [1 ,3 ]
机构
[1] Natl Univ Singapore, Singapore, Singapore
[2] Univ Melbourne, Melbourne, Vic, Australia
[3] South China Univ Technol, Guangzhou, Guangdong, Peoples R China
关键词
CONSTRUCTION;
D O I
10.1109/IPDPS.2018.00033
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present a novel parallel implementation for training Gradient Boosting Decision Trees (GBDTs) on Graphics Processing Units (GPUs). Thanks to the wide use of the open sourced XGBoost library, GBDTs have become very popular in recent years and won many awards in machine learning and data mining competitions. Although GPUs have demonstrated their success in accelerating many machine learning applications, there are a series of key challenges of developing a GPU-based GBDT algorithm, including irregular memory accesses, many small sorting operations and varying data parallel granularities in tree construction. To tackle these challenges on GPUs, we propose various novel techniques (including Run-length Encoding compression and thread/ block workload dynamic allocation, and reusing intermediate training results for efficient gradient computation). Our experimental results show that our algorithm named GPU-GBDT is often 10 to 20 times faster than the sequential version of XGBoost, and achieves 1.5 to 2 times speedup over a 40 threaded XGBoost running on a relatively high-end workstation of 20 CPU cores. Moreover, GPU-GBDT outperforms its CPU counterpart by 2 to 3 times in terms of performance-price ratio.
引用
收藏
页码:234 / 243
页数:10
相关论文
共 50 条
  • [1] Exploiting GPUs for Efficient Gradient Boosting Decision Tree Training
    Wen, Zeyi
    Shi, Jiashuai
    He, Bingsheng
    Chen, Jian
    Ramamohanarao, Kotagiri
    Li, Qinbin
    [J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2019, 30 (12) : 2706 - 2717
  • [2] swGBDT: Efficient Gradient Boosted Decision Tree on Sunway Many-Core Processor
    Yin, Bohong
    Li, Yunchun
    Dun, Ming
    You, Xin
    Yang, Hailong
    Luan, Zhongzhi
    Qian, Depei
    [J]. SUPERCOMPUTING FRONTIERS (SCFA 2020), 2020, 12082 : 67 - 86
  • [3] Adversarial Training of Gradient-Boosted Decision Trees
    Calzavara, Stefano
    Lucchese, Claudio
    Tolomei, Gabriele
    [J]. PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2429 - 2432
  • [4] Gradient Boosted Decision Tree Algorithms for Medicare Fraud Detection
    Hancock J.T.
    Khoshgoftaar T.M.
    [J]. SN Computer Science, 2021, 2 (4)
  • [5] Comparison of Decision Tree Classification Methods and Gradient Boosted Trees
    Dikananda, Arif Rinaldi
    Jumini, Sri
    Tarihoran, Nafan
    Christinawati, Santy
    Trimastuti, Wahyu
    Rahim, Robbi
    [J]. TEM JOURNAL-TECHNOLOGY EDUCATION MANAGEMENT INFORMATICS, 2022, 11 (01): : 316 - 322
  • [6] An Extension of Gradient Boosted Decision Tree incorporating Statistical Tests
    Sakata, Ryuji
    Ohama, Iku
    Taniguchi, Tadahiro
    [J]. 2018 18TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2018, : 964 - 969
  • [7] SketchBoost: Fast Gradient Boosted Decision Tree for Multioutput Problems
    Iosipoi, Leonid
    Vakhrushev, Anton
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [8] Scalable hardware architecture for fast gradient boosted tree training
    Sadasue, Tamon
    Tanaka, Takuya
    Kasahara, Ryosuke
    Darmawan, Arief
    Isshiki, Tsuyoshi
    [J]. IPSJ Transactions on System LSI Design Methodology, 2021, 14 : 11 - 20
  • [9] Finding Influential Training Samples for Gradient Boosted Decision Trees
    Sharchilev, Boris
    Ustinovsky, Yury
    Serdyukov, Pavel
    de Rijke, Maarten
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [10] Gradient Boosted Decision Tree based Classification for Recognizing Human Behavior
    Priyadarshini, R. K.
    Banu, Bazila A.
    Nagamani, T.
    [J]. PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING & COMMUNICATION ENGINEERING (ICACCE-2019), 2019,