RANDOMIZED GRADIENT BOOSTING MACHINE

Cited by: 14
Authors
Lu, Haihao [1]
Mazumder, Rahul [2, 3]
Affiliations
[1] Univ Chicago, Booth Sch Business, Chicago, IL 60637 USA
[2] MIT, Sloan Sch Management, Ctr Operat Res, Cambridge, MA 02142 USA
[3] MIT, Ctr Stat, Cambridge, MA 02142 USA
Keywords
gradient boosting; ensemble methods; convex optimization; coordinate descent; computational guarantees; first order methods; LOGISTIC-REGRESSION; CONDITION NUMBER; CONVERGENCE; OPTIMIZATION;
DOI
10.1137/18M1223277
Chinese Library Classification
O29 [Applied Mathematics]
Subject classification code
070104
Abstract
The Gradient Boosting Machine (GBM) introduced by Friedman [J. H. Friedman, Ann. Statist., 29 (2001), pp. 1189-1232] is a powerful supervised learning algorithm that is very widely used in practice; it routinely features as a leading algorithm in machine learning competitions such as Kaggle and the KDDCup. In spite of the usefulness of GBM in practice, our current theoretical understanding of this method is rather limited. In this work, we propose the Randomized Gradient Boosting Machine (RGBM), which leads to substantial computational gains compared to GBM by using a randomization scheme to reduce search in the space of weak learners. We derive novel computational guarantees for RGBM. We also provide a principled guideline towards better step-size selection in RGBM that does not require a line search. Our proposed framework is inspired by a special variant of coordinate descent that combines the benefits of randomized coordinate descent and greedy coordinate descent, and may be of independent interest as an optimization algorithm. As a special case, our results for RGBM lead to superior computational guarantees for GBM. Our computational guarantees depend upon a curious geometric quantity that we call the Minimal Cosine Angle, which relates to the density of weak learners in the prediction space. In a series of numerical experiments on real datasets, we demonstrate the effectiveness of RGBM over GBM in terms of obtaining a model with good training and/or testing data fidelity with a fraction of the computational cost.
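The coordinate-descent idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm: it assumes a least-squares objective, treats each column of a matrix `A` as a stand-in for one weak learner, and uses a fixed step of 1/L_j (the inverse coordinate curvature) in place of a line search. The function name `random_then_greedy_cd` and all parameter names are hypothetical.

```python
import random


def random_then_greedy_cd(A, b, subset_size, n_iters, seed=0):
    """Minimize 0.5 * ||A x - b||^2 with a random-then-greedy coordinate rule.

    Illustrative assumption: each iteration samples `subset_size` coordinates
    uniformly at random, then updates the sampled coordinate whose partial
    derivative has the largest magnitude.
    """
    rng = random.Random(seed)
    n_rows, n_cols = len(A), len(A[0])
    x = [0.0] * n_cols
    # residual = A x - b; with x = 0 this is simply -b.
    residual = [-bi for bi in b]
    # Coordinate curvature L_j = squared norm of column j.
    col_sq_norm = [sum(A[i][j] ** 2 for i in range(n_rows)) for j in range(n_cols)]

    for _ in range(n_iters):
        # Random step: restrict the search to a random subset of coordinates.
        candidates = rng.sample(range(n_cols), subset_size)
        # Partial derivative of the objective w.r.t. each candidate coordinate.
        grads = {
            j: sum(A[i][j] * residual[i] for i in range(n_rows))
            for j in candidates
        }
        # Greedy step: pick the steepest coordinate within the subset.
        j_best = max(candidates, key=lambda j: abs(grads[j]))
        if col_sq_norm[j_best] == 0.0:
            continue  # an all-zero column cannot change the objective
        # Fixed step size 1 / L_j, no line search.
        step = grads[j_best] / col_sq_norm[j_best]
        x[j_best] -= step
        for i in range(n_rows):
            residual[i] -= step * A[i][j_best]
    return x
```

Setting `subset_size` to the number of columns recovers fully greedy coordinate descent, and setting it to 1 recovers purely randomized coordinate descent; intermediate values trade per-iteration search cost against progress per iteration.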
Pages: 2780-2808
Page count: 29