RANDOMIZED GRADIENT BOOSTING MACHINE

被引:14
|
作者
Lu, Haihao [1 ]
Mazumder, Rahul [2 ,3 ]
机构
[1] Univ Chicago, Booth Sch Business, Chicago, IL 60637 USA
[2] MIT, Sloan Sch Management, Ctr Operat Res, Cambridge, MA 02142 USA
[3] MIT, Ctr Stat, Cambridge, MA 02142 USA
关键词
gradient boosting; ensemble methods; convex optimization; coordinate descent; computational guarantees; first order methods; LOGISTIC-REGRESSION; CONDITION NUMBER; CONVERGENCE; OPTIMIZATION;
D O I
10.1137/18M1223277
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
The Gradient Boosting Machine (GBM) introduced by Friedman [J. H. Friedman, Ann. Statist., 29 (2001), pp. 1189-1232] is a powerful supervised learning algorithm that is very widely used in practice-it routinely features as a leading algorithm in machine learning competitions such as Kaggle and the KDDCup. In spite of the usefulness of GBM in practice, our current theoretical understanding of this method is rather limited. In this work, we propose the Randomized Gradient Boosting Machine (RGBM), which leads to substantial computational gains compared to GBM by using a randomization scheme to reduce search in the space of weak learners. We derive novel computational guarantees for RGBM. We also provide a principled guideline towards better step-size selection in RGBM that does not require a line search. Our proposed framework is inspired by a special variant of coordinate descent that combines the benefits of randomized coordinate descent and greedy coordinate descent, and may be of independent interest as an optimization algorithm. As a special case, our results for RGBM lead to superior computational guarantees for GBM. Our computational guarantees depend upon a curious geometric quantity that we call the Minimal Cosine Angle, which relates to the density of weak learners in the prediction space. On a series of numerical experiments on real datasets, we demonstrate the effectiveness of RGBM over GBM in terms of obtaining a model with good training and/or testing data fidelity with a fraction of the computational cost.
引用
下载
收藏
页码:2780 / 2808
页数:29
相关论文
共 50 条
  • [31] Epistasis detection using a permutation-based gradient boosting machine
    Che, Kai
    Liu, Xiaoyan
    Guo, Maozu
    Zhang, Junwei
    Wang, Lei
    Zhang, Yin
    2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 1247 - 1252
  • [32] An Adjective Selection Personality Assessment Method Using Gradient Boosting Machine Learning
    Fernandes, Bruno
    Gonzalez-Briones, Alfonso
    Novais, Paulo
    Calafate, Miguel
    Analide, Cesar
    Neves, Jose
    PROCESSES, 2020, 8 (05)
  • [33] Prediction of Cardiotoxicity for Breast Cancer Patients Using Light Gradient Boosting Machine
    Jiang, Z.
    Diao, P.
    Liang, Y.
    Dai, K.
    Li, H.
    Wang, H.
    Chen, Y.
    Lu, M.
    Kuang, Y.
    MEDICAL PHYSICS, 2021, 48 (06)
  • [34] In-Vehicle Network Anomaly Detection Using Extreme Gradient Boosting Machine
    Anjum, Afia
    Agbaje, Paul
    Hounsinou, Sena
    Olufowobi, Habeeb
    2022 11TH MEDITERRANEAN CONFERENCE ON EMBEDDED COMPUTING (MECO), 2022, : 100 - 105
  • [35] Comparative Study of Electricity-Theft Detection Based on Gradient Boosting Machine
    Yan, Zhongzong
    Wen, He
    2021 IEEE INTERNATIONAL INSTRUMENTATION AND MEASUREMENT TECHNOLOGY CONFERENCE (I2MTC 2021), 2021,
  • [36] Prediction of heating and cooling loads based on light gradient boosting machine algorithms
    Guo, Jiaxin
    Yun, Sining
    Meng, Yao
    He, Ning
    Ye, Dongfu
    Zhao, Zeni
    Jia, Lingyun
    Yang, Liu
    BUILDING AND ENVIRONMENT, 2023, 236
  • [37] Assessing Susceptibility of Debris Flow in Southwest China Using Gradient Boosting Machine
    Di, Baofeng
    Zhang, Hanyue
    Liu, Yongyao
    Li, Jierui
    Chen, Ningsheng
    Stamatopoulos, Constantine A.
    Luo, Yuzhou
    Zhan, Yu
    SCIENTIFIC REPORTS, 2019, 9 (1)
  • [38] Hybrid classification of Android malware based on fuzzy clustering and the gradient boosting machine
    Altyeb Altaher Taha
    Sharaf Jameel Malebary
    Neural Computing and Applications, 2021, 33 : 6721 - 6732
  • [39] A Light Gradient Boosting Machine for Remainning Useful Life Estimation of Aircraft Engines
    Li, Fei
    Zhang, Li
    Chen, Bin
    Gao, Dianzhu
    Cheng, Yijun
    Zhang, Xiaoyong
    Yang, Yingze
    Gao, Kai
    Huang, Zhiwu
    Peng, Jun
    2018 21ST INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2018, : 3562 - 3567
  • [40] Bus Travel Time Prediction Based on Light Gradient Boosting Machine Algorithm
    Wang F.-J.
    Wang F.-J.
    Wang Y.-C.
    Bian C.
    Jiaotong Yunshu Xitong Gongcheng Yu Xinxi/Journal of Transportation Systems Engineering and Information Technology, 2019, 19 (02): : 116 - 121