End-to-end Feature Selection Approach for Learning Skinny Trees

被引:0
|
作者
Ibrahim, Shibal [1 ]
Behdin, Kayhan [1 ]
Mazumder, Rahul [1 ]
机构
[1] MIT, 77 Massachusetts Ave, Cambridge, MA 02139 USA
关键词
MUTUAL INFORMATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a new optimization-based approach for feature selection in tree ensembles, an important problem in statistics and machine learning. Popular tree ensemble toolkits e.g., Gradient Boosted Trees and Random Forests support feature selection post-training based on feature importance scores, while very popular, they are known to have drawbacks. We propose Skinny Trees: an end-to-end toolkit for feature selection in tree ensembles where we train a tree ensemble while controlling the number of selected features. Our optimization-based approach learns an ensemble of differentiable trees, and simultaneously performs feature selection using a grouped l0-regularizer. We use first-order methods for optimization and present convergence guarantees for our approach. We use a dense-to-sparse regularization scheduling scheme that can lead to more expressive and sparser tree ensembles. On 15 synthetic and real-world datasets, Skinny Trees can achieve 1.5 620 feature compression rates, leading up to 10 faster inference over dense trees, without any loss in performance. Skinny Trees lead to superior feature selection than many existing toolkits e.g., in terms of AUC performance for 25% feature budget, Skinny Trees outperforms LightGBM by 10.2% (up to 37.7%), and Random Forests by 3% (up to 12.5%).
引用
收藏
页数:27
相关论文
共 50 条
  • [1] Feature Analysis and Selection for Training an End-to-End Autonomous Vehicle Controller Using Deep Learning Approach
    Yang, Shun
    Wang, Wenshuo
    Liu, Chang
    Deng, Weiwen
    Hedrick, J. Karl
    2017 28TH IEEE INTELLIGENT VEHICLES SYMPOSIUM (IV 2017), 2017, : 1033 - 1038
  • [2] End-to-End Learning of Decision Trees and Forests
    Hehn, Thomas M.
    Kooij, Julian F. P.
    Hamprecht, Fred A.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2020, 128 (04) : 997 - 1011
  • [3] End-to-End Learning of Decision Trees and Forests
    Thomas M. Hehn
    Julian F. P. Kooij
    Fred A. Hamprecht
    International Journal of Computer Vision, 2020, 128 : 997 - 1011
  • [4] Adaptive Feature Selection for End-to-End Speech Translation
    Zhang, Biao
    Titov, Ivan
    Haddow, Barry
    Sennrich, Rico
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 2533 - 2544
  • [5] Satellite selection with an end-to-end deep learning network
    Huang, Panpan
    Rizos, Chris
    Roberts, Craig
    GPS SOLUTIONS, 2018, 22 (04)
  • [6] Satellite selection with an end-to-end deep learning network
    Panpan Huang
    Chris Rizos
    Craig Roberts
    GPS Solutions, 2018, 22
  • [7] An end-to-end functional spiking model for sequential feature learning
    Xie, Xiurui
    Liu, Guisong
    Cai, Qing
    Sun, Guolin
    Zhang, Malu
    Qu, Hong
    KNOWLEDGE-BASED SYSTEMS, 2020, 195
  • [8] LCSNet: End-to-end Lipreading with Channel-aware Feature Selection
    Xue, Feng
    Yang, Tian
    Liu, Kang
    Hong, Zikun
    Cao, Mingwei
    Guo, Dan
    Hong, Richang
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (01)
  • [9] Effects of environmental feature selection on end-to-end vehicle steering controller
    Liu, Dongjie
    Zhao, Jin
    Cao, Zhuo
    Huang, Xinnian
    Xi, Axing
    JOURNAL OF ENGINEERING-JOE, 2020, 2020 (13): : 448 - 453
  • [10] A Novel End-To-End Feature Selection and Diagnosis Method for Rotating Machinery
    Wang, Gang
    Zhao, Yang
    Zhang, Jiasi
    Ning, Yongjie
    SENSORS, 2021, 21 (06) : 1 - 25