Classification Tree Pruning Under Covariate Shift

被引:0
|
作者
Galbraith, Nicholas R. [1 ]
Kpotufe, Samory [1 ]
机构
[1] Columbia Univ, Dept Stat, New York, NY 10027 USA
关键词
Decision trees; Q measurement; Nonhomogeneous media; Adaptation models; Measurement uncertainty; Drugs; Data models; Classification tree; covariate shift; Minkowski and Renyi dimensions; model selection and pruning; nonparametrics; RATES;
D O I
10.1109/TIT.2023.3308914
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the problem of pruning a classification tree, that is, selecting a suitable subtree that balances bias and variance, in common situations with inhomogeneous training data. Namely, assuming access to mostly data from a distribution P-X,P-Y, but little data from a desired distribution Q(X,Y) with different X-marginals, we present the first efficient procedure for optimal pruning in such situations, when cross-validation and other penalized variants are grossly inadequate. Optimality is derived with respect to a notion of average discrepancy P-X -> Q(X) (averaged over X space) which significantly relaxes a recent notion-termed transfer-exponent-shown to tightly capture the limits of classification under such a distribution shift. Our relaxed notion can be viewed as a measure of relative dimension between distributions, as it relates to existing notions of information such as the Minkowski and Renyi dimensions.
引用
收藏
页码:456 / 481
页数:26
相关论文
共 50 条
  • [1] An adapted surrogate kernel for classification under covariate shift
    Song, Hyun-Je
    Park, Seong-Bae
    [J]. APPLIED SOFT COMPUTING, 2018, 69 : 435 - 442
  • [2] Adaptive Automatic Sleep Stage Classification under Covariate Shift
    Khalighi, Sirvan
    Sousa, Teresa
    Nunes, Urbano
    [J]. 2012 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2012, : 2259 - 2262
  • [3] Robust Classification under Covariate Shift with Application to Active Learning
    Liu, Anqi
    [J]. THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 4307 - 4308
  • [4] Model selection under covariate shift
    Sugiyama, M
    Müller, KR
    [J]. ARTIFICIAL NEURAL NETWORKS: FORMAL MODELS AND THEIR APPLICATIONS - ICANN 2005, PT 2, PROCEEDINGS, 2005, 3697 : 235 - 240
  • [5] Robust Fairness under Covariate Shift
    Rezaei, Ashkan
    Liu, Anqi
    Memarrast, Omid
    Ziebart, Brian D.
    [J]. THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 9419 - 9427
  • [6] Discriminative Learning Under Covariate Shift
    Bickel, Steffen
    Brueckner, Michael
    Scheffer, Tobias
    [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2009, 10 : 2137 - 2155
  • [7] Conformal Prediction Under Covariate Shift
    Tibshirani, Ryan J.
    Barber, Rina Foygel
    Candes, Emmanuel J.
    Ramdas, Aaditya
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [8] COVARIATE SHIFT APPROACH FOR INVARIANT TEXTURE CLASSIFICATION
    Hassan, Ali
    Shaukat, Arslan
    [J]. 2013 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2013,
  • [9] Fairness Violations and Mitigation under Covariate Shift
    Singh, Harvineet
    Singh, Rina
    Mhasawade, Vishwali
    Chunara, Rumi
    [J]. PROCEEDINGS OF THE 2021 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, FACCT 2021, 2021, : 3 - 13
  • [10] Simulator Calibration under Covariate Shift with Kernels
    Kisamori, Keiichi
    Kanagawa, Motonobu
    Yamazaki, Keisuke
    [J]. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020, 108 : 1244 - 1252