Missing data imputation, matching and other applications of random recursive partitioning

被引:24
|
作者
Iacus, Stefano A. [1 ]
Porro, Giuseppe [2 ]
机构
[1] Univ Milan, Dept Econ Business & Stat, I-20122 Milan, Italy
[2] Univ Trieste, Dept Econ & Stat, I-34127 Trieste, Italy
关键词
recursive partitioning; average treatment effect estimation; classification; missing data imputation;
D O I
10.1016/j.csda.2006.12.036
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Applications of the random recursive partitioning (RRP) method are described. This method generates a proximity matrix which can be used in non-parametric matching problems such as hot-deck missing data imputation and average treatment effect estimation. RRP is a Monte Carlo procedure that randomly generates non-empty recursive partitions of the data and calculates the proximity between observations as the empirical frequency in the same cell of these random partitions over all the replications. Also, the method in the presence of missing data is invariant under monotonic transformations of the data but no other formal properties of the method are known yet. Therefore, Monte Carlo experiments were conducted in order to explore the performance of the method. A companion software is available as a package for the R statistical environment. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:773 / 789
页数:17
相关论文
共 50 条
  • [1] Recursive partitioning for missing data imputation in the presence of interaction effects
    Doove, L. L.
    Van Buuren, S.
    Dusseldorp, E.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 72 : 92 - 104
  • [2] Recursive partitioning for monotone missing at random longitudinal markers
    Stock, Shannon
    DeGruttola, Victor
    [J]. STATISTICS IN MEDICINE, 2013, 32 (06) : 978 - 994
  • [3] Multiple imputation of ordinal missing not at random data
    Hammon, Angelina
    [J]. ASTA-ADVANCES IN STATISTICAL ANALYSIS, 2023, 107 (04) : 671 - 692
  • [4] Multiple imputation of ordinal missing not at random data
    Angelina Hammon
    [J]. AStA Advances in Statistical Analysis, 2023, 107 : 671 - 692
  • [5] Autoreplicative random forests with applications to missing value imputation
    Antonenko, Ekaterina
    Carreno, Ander
    Read, Jesse
    [J]. MACHINE LEARNING, 2024, 113 (10) : 7617 - 7643
  • [6] Siamese Autoencoder Architecture for the Imputation of Data Missing Not at Random
    Pereira, Ricardo Cardoso
    Abreu, Pedro Henriques
    Rodrigues, Pedro Pereira
    [J]. JOURNAL OF COMPUTATIONAL SCIENCE, 2024, 78
  • [7] Identifiable Generative Models for Missing Not at Random Data Imputation
    Ma, Chao
    Zhang, Cheng
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [8] Deep Generative Imputation Model for Missing Not At Random Data
    Chen, Jialei
    Xu, Yuanbo
    Wang, Pengyang
    Yang, Yongjian
    [J]. PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 316 - 325
  • [9] Multiple imputation of binary multilevel missing not at random data
    Hammon, Angelina
    Zinn, Sabine
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2020, 69 (03) : 547 - 564
  • [10] Efficient random imputation for missing data in complex surveys
    Chen, J
    Rao, JNK
    Sitter, RR
    [J]. STATISTICA SINICA, 2000, 10 (04) : 1153 - 1169