Missing data imputation, matching and other applications of random recursive partitioning

被引:24
|
作者
Iacus, Stefano A. [1 ]
Porro, Giuseppe [2 ]
机构
[1] Univ Milan, Dept Econ Business & Stat, I-20122 Milan, Italy
[2] Univ Trieste, Dept Econ & Stat, I-34127 Trieste, Italy
关键词
recursive partitioning; average treatment effect estimation; classification; missing data imputation;
D O I
10.1016/j.csda.2006.12.036
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Applications of the random recursive partitioning (RRP) method are described. This method generates a proximity matrix which can be used in non-parametric matching problems such as hot-deck missing data imputation and average treatment effect estimation. RRP is a Monte Carlo procedure that randomly generates non-empty recursive partitions of the data and calculates the proximity between observations as the empirical frequency in the same cell of these random partitions over all the replications. Also, the method in the presence of missing data is invariant under monotonic transformations of the data but no other formal properties of the method are known yet. Therefore, Monte Carlo experiments were conducted in order to explore the performance of the method. A companion software is available as a package for the R statistical environment. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:773 / 789
页数:17
相关论文
共 50 条
  • [41] Missing data analysis and imputation via latent Gaussian Markov random felds
    Department of Mathematics, School of Industrial Engineering, Albacete, Universidad de Castilla-La Mancha, Spain
    不详
    不详
    [J]. SORT, 2 (217-243):
  • [42] Missing Data Imputation With Bayesian Maximum Entropy for Internet of Things Applications
    Gonzalez-Vidal, Aurora
    Rathore, Punit
    Rao, Aravinda S.
    Mendoza-Bernal, Jose
    Palaniswami, Marimuthu
    Skarmeta-Gomez, Antonio F.
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (21) : 16108 - 16120
  • [43] Imputation of missing well log data by random forest and its uncertainty analysis
    Feng, Runhai
    Grana, Dario
    Balling, Niels
    [J]. COMPUTERS & GEOSCIENCES, 2021, 152
  • [44] Recursive Partitioning Methods for Data Imputation in the Context of Item Response Theory: A Monte Carlo Simulation
    Edwards, Julianne M.
    Finch, W. Holmes
    [J]. PSICOLOGICA, 2018, 39 (01): : 88 - 117
  • [45] Reviewing autoencoders for missing data imputation: Technical trends, applications and outcomes
    Pereira, Ricardo Cardoso
    Santos, Miriam Seoane
    Rodrigues, Pedro Pereira
    Abreu, Pedro Henriques
    [J]. Journal of Artificial Intelligence Research, 2020, 69 : 1255 - 1285
  • [46] Cost-effectiveness analysis of clinical trials with missing data: using multiple imputation to address data missing not at random
    Leurent, Baptiste
    Gomes, Manuel
    Carpenter, James
    [J]. TRIALS, 2017, 18
  • [47] Reviewing Autoencoders for Missing Data Imputation: Technical Trends, Applications and Outcomes
    Pereira, Ricardo Cardoso
    Santos, Miriam Seoane
    Rodrigues, Pedro Pereira
    Abreu, Pedro Henriques
    [J]. JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2020, 69 : 1255 - 1285
  • [48] Missing data analysis and imputation via latent Gaussian Markov random fields
    Gomez-Rubio, Virgilio
    Cameletti, Michela
    Blangiardo, Marta
    [J]. SORT-STATISTICS AND OPERATIONS RESEARCH TRANSACTIONS, 2022, 46 (02) : 217 - 244
  • [49] Dual strategy based missing completely at random type missing data imputation on the internet of medical things
    Punitha, P. Iris
    Sathiaseelan, J. G. R.
    [J]. INTERNATIONAL JOURNAL OF INTELLIGENT ENGINEERING INFORMATICS, 2023, 11 (04) : 317 - 336
  • [50] Propensity score matching after multiple imputation when a confounder has missing data
    Segalas, Corentin
    Leyrat, Clemence
    R. Carpenter, James
    Williamson, Elizabeth
    [J]. STATISTICS IN MEDICINE, 2023, 42 (07) : 1082 - 1095