Quantile Regression-Based Multiple Imputation of Missing Values - An Evaluation and Application to Corporal Punishment Data

被引:0
|
作者
Kleinke, Kristian [1 ]
Fritsch, Markus [2 ]
Stemmler, Mark [3 ]
Reinecke, Jost [4 ]
Loesel, Friedrich [3 ,5 ]
机构
[1] Univ Siegen, Dept Eductat Studies & Psychol, Adolf Reichwein Str 2a, D-57068 Siegen, Germany
[2] Univ Passau, Sch Business Econ & Informat Syst, Passau, Germany
[3] Univ Erlangen Nurnberg, Inst Psychol, Erlangen, Germany
[4] Univ Bielefeld, Fac Sociol, Bielefeld, Germany
[5] Univ Cambridge, Inst Criminol, Cambridge, England
关键词
missing values; multiple imputation; quantile regression; random forest; corporal punishment; parenting behavior; CHAINED EQUATIONS; MODELS; INFORMATION;
D O I
10.5964/meth.2317
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Quantile regression (QR) is a valuable tool for data analysis and multiple imputation (MI) of missing values - especially when standard parametric modelling assumptions are violated. Yet, Monte Carlo simulations that systematically evaluate QR-based MI in a variety of different practically relevant settings are still scarce. In this paper, we evaluate the method regarding the imputation of ordinal data and compare the results with other standard and robust imputation methods. We then apply QR-based MI to an empirical dataset, where we seek to identify risk factors for corporal punishment of children by their fathers. We compare the modelling results with previously published findings based on complete cases. Our Monte Carlo results highlight the advantages of QR-based MI over fully parametric imputation models: QR-based MI yields unbiased statistical inferences across large parts of the conditional distribution, when parametric modelling assumptions, such as normal and homoscedastic error terms, are violated. Regarding risk factors for corporal punishment, our MI results support previously published findings based on complete cases. Our empirical results indicate that the identified identified "missing at random" processes in the investigated dataset are negligible.
引用
下载
收藏
页码:205 / 230
页数:26
相关论文
共 50 条
  • [31] Development of Imputation Methods for Missing Data in Multiple Linear Regression Analysis
    Thongsri, Thidarat
    Samart, Klairung
    LOBACHEVSKII JOURNAL OF MATHEMATICS, 2022, 43 (11) : 3390 - 3399
  • [32] Multiple imputation for missing edge data: A predictive evaluation method with application to Add Health
    Wang, Cheng
    Butts, Carter T.
    Hipp, John R.
    Jose, Rupa
    Lakon, Cynthia M.
    SOCIAL NETWORKS, 2016, 45 : 89 - 98
  • [33] Copula and composite quantile regression-based estimating equations for longitudinal data
    Kangning Wang
    Wen Shan
    Annals of the Institute of Statistical Mathematics, 2021, 73 : 441 - 455
  • [34] AN APPLICATION OF SEQUENTIAL REGRESSION MULTIPLE IMPUTATION ON PANEL DATA
    Von Maltitz, Michael Johan
    Van der Merwe, Abraham Johannes
    SOUTH AFRICAN JOURNAL OF ECONOMICS, 2012, 80 (01) : 77 - 90
  • [35] Copula and composite quantile regression-based estimating equations for longitudinal data
    Wang, Kangning
    Shan, Wen
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2021, 73 (03) : 441 - 455
  • [36] Missing values in longitudinal dietary data: A multiple imputation approach based on a fully conditional specification
    Nevalainen, Jaakko
    Kenward, Michael G.
    Virtanen, Suvi A.
    STATISTICS IN MEDICINE, 2009, 28 (29) : 3657 - 3669
  • [37] Estimating missing values from the general social survey: An application of multiple imputation
    Penn, David A.
    SOCIAL SCIENCE QUARTERLY, 2007, 88 (02) : 573 - 584
  • [38] Multiple imputation scheme for overcoming the missing values and variability issues in ITS data
    Ni, DH
    Leonard, JD
    Guin, A
    Feng, CX
    JOURNAL OF TRANSPORTATION ENGINEERING, 2005, 131 (12) : 931 - 938
  • [39] A latent class based imputation method under Bayesian quantile regression framework using asymmetric Laplace distribution for longitudinal medication usage data with intermittent missing values
    Lee, Minjae
    Rahbar, Mohammad H.
    Gensler, Lianne S.
    Brown, Matthew
    Weisman, Michael
    Reveille, John D.
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2020, 30 (01) : 160 - 177
  • [40] An adaptive functional regression-based prognostic model for applications with missing data
    Fang, Xiaolei
    Zhou, Rensheng
    Gebraeel, Nagi
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2015, 133 : 266 - 274