Regression-based imputation of explanatory discrete missing data

Cited by: 1
Authors
Hernandez-Herrera, Gilma [1, 2]
Navarro, Albert [1]
Morina, David [3]
Affiliations
[1] Univ Autonoma Barcelona, Res Grp Psychosocial Risks, Unitat Bioestadist, Fac Med,Org Work & Hlth POWAH, Barcelona, Spain
[2] Univ Antioquia, Fac Med, Inst Invest Med, Medellin, Colombia
[3] Univ Barcelona, Dept Econometr Stat & Appl Econ, Riskctr IREA, Barcelona, Spain
Keywords
COMPoisson; Count data; Hermite; Missing data; Multiple imputation; Zero-inflated; Zero-inflated Poisson; Generalized Hermite; Semiparametric estimation; Model
DOI
10.1080/03610918.2022.2149805
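Chinese Library Classification (CLC) number
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Discipline classification code
020208; 070103; 0714
Abstract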
Imputing missing values is a strategy for handling non-response in surveys or data loss in measurement processes, and it may be more effective than ignoring the losses and omitting the incomplete records. The characteristics of the variable with missing values must be considered when choosing an imputation method; in particular, the literature on imputing count variables is scarce. If the variable has an excess of zeros, models that include parameters for zero-inflation need to be considered. Likewise, if over- or under-dispersion is observed, generalizations of the Poisson distribution, such as the Hermite or Conway-Maxwell-Poisson (COMPoisson) distributions, are recommended for imputation. The aim of this study was to assess the performance of various regression models based on Poisson generalizations in imputing a discrete variable, in comparison with classical count models, through a comprehensive simulation study covering a variety of scenarios and a real data example. To do so, we compared estimates obtained using only complete data with estimates obtained using imputations based on the most common count models. The COMPoisson distribution generally gives better results in every dispersion scenario, especially when the amount of missing information is large.
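
To make the idea of regression-based imputation of a count variable concrete, here is a minimal sketch in Python. It is not the authors' implementation: it uses a plain Poisson regression as the imputation model (the classical baseline, not the Hermite or COMPoisson models the study evaluates), and the function names `fit_poisson` and `impute_counts`, the simulated data, and the 30% missingness rate are all assumptions made for illustration. Coefficients are drawn from their approximate normal sampling distribution before each imputation so that the imputations reflect parameter uncertainty.

```python
import numpy as np

def fit_poisson(X, y, n_iter=50, tol=1e-8):
    """Fit a log-link Poisson regression by Newton's method.

    X must contain a column of ones in position 0 (the intercept).
    Returns the coefficient estimates and their approximate covariance.
    """
    beta = np.zeros(X.shape[1])
    beta[0] = np.log(y.mean() + 1e-8)        # start at the intercept-only fit
    for _ in range(n_iter):
        mu = np.exp(X @ beta)
        grad = X.T @ (y - mu)                 # score vector
        hess = X.T @ (X * mu[:, None])        # Fisher information
        step = np.linalg.solve(hess, grad)
        beta += step
        if np.max(np.abs(step)) < tol:
            break
    mu = np.exp(X @ beta)
    cov = np.linalg.inv(X.T @ (X * mu[:, None]))
    return beta, cov

def impute_counts(x, Z, n_imputations=5, seed=0):
    """Return completed copies of x, with NaN entries replaced by Poisson draws.

    x : 1-D float array holding a count variable, NaN where missing.
    Z : 2-D array of fully observed variables used as predictors.
    """
    rng = np.random.default_rng(seed)
    missing = np.isnan(x)
    design = np.column_stack([np.ones(len(x)), Z])
    beta_hat, cov = fit_poisson(design[~missing], x[~missing])
    completed = []
    for _ in range(n_imputations):
        # Draw coefficients to propagate estimation uncertainty.
        beta_draw = rng.multivariate_normal(beta_hat, cov)
        mu = np.exp(design[missing] @ beta_draw)
        x_imp = x.copy()
        x_imp[missing] = rng.poisson(mu)      # imputed values are valid counts
        completed.append(x_imp)
    return completed

# Hypothetical simulated data: one predictor, 30% of counts missing at random.
rng = np.random.default_rng(1)
z = rng.normal(size=500)
x_true = rng.poisson(np.exp(0.5 + 0.3 * z)).astype(float)
x_obs = x_true.copy()
x_obs[rng.random(500) < 0.3] = np.nan

imputed = impute_counts(x_obs, z.reshape(-1, 1), n_imputations=5)
```

Each element of `imputed` is one completed dataset; in a multiple-imputation analysis, the model of interest would be fitted to each and the results pooled. Handling zero-inflation or over- and under-dispersion, as the abstract describes, would mean swapping the Poisson fit and the Poisson draw for the corresponding model.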
Pages: 17