A Distribution-Based Multiple Imputation Method for Handling Bivariate Pesticide Data with Values below the Limit of Detection

被引:37
|
作者
Chen, Haiying [1 ]
Quandt, Sara A. [2 ]
Grzywacz, Joseph G. [3 ]
Arcury, Thomas A. [3 ]
机构
[1] Wake Forest Univ, Dept Biostat Sci, Div Publ Hlth Sci, Sch Med, Winston Salem, NC 27157 USA
[2] Wake Forest Univ, Dept Epidemiol & Prevent, Div Publ Hlth Sci, Sch Med, Winston Salem, NC 27157 USA
[3] Wake Forest Univ, Dept Family & Community Med, Sch Med, Winston Salem, NC 27157 USA
关键词
left-censoring; limit of detection; longitudinal study; maximum likelihood; multiple imputation; nondetect; repeated measures; EXPOSURE; MODELS;
D O I
10.1289/ehp.1002124
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
BACKGROUND: Environmental and biomedical researchers frequently encounter laboratory data constrained by a lower limit of detection (LOD). Commonly used methods to address these left-censored data, such as simple substitution of a constant for all values < LOD, may bias parameter estimation. In contrast, multiple imputation (MI) methods yield valid and robust parameter estimates and explicit imputed values for variables that can be analyzed as outcomes or predictors. OBJECTIVE: In this article we expand distribution-based MI methods for left-censored data to a bivariate setting, specifically, a longitudinal study with biological measures at two points in time. METHODS: We have presented the likelihood function for a bivariate normal distribution taking into account values < LOD as well as missing data assumed missing at random, and we use the estimated distributional parameters to impute values < LOD and to generate multiple plausible data sets for analysis by standard statistical methods. We conducted a simulation study to evaluate the sampling properties of the estimators, and we illustrate a practical application using data from the Community Participatory Approach to Measuring Farmworker Pesticide Exposure (PACE3) study to estimate associations between urinary acephate (APE) concentrations (indicating pesticide exposure) at two points in time and self-reported symptoms. RESULTS: Simulation study results demonstrated that imputed and observed values together were consistent with the assumed and estimated underlying distribution. Our analysis of PACE3 data using MI to impute APE values < LOD showed that urinary APE concentration was significantly associated with potential pesticide poisoning symptoms. Results based on simple substitution methods were substantially different from those based on the MI method. CONCLUSIONS: The distribution-based MI method is a valid and feasible approach to analyze bivariate data with values < LOD, especially when explicit values for the nondetections are needed. We recommend the use of this approach in environmental and biomedical research.
引用
收藏
页码:351 / 356
页数:6
相关论文
共 50 条
  • [41] Tobacco smoking and depressive symptoms in Chinese middle-aged and older adults: Handling missing values in panel data with multiple imputation
    Du, Xiahua
    Wu, Rina
    Kang, Lili
    Zhao, Longlong
    Li, Changle
    FRONTIERS IN PUBLIC HEALTH, 2022, 10
  • [42] Multiple Imputation based Clustering Validation (MIV) for Big Longitudinal Trial Data with Missing Values in eHealth
    Zhaoyang Zhang
    Hua Fang
    Honggang Wang
    Journal of Medical Systems, 2016, 40
  • [43] Quantile Regression-Based Multiple Imputation of Missing Values - An Evaluation and Application to Corporal Punishment Data
    Kleinke, Kristian
    Fritsch, Markus
    Stemmler, Mark
    Reinecke, Jost
    Loesel, Friedrich
    METHODOLOGY-EUROPEAN JOURNAL OF RESEARCH METHODS FOR THE BEHAVIORAL AND SOCIAL SCIENCES, 2021, 17 (03) : 205 - 230
  • [44] Multiple Imputation based Clustering Validation (MIV) for Big Longitudinal Trial Data with Missing Values in eHealth
    Zhang, Zhaoyang
    Fang, Hua
    Wang, Honggang
    JOURNAL OF MEDICAL SYSTEMS, 2016, 40 (06)
  • [45] Assay of D-lactate in urine of infants and children with reference values taking into account data below detection limit
    Haschke-Becher, E
    Baumgartner, M
    Bachmann, C
    CLINICA CHIMICA ACTA, 2000, 298 (1-2) : 99 - 109
  • [46] Multiple Distribution Data Description Learning Method for Novelty Detection
    Trung Le
    Dat Tran
    Phuoc Nguyen
    Ma, Wanli
    Sharma, Dharmendra
    2011 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2011, : 2321 - 2326
  • [47] Multiple imputation for left-censored biomarker data based on Gibbs sampling method
    Lee, MinJae
    Kong, Lan
    Weissfeld, Lisa
    STATISTICS IN MEDICINE, 2012, 31 (17) : 1838 - 1848
  • [48] Privacy and security of big data in cyber physical systems using Weibull distribution-based intrusion detection
    R. Gifty
    R. Bharathi
    P. Krishnakumar
    Neural Computing and Applications, 2019, 31 : 23 - 34
  • [49] Privacy and security of big data in cyber physical systems using Weibull distribution-based intrusion detection
    Gifty, R.
    Bharathi, R.
    Krishnakumar, P.
    NEURAL COMPUTING & APPLICATIONS, 2019, 31 (Suppl 1): : 23 - 34
  • [50] Machine-Learning-Based Imputation Method for Filling Missing Values in Ground Meteorological Observation Data
    Li, Cong
    Ren, Xupeng
    Zhao, Guohui
    ALGORITHMS, 2023, 16 (09)