Estimating correlation with multiply censored data arising from the adjustment of singly censored data

被引:22
|
作者
Newton, Elizabeth [1 ]
Rudel, Ruthann [1 ]
机构
[1] Silent Spring Inst, Newton, MA 02458 USA
关键词
D O I
10.1021/es0608444
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Environmental data frequently are left censored due to detection limits of laboratory assay procedures. Left censored means that some of the observations are known only to fall below a censoring point (detection limit). This presents difficulties in statistical analysis of the data. In this paper, we examine methods for estimating the correlation between variables each of which is censored at multiple points. Multiple censoring frequently arises due to adjustment of singly censored laboratory results for physical sample size. We discuss maximum likelihood (ML) estimation of the correlation and introduce a new method (cp.mle2) that, instead of using the multiply censored data directly, relies on ML estimates of the covariance of the singly censored laboratory data. We compare the ML methods with Kendall's tau-b (ck.taub) which is a modification Kendall's tau adjusted for ties, and several commonly used simple substitution methods: correlations estimated with nondetects set to the detection limit divided by 2 and correlations based on detects only (cs.det) with nondetects set to missing. The methods are compared based on simulations and real data. In the simulations, censoring levels are varied from 0 to 90%, rho from -0.8 to 0.8, and nu (variance of physical sample size) is set to 0 and 0.5, for a total of 550 parameter combinations with 1000 replications at each combination. We find that with increasing levels of censoring most of the correlation methods are highly biased. The simple substitution methods in general tend toward zero if singly censored and one if multiply censored. ck.taub tends toward zero. Least biased is cp.mle2, however, it has higher variance than some of the other estimators. Overall, cs.det performs the worst and cp.mle2 the best.
引用
收藏
页码:221 / 228
页数:8
相关论文
共 50 条
  • [11] Estimating panel data duration models with censored data
    Lee, Sokbae
    [J]. ECONOMETRIC THEORY, 2008, 24 (05) : 1254 - 1276
  • [12] A METHOD FOR GRAPHICAL ANALYSIS OF MULTIPLY CENSORED LIFE DATA
    NELSON, W
    [J]. TECHNOMETRICS, 1969, 11 (01) : 218 - &
  • [13] Highest posterior density estimation from multiply censored Pareto data
    Fernandez, Arturo J.
    [J]. STATISTICAL PAPERS, 2008, 49 (02) : 333 - 341
  • [14] Highest posterior density estimation from multiply censored Pareto data
    Arturo J. Fernández
    [J]. Statistical Papers, 2008, 49 : 333 - 341
  • [15] MONOTONE ESTIMATING EQUATIONS FOR CENSORED-DATA
    FYGENSON, M
    RITOV, Y
    [J]. ANNALS OF STATISTICS, 1994, 22 (02): : 732 - 746
  • [16] ESTIMATING THE RELATIVE RISK WITH CENSORED-DATA
    BEGUN, JM
    REID, N
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1983, 78 (382) : 337 - 341
  • [17] ESTIMATING LIFETIME MEDICAL COSTS FROM CENSORED CLAIMS DATA
    Hwang, J.
    Hu, T.
    Lee, L. J.
    Wang, J.
    [J]. VALUE IN HEALTH, 2015, 18 (07) : A690 - A691
  • [18] Estimating Nested Logit Models with Censored Data
    Newman, Jeffrey P.
    Ferguson, Mark E.
    Garrow, Laurie A.
    [J]. TRANSPORTATION RESEARCH RECORD, 2013, (2343) : 62 - 67
  • [19] Estimating period prevalence using censored data
    Logie, John W.
    Feudjo-Tepie, Maurille A.
    [J]. PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2006, 15 : S233 - S233
  • [20] Estimating lifetime medical costs from censored claims data
    Hwang, Jing-Shiang
    Hu, Tsuey-Hwa
    Lee, Lukas Jyuhn-Hsiarn
    Wang, Jung-Der
    [J]. HEALTH ECONOMICS, 2017, 26 (12) : E332 - E344