Nonparametric estimation of the lifetime and disease onset distributions for a survival-sacrifice model

被引:0
|
作者
Gomes, Antonio Eduardo [1 ]
Groeneboom, Piet [2 ]
Wellner, Jon A. [3 ]
机构
[1] Univ Brasilia, Dept Estat, BR-70910900 Brasilia, DF, Brazil
[2] Delft Univ Technol, Bldg 28,Van Mourik Broekmanweg 6, NL-2628 XE Delft, Netherlands
[3] Univ Washington, Dept Stat, Box 354322, Seattle, WA 98195 USA
来源
ELECTRONIC JOURNAL OF STATISTICS | 2019年 / 13卷 / 02期
关键词
MLE; survival sacrifice model; self-consistency equation; Volterra integral equation; primal-dual interior point algorithm; EM algorithm; smooth functionals; CENTRAL-LIMIT-THEOREM;
D O I
10.1214/19-EJS1598
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In carcinogenicity experiments with animals where the tumor is not palpable it is common to observe only the time of death of the animal, the cause of death (the tumor or another independent cause, as sacrifice) and whether the tumor was present at the time of death. These last two indicator variables are evaluated after an autopsy. Defining the non-negative variables T-1 (time of tumor onset), T-2 (time of death from the tumor) and C (time of death from an unrelated cause), we observe (Y, Delta(1), Delta(2)), where Y = min{T-2, C}, Delta(1) = 1{T-1 <= C}, and Delta(2) = 1{T-2 <= C}. The random variables T-1 and T-2 are independent of C and have a joint distribution such that P(T-1 <= T-2) = 1. Some authors call this model a "survival-sacrifice model". [20] (generally to be denoted by LJP (1997)) proposed a Weighted Least Squares estimator for F-1 (the marginal distribution function of T-1), using the Kaplan-Meier estimator of F-2 (the marginal distribution function of T2). The authors claimed that their estimator is more efficient than the MLE (maximum likelihood estimator) of F1 and that the Kaplan-Meier estimator is more efficient than the MLE of F2. However, we show that the MLE of F-1 was not computed correctly, and that the (claimed) MLE estimate of F-1 is even undefined in the case of active constraints. In our simulation study we used a primal-dual interior point algorithm to obtain the true MLE of F-1. The results showed a better performance of the MLE of F-1 over the weighted least squares estimator in LJP (1997) for points where F-1 is close to F-2. Moreover, application to the model, used in the simulation study of LJP (1997), showed smaller variances of the MLE estimators of the first and second moments for both F-1 and F-2, and sample sizes from 100 up to 5000, in comparison to the estimates, based on the weighted least squares estimator for F-1, proposed in LJP (1997), and the Kaplan-Meier estimator for F-2. R scripts are provided for computing the estimates either with the primal-dual interior point method or by the EM algorithm. In spite of the long history of the model in the biometrics literature (since about 1982), basic properties of the real maximum likelihood estimator (MLE) were still unknown. We give necessary and sufficient conditions for the MLE (Theorem 3.1), as an element of a cone, where the number of generators of the cone increases quadratically with sample size. From this and a self-consistency equation, turned into a Volterra integral equation, we derive the consistency of the MLE (Theorem 4.1). We conjecture that (under some natural conditions) one can extend the methods, used to prove consistency, to proving that the MLE is root n consistent for F-2 and cube root n convergent for F-1, but this has presently not yet been proved.
引用
收藏
页码:3195 / 3242
页数:48
相关论文
共 50 条