A pseudoscore estimator for regression problems with two-phase sampling

被引:100
|
作者
Chatterjee, N
Chen, YH
Breslow, NE
机构
[1] NCI, Div Canc Epidemiol & Genet, Rockville, MD 20852 USA
[2] Acad Sinica, Inst Stat Sci, Taipei 115, Taiwan
[3] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
[4] Univ Washington, Dept Stat, Seattle, WA 98195 USA
关键词
measurement error; missing data; pseudolikelihood; response selective sampling; restricted sampling; semiparametric inference;
D O I
10.1198/016214503388619184
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Two-phase stratified sampling designs yield efficient estimates of population parameters in regression models while minimizing the costs of data collection. In measurement error problems, for example, error-free covariates are ascertained only for units selected in a validation sample. Estimators proposed heretofore for such designs require all units to have positive probability of being selected. We describe a new semiparametric estimator that relaxes this assumption and that is applicable to, for example, case-only or control-only validation sampling for binary regression problems. It uses a weighted empirical covariate distribution, with weights determined by the regression model, to estimate the score equations. Implementation is relatively easy for both discrete and continuous outcome data. For designs that are amenable to alternative methods, simulation studies show that the new estimator outperforms the currently available weighted and pseudolikelihood methods and often achieves efficiency comparable to that of semiparametric maximum likelihood. The simulations also demonstrate the vulnerability of the case-only or control-only designs to model misspecification. These results are illustrated by the analysis of data from a population-based case-control study of leprosy.
引用
收藏
页码:158 / 168
页数:11
相关论文
共 50 条
  • [41] BAYESIAN INFERENCE FOR NONRESPONSE TWO-PHASE SAMPLING
    Zhang, Yue
    Chen, Henian
    Zhang, Nanhua
    [J]. STATISTICA SINICA, 2018, 28 (04) : 2167 - 2187
  • [42] A two-phase sampling scheme and πps designs
    Laitila, Thomas
    Olofsson, Jens
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2011, 141 (05) : 1646 - 1654
  • [43] Two-phase sampling method of resident trip
    Key Laboratory of Transportation Engineering, Beijing University of Technology, Beijing 100022, China
    [J]. Beijing Gongye Daxue Xuebao J. Beijing Univ. Technol., 2007, 8 (828-833+852):
  • [44] An augmented two-phase method for spread sampling
    Lu, Jie
    Chen, Hongchang
    Zhang, Zhen
    Zhang, Jianpeng
    Zhang, Zheng
    [J]. ELECTRONICS LETTERS, 2022, 58 (24) : 905 - 907
  • [45] Stratified Two-Phase Ranked Set Sampling
    Arnab, Raghunath
    Anderson, George
    Olaomi, John O.
    Rodriguez, B. C.
    [J]. PAKISTAN JOURNAL OF STATISTICS AND OPERATION RESEARCH, 2019, 15 (04) : 867 - 879
  • [46] Variance Estimation under Two-Phase Sampling
    Saegusa, Takumi
    [J]. SCANDINAVIAN JOURNAL OF STATISTICS, 2015, 42 (04) : 1078 - 1091
  • [47] Estimation of Mode Using Two-phase Sampling
    Lamichhane, Rajan
    Singh, Sarjinder
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2016, 45 (07) : 2586 - 2597
  • [48] Variance estimation for two-phase stratified sampling
    Binder, DA
    Babyak, C
    Brodeur, M
    Hidiroglou, M
    Jocelyn, W
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2000, 28 (04): : 751 - 764
  • [49] Neutrosophic Estimators in Two-Phase Survey Sampling
    Yadav V.K.
    Prasad S.
    [J]. Neutrosophic Sets and Systems, 2023, 61 : 534 - 578
  • [50] Optimal linear estimation in two-phase sampling
    Merkouris, Takis
    [J]. SURVEY METHODOLOGY, 2022, 48 (02) : 439 - 461