Problems due to small samples and sparse data in conditional logistic regression analysis

被引:0
|
作者
Greenland, S [1 ]
Schwartzbaum, JA
Finkle, WD
机构
[1] Univ Calif Los Angeles, Sch Publ Hlth, Dept Epidemiol, Los Angeles, CA 90024 USA
[2] Ohio State Univ, Sch Publ Hlth, Div Epidemiol & Biometr, Columbus, OH 43210 USA
[3] Consolidated Res Inc, Los Angeles, CA USA
关键词
bias (epidemiology); case-control studies; epidemiologic methods; logistic models; matched-pair analysis; odds ratio; regression analysis; risk assessment;
D O I
暂无
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Conditional logistic regression was developed to avoid "sparse-data" biases that can arise in ordinary logistic regression analysis. Nonetheless, it is a large-sample method that can exhibit considerable bias when certain types of matched sets are infrequent or when the model contains too many parameters. Sparse-data bias can cause misleading inferences about confounding, effect modification, dose response, and induction periods, and can interact with other biases. In this paper, the authors describe these problems in the context of matched case-control analysis and provide examples from a study of electrical wiring and childhood leukemia and a study of diet and glioma, The same problems can arise in any likelihood-based analysis, including ordinary logistic regression. The problems can be detected by careful inspection of data and by examining the sensitivity of estimates to category boundaries, variables in the model, and transformations of those variables. One can also apply various bias corrections or turn to methods less sensitive to sparse data than conditional likelihood, such as Bayesian and empirical-Bayes (hierarchical regression) methods.
引用
收藏
页码:531 / 539
页数:9
相关论文
共 50 条
  • [1] Re: "Problems due to small samples and sparse data in conditional logistic regression analysis"
    Neuenschwander, BE
    Zwahlen, M
    [J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 2000, 152 (07) : 688 - 688
  • [2] Small-sample and sparse-data problems in conditional logistic regression analysis.
    Greenland, S
    Schwartzbaum, JA
    Finkle, WD
    [J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 1999, 149 (11) : S16 - S16
  • [3] Bias-reduced and separation-proof conditional logistic regression with small or sparse data sets
    Heinze, Georg
    Puhr, Rainer
    [J]. STATISTICS IN MEDICINE, 2010, 29 (7-8) : 770 - 777
  • [4] Conditional Logistic Regression With Survey Data
    Graubard, Barry I.
    Korn, Edward L.
    [J]. STATISTICS IN BIOPHARMACEUTICAL RESEARCH, 2011, 3 (02): : 398 - 408
  • [5] Sparse data and use of logistic regression
    Siddarth, Prabha
    [J]. EPILEPSIA, 2018, 59 (05) : 1085 - 1086
  • [6] Sparse logistic regression on functional data
    Xu, Yunnan
    Du, Pang
    Robertson, John
    Senger, Ryan
    [J]. STATISTICS AND ITS INTERFACE, 2022, 15 (02) : 171 - 179
  • [7] CONDITIONAL LOGISTIC-REGRESSION WITH MISSING DATA
    GIBBONS, LE
    HOSMER, DW
    [J]. COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 1991, 20 (01) : 109 - 120
  • [8] Inferential tools in penalized logistic regression for small and sparse data: A comparative study
    Siino, Marianna
    Fasola, Salvatore
    Muggeo, Vito M. R.
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2018, 27 (05) : 1365 - 1375
  • [9] Logistic Regression Under Sparse Data Conditions
    Walker, David A.
    Smith, Thomas J.
    [J]. JOURNAL OF MODERN APPLIED STATISTICAL METHODS, 2019, 18 (02)
  • [10] Analysis of sparse data in logistic regression in medical research: A newer approach
    Devika, S.
    Jeyaseelan, L.
    Sebastian, G.
    [J]. JOURNAL OF POSTGRADUATE MEDICINE, 2016, 62 (01) : 26 - 31