Confidence intervals for multinomial logistic regression in sparse data

被引:42
|
作者
Bull, Shelley B.
Lewinger, Juan Pablo
Lee, Sophia S. F.
机构
[1] Mt Sinai Hosp, Samuel Lunenfeld Res Inst, Prosserman Ctr Hlth Res, Toronto, ON M5G 1X5, Canada
[2] Univ Toronto, Dept Publ Hlth Sci, Toronto, ON, Canada
[3] Univ Toronto, Dept Stat, Toronto, ON, Canada
基金
加拿大健康研究院;
关键词
asymptotic bias; Bayesian estimates; bias reduction; continuous covariate; data separation; infinite estimates; Jeffreys prior; odds ratio; polychotomous logistic regression; polytomous logistic regression; small samples;
D O I
10.1002/sim.2518
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Logistic regression is one of the most widely used regression models in practice, but alternatives to conventional maximum likelihood estimation methods may be more appropriate for small or sparse samples. Modification of the logistic regression score function to remove first-order bias is equivalent to penalizing the likelihood by the Jeffreys prior, and yields penalized maximum likelihood estimates (PLEs) that always exist, even in samples in which maximum likelihood estimates (MLEs) are infinite. PLEs are an attractive alternative in small-to-moderate-sized samples, and are preferred to exact conditional MLEs when there are continuous covariates. We present methods to construct confidence intervals (CI) in the penalized multinomial logistic regression model, and compare Cl coverage and length for the PLE-based methods to that of conventional MLE-based methods in trinomial logistic regressions with both binary and continuous covariates. Based on simulation studies in sparse data sets, we recommend profile CIs over asymptotic Wald-type intervals for the PLEs in all cases. Furthermore, when finite sample bias and data separation are likely to occur, we prefer PLE profile CIs over MLE methods. Copyright (c) 2006 John Wiley & Sons, Ltd.
引用
收藏
页码:903 / 918
页数:16
相关论文
共 50 条
  • [21] ON SIMULTANEOUS CONFIDENCE INTERVALS FOR MULTINOMIAL PROPORTIONS
    GOODMAN, LA
    TECHNOMETRICS, 1965, 7 (02) : 247 - &
  • [22] Confidence Intervals on Regression Models with Censored Data
    Orbe, Jesus
    Nunez-Anton, Vicente
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2013, 42 (09) : 2140 - 2159
  • [23] Multinomial Logistic Regression Ensembles
    Lee, Kyewon
    Ahn, Hongshik
    Moon, Hojin
    Kodell, Ralph L.
    Chen, James J.
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2013, 23 (03) : 681 - 694
  • [24] Multinomial and ordinal logistic regression
    Sainani, Kristin L.
    PM&R, 2021, 13 (09) : 1050 - 1055
  • [25] Semisupervised Hyperspectral Image Classification Using Soft Sparse Multinomial Logistic Regression
    Li, Jun
    Bioucas-Dias, Jose M.
    Plaza, Antonio
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2013, 10 (02) : 318 - 322
  • [26] Usual and shortest confidence intervals on odds ratios from logistic regression
    Wilson, PD
    Langenberg, P
    AMERICAN STATISTICIAN, 1999, 53 (04): : 332 - 335
  • [27] Semiparametric Multinomial Logistic Regression for Multivariate Point Pattern Data
    Hessellund, Kristian Bjorn
    Xu, Ganggang
    Guan, Yongtao
    Waagepetersen, Rasmus
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (539) : 1500 - 1515
  • [28] Mixtures of logistic normal multinomial regression models for microbiome data
    Dai, Wenshu
    Fang, Yuan
    Subedi, Sanjeena
    JOURNAL OF APPLIED STATISTICS, 2025, 52 (03) : 624 - 655
  • [29] Logistic Regression Under Sparse Data Conditions
    Walker, David A.
    Smith, Thomas J.
    JOURNAL OF MODERN APPLIED STATISTICAL METHODS, 2019, 18 (02)
  • [30] Communication-efficient distributed large-scale sparse multinomial logistic regression
    Lei, Dajiang
    Huang, Jie
    Chen, Hao
    Li, Jie
    Wu, Yu
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2023, 35 (18):