Confidence intervals for multinomial logistic regression in sparse data

被引:42
|
作者
Bull, Shelley B.
Lewinger, Juan Pablo
Lee, Sophia S. F.
机构
[1] Mt Sinai Hosp, Samuel Lunenfeld Res Inst, Prosserman Ctr Hlth Res, Toronto, ON M5G 1X5, Canada
[2] Univ Toronto, Dept Publ Hlth Sci, Toronto, ON, Canada
[3] Univ Toronto, Dept Stat, Toronto, ON, Canada
基金
加拿大健康研究院;
关键词
asymptotic bias; Bayesian estimates; bias reduction; continuous covariate; data separation; infinite estimates; Jeffreys prior; odds ratio; polychotomous logistic regression; polytomous logistic regression; small samples;
D O I
10.1002/sim.2518
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Logistic regression is one of the most widely used regression models in practice, but alternatives to conventional maximum likelihood estimation methods may be more appropriate for small or sparse samples. Modification of the logistic regression score function to remove first-order bias is equivalent to penalizing the likelihood by the Jeffreys prior, and yields penalized maximum likelihood estimates (PLEs) that always exist, even in samples in which maximum likelihood estimates (MLEs) are infinite. PLEs are an attractive alternative in small-to-moderate-sized samples, and are preferred to exact conditional MLEs when there are continuous covariates. We present methods to construct confidence intervals (CI) in the penalized multinomial logistic regression model, and compare Cl coverage and length for the PLE-based methods to that of conventional MLE-based methods in trinomial logistic regressions with both binary and continuous covariates. Based on simulation studies in sparse data sets, we recommend profile CIs over asymptotic Wald-type intervals for the PLEs in all cases. Furthermore, when finite sample bias and data separation are likely to occur, we prefer PLE profile CIs over MLE methods. Copyright (c) 2006 John Wiley & Sons, Ltd.
引用
收藏
页码:903 / 918
页数:16
相关论文
共 50 条
  • [41] Multinomial Logistic Regression in Workers' Health
    Grilo, Luis M.
    Grilo, Helena L.
    Goncalves, Sonia P.
    Junca, Ana
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE OF COMPUTATIONAL METHODS IN SCIENCES AND ENGINEERING 2017 (ICCMSE-2017), 2017, 1906
  • [42] Pliable lasso for the multinomial logistic regression
    Asenso, Theophilus Quachie
    Zhang, Hai
    Liang, Yong
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2022, 51 (11) : 3596 - 3611
  • [43] An Application on Multinomial Logistic Regression Model
    El-Habil, Abdalla M.
    PAKISTAN JOURNAL OF STATISTICS AND OPERATION RESEARCH, 2012, 8 (02) : 271 - 291
  • [44] MULTINOMIAL LOGISTIC-REGRESSION ALGORITHM
    BOHNING, D
    ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 1992, 44 (01) : 197 - 200
  • [45] Logistic Regression Multinomial for Arrhythmia Detection
    Behadada, Omar
    Trovati, Marcello
    Chikh, M. A.
    Bessis, Nik
    Korkontzelos, Yannis
    2016 IEEE 1ST INTERNATIONAL WORKSHOPS ON FOUNDATIONS AND APPLICATIONS OF SELF* SYSTEMS (FAS*W), 2016, : 133 - 137
  • [46] Assessing Accident Risk using Ordinal Regression and Multinomial Logistic Regression Data Generation
    Alicioglu, Gulsum
    Sun, Bo
    Ho, Shen Shyang
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [47] Multinomial logistic regression with missing outcome data: An application to cancer subtypes
    Wang, Ching-Yun
    Hsu, Li
    STATISTICS IN MEDICINE, 2020, 39 (24) : 3299 - 3312
  • [48] A Logistic Normal Multinomial Regression Model for Microbiome Compositional Data Analysis
    Xia, Fan
    Chen, Jun
    Fung, Wing Kam
    Li, Hongzhe
    BIOMETRICS, 2013, 69 (04) : 1053 - 1063
  • [49] Multinomial logistic regression-based feature selection for hyperspectral data
    Pal, Mahesh
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2012, 14 (01): : 214 - 220
  • [50] CONFIDENCE INTERVALS FOR PARAMETERS OF LOGISTIC DISTRIBUTION
    ANTLE, C
    KLIMKO, L
    HARKNESS, W
    BIOMETRIKA, 1970, 57 (02) : 397 - &