Comparison of penalized logistic regression models for rare event case

被引:3
|
作者
Olmus, Hulya [1 ]
Nazman, Ezgi [1 ]
Erbas, Semra [2 ]
机构
[1] Gazi Univ, Stat, Ankara, Turkey
[2] Univ Kyrenia, Fac Arts & Sci, Karakum, Northern Cyprus, Turkey
关键词
Firth LR; FLIC; FLAC; Rare event; Predicted probability bias; BIAS; ESTIMATORS; REDUCTION; RATIO;
D O I
10.1080/03610918.2019.1676438
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The occurrence rate of the event of interest might be quite small (rare) in some cases, although sample size is large enough for Binary Logistic Regression (LR) model. In studies where the sample size is not large enough, the parameters to be estimated might be biased because of rare event case. Parameter estimations of LR model are usually obtained using Newton?Raphson (NR) algorithm for Maximum Likelihood Estimation (MLE). It is known that these estimations are usually biased in small samples but asymptotically unbiased. On the other hand, initial parameter values are sensitive for parameter estimation in NR for MLE. Our aim of the study is to present an approach on parameter estimation bias using inverse conditional distributions based on distribution assumption giving true parameter values and to compare this approach on different penalized LR methods. With this aim, LR, Firth LR, FLIC and FLAC methods were compared in terms of parameter estimation bias, predicted probability bias and Root Mean Squared Error (RMSE) for different sample sizes, event and correlation rates conducting a detailed Monte Carlo simulation study. Findings suggest that FLIC method should be preferred in rare event and small sample cases.
引用
收藏
页码:1578 / 1590
页数:13
相关论文
共 50 条
  • [41] A Penalized Logistic Regression Approach to Detection Based Phone Classification
    Siniscalchi, Sabato Marco
    Svendsen, Torbjorn
    Lee, Chin-Hui
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 2390 - 2393
  • [42] Gene and pathway identification with Lp penalized Bayesian logistic regression
    Liu, Zhenqiu
    Gartenhaus, Ronald B.
    Tan, Ming
    Jiang, Feng
    Jiao, Xiaoli
    BMC BIOINFORMATICS, 2008, 9 (1)
  • [43] Rare Event Classification with Weighted Logistic Regression for Identifying Repeating Fast Radio Bursts
    Herrera-Martin, Antonio
    Craiu, Radu V.
    Eadie, Gwendolyn M.
    Stenning, David C.
    Bingham, Derek
    Gaensler, B. M.
    Pleunis, Ziggy
    Scholz, Paul
    Mckinven, Ryan
    Kharel, Bikash
    Masui, Kiyoshi W.
    ASTROPHYSICAL JOURNAL, 2025, 982 (01):
  • [44] Ovarian Cancer Risk Factors in a Defined Population Using Rare Event Logistic Regression
    Haem, Elham
    Heydari, Seyyed Taghi
    Zare, Najaf
    Lankarani, Kamran B.
    Barooti, Esmat
    Sharif, Farkhondeh
    MIDDLE EAST JOURNAL OF CANCER, 2015, 6 (01) : 1 - 9
  • [45] Assessing the risk of windshear occurrence at HKIA using rare-event logistic regression
    Chen, Feng
    Peng, Haorong
    Chan, Pak-wai
    Ma, Xiaoxiang
    Zeng, Xiaoqing
    METEOROLOGICAL APPLICATIONS, 2020, 27 (06)
  • [46] Comparison of Deep Learning, Machine Learning, and Penalized Logistic Regression for Predicting Clinical Deterioration in Oncology Inpatients
    Lyons, P.
    Li, D.
    McEvoy, C.
    Westervelt, P.
    Gage, B.
    Lu, C.
    Kollef, M. H.
    AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2020, 201
  • [47] On the Complexity of Logistic Regression Models
    Bulso, Nicola
    Marsili, Matteo
    Roudi, Yasser
    NEURAL COMPUTATION, 2019, 31 (08) : 1592 - 1623
  • [48] Penalized logistic regression for high-dimensional DNA methylation data with case-control studies
    Sun, Hokeun
    Wang, Shuang
    BIOINFORMATICS, 2012, 28 (10) : 1368 - 1375
  • [49] Endogeneity in logistic regression models
    Avery, G
    EMERGING INFECTIOUS DISEASES, 2005, 11 (03) : 503 - 504
  • [50] Multicollinearity in Logistic Regression Models
    Bayman, Emine Ozgur
    Dexter, Franklin
    ANESTHESIA AND ANALGESIA, 2021, 133 (02): : 362 - 365