Comparison of penalized logistic regression models for rare event case

被引:3
|
作者
Olmus, Hulya [1 ]
Nazman, Ezgi [1 ]
Erbas, Semra [2 ]
机构
[1] Gazi Univ, Stat, Ankara, Turkey
[2] Univ Kyrenia, Fac Arts & Sci, Karakum, Northern Cyprus, Turkey
关键词
Firth LR; FLIC; FLAC; Rare event; Predicted probability bias; BIAS; ESTIMATORS; REDUCTION; RATIO;
D O I
10.1080/03610918.2019.1676438
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The occurrence rate of the event of interest might be quite small (rare) in some cases, although sample size is large enough for Binary Logistic Regression (LR) model. In studies where the sample size is not large enough, the parameters to be estimated might be biased because of rare event case. Parameter estimations of LR model are usually obtained using Newton?Raphson (NR) algorithm for Maximum Likelihood Estimation (MLE). It is known that these estimations are usually biased in small samples but asymptotically unbiased. On the other hand, initial parameter values are sensitive for parameter estimation in NR for MLE. Our aim of the study is to present an approach on parameter estimation bias using inverse conditional distributions based on distribution assumption giving true parameter values and to compare this approach on different penalized LR methods. With this aim, LR, Firth LR, FLIC and FLAC methods were compared in terms of parameter estimation bias, predicted probability bias and Root Mean Squared Error (RMSE) for different sample sizes, event and correlation rates conducting a detailed Monte Carlo simulation study. Findings suggest that FLIC method should be preferred in rare event and small sample cases.
引用
收藏
页码:1578 / 1590
页数:13
相关论文
共 50 条
  • [31] Seemingly unrelated penalized regression models
    Ghasemi, Adel
    Najarzadeh, Dariush
    Khazaei, Mojtaba
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2024,
  • [32] Comparison Of Statistical Tests In Logistic Regression: The Case Of Hypernatreamia
    Katsaragakis, Stylianos
    Koukouvinos, Christos
    Stylianou, Stella
    Theodoraki, Eleni-Maria
    JOURNAL OF MODERN APPLIED STATISTICAL METHODS, 2005, 4 (02) : 514 - 521
  • [33] Classification using partial least squares with penalized logistic regression
    Fort, G
    Lambert-Lacroix, S
    BIOINFORMATICS, 2005, 21 (07) : 1104 - 1111
  • [34] Penalized principal logistic regression for sparse sufficient dimension reduction
    Shin, Seung Jun
    Artemiou, Andreas
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2017, 111 : 48 - 58
  • [35] Fingerprint Matching Based on Neighboring Information and Penalized Logistic Regression
    Cao, Kai
    Yang, Xin
    Tian, Jie
    Zhang, Yangyang
    Li, Peng
    Tao, Xunqiang
    ADVANCES IN BIOMETRICS, 2009, 5558 : 617 - 626
  • [36] Simultaneous factors selection and fusion of their levels in penalized logistic regression
    Kaufmann, Lea
    Kateri, Maria
    ELECTRONIC JOURNAL OF STATISTICS, 2024, 18 (02): : 4235 - 4291
  • [37] Advanced colorectal neoplasia risk stratification by penalized logistic regression
    Lin, Yunzhi
    Yu, Menggang
    Wang, Sijian
    Chappell, Richard
    Imperiale, Thomas F.
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2016, 25 (04) : 1677 - 1691
  • [38] Evaluating penalized logistic regression models to predict Heat-Related Electric grid stress days
    Bramer, L. M.
    Rounds, J.
    Burleyson, C. D.
    Fortin, D.
    Hathaway, J.
    Rice, J.
    Kraucunas, I.
    APPLIED ENERGY, 2017, 205 : 1408 - 1418
  • [39] Isolated-word recognition with penalized logistic regression machines
    Birkenes, Oystein
    Matsui, Tomoko
    Tanabe, Kunio
    2006 IEEE International Conference on Acoustics, Speech and Signal Processing, Vols 1-13, 2006, : 405 - 408
  • [40] Overlapping Haplotype Association Analysis via Penalized Logistic Regression
    Ayers, Kristin L.
    Cordell, Heather J.
    GENETIC EPIDEMIOLOGY, 2010, 34 (08) : 947 - 947