A comparison of model choice strategies for logistic regression

被引:0
|
作者
Karhunen, Markku [1 ]
机构
[1] Finnish Environm Inst Syke, Built Environm Solut Unit, Latokartanonkaari 11, Helsinki 00790, Finland
关键词
Model choice; Logistic regression; Logit regression; Monte Carlo simulations; Sensitivity; Specificity; SELECTION; GENE;
D O I
10.2478/jdis-2024-0001
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
Purpose: The purpose of this study is to develop and compare model choice strategies in context of logistic regression. Model choice means the choice of the covariates to be included in the model. Design/methodology/approach: The study is based on Monte Carlo simulations. The methods are compared in terms of three measures of accuracy: specificity and two kinds of sensitivity. A loss function combining sensitivity and specificity is introduced and used for a final comparison. Findings: The choice of method depends on how much the users emphasize sensitivity against specificity. It also depends on the sample size. For a typical logistic regression setting with a moderate sample size and a small to moderate effect size, either BIC, BICc or Lasso seems to be optimal. Research limitations: Numerical simulations cannot cover the whole range of data-generating processes occurring with real-world data. Thus, more simulations are needed. Practical implications Researchers can refer to these results if they believe that their data-generating process is somewhat similar to some of the scenarios presented in this paper. Alternatively, they could run their own simulations and calculate the loss function. Originality/value: This is a systematic comparison of model choice algorithms and heuristics in context of logistic regression. The distinction between two types of sensitivity and a comparison based on a loss function are methodological novelties.
引用
收藏
页码:37 / 52
页数:16
相关论文
共 50 条
  • [21] A Comparison of Pretest, Stein-Type and Penalty Estimators in Logistic Regression Model
    Reangsephet, Orawan
    Lisawadi, Supranee
    Ahmed, Syed Ejaz
    [J]. PROCEEDINGS OF THE ELEVENTH INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE AND ENGINEERING MANAGEMENT, 2018, : 19 - 34
  • [22] The comparison of logistic regression model selection methods for the prediction of coronary artery disease
    Colak, Cemil
    Colak, M. Cengiz
    Orman, Mebmet N.
    [J]. ANATOLIAN JOURNAL OF CARDIOLOGY, 2007, 7 (01): : 6 - 11
  • [23] Improved ridge regression estimators for the logistic regression model
    Saleh, A. K. Md. E.
    Kibria, B. M. Golam
    [J]. COMPUTATIONAL STATISTICS, 2013, 28 (06) : 2519 - 2558
  • [24] Gaussian Process Regression Model in Spatial Logistic Regression
    Sofro, A.
    Oktaviarina, A.
    [J]. MATHEMATICS, INFORMATICS, SCIENCE AND EDUCATION INTERNATIONAL CONFERENCE (MISEIC), 2018, 947
  • [25] Quality of life domains associated with health perception in the elderly: comparison of the Logistic Regression model with that of Poisson Regression
    Magalhaes Junior, Ediney
    Silva, Jaqueline T.
    Espinosa, Mariano M.
    Rodrigues, Dayane C.
    [J]. SIGMAE, 2019, 8 (02): : 576 - 583
  • [26] Comparison of logistic regression and linear regression in modeling percentage data
    Zao, LH
    Chen, YH
    Schaffner, DW
    [J]. APPLIED AND ENVIRONMENTAL MICROBIOLOGY, 2001, 67 (05) : 2129 - 2135
  • [27] A Comparison of Classification/Regression Trees and Logistic Regression in Failure Models
    Irimia-Dieguez, A. I.
    Blanco-Oliver, A.
    Vazquez-Cueto, M. J.
    [J]. 2ND GLOBAL CONFERENCE ON BUSINESS, ECONOMICS, MANAGEMENT AND TOURISM, 2015, 23 : 9 - 14
  • [28] Comparison of multinomial logistic regression and logistic regression: which is more efficient in allocating land use?
    Lin, Yingzhi
    Deng, Xiangzheng
    Li, Xing
    Ma, Enjun
    [J]. FRONTIERS OF EARTH SCIENCE, 2014, 8 (04) : 512 - 523
  • [29] Comparison of multinomial logistic regression and logistic regression: which is more efficient in allocating land use?
    Yingzhi Lin
    Xiangzheng Deng
    Xing Li
    Enjun Ma
    [J]. Frontiers of Earth Science, 2014, 8 : 512 - 523
  • [30] Comparison of multinomial logistic regression and logistic regression:which is more efficient in allocating land use?
    Yingzhi LIN
    Xiangzheng DENG
    Xing LI
    Enjun MA
    [J]. Frontiers of Earth Science., 2014, 8 (04) - 523