On the relationship between multicollinearity and separation in logistic regression

被引:13
|
作者
Zeng, Guoping [1 ]
Zeng, Emily [2 ]
机构
[1] 4522 Oak Shores Dr, Plano, TX 75024 USA
[2] Univ Calif Berkeley, Coll Letters & Sci, Berkeley, CA 94720 USA
关键词
Multicollinearity; Separation; Complete separation; Quasi-complete separation; Logistic regression; Maximum likelihood estimate; EXISTENCE;
D O I
10.1080/03610918.2019.1589511
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Multicollinearity and separation are two major issues in logistic regression. In this paper, for the first time we study the relationship between multicollinearity and separation. We analytically prove that multicollinearity implies quasi-complete separation. Through counter examples, we show that multicollinearity does not always imply complete separation and that separation does not always imply multicollinearity. We also present the consequences of multicollinearity and separation. We analytically prove that multicollinearity means no finite solution to maximum likelihood estimate and that separation means no finite solution to maximum likelihood estimate.
引用
收藏
页码:1989 / 1997
页数:9
相关论文
共 50 条