Identification of multiple high leverage points in logistic regression

Cited by: 11
Authors
Imon, A. H. M. Rahmatullah [1]
Hadi, Ali S. [2]
Affiliations
[1] Ball State Univ, Dept Math Sci, Muncie, IN 47306 USA
[2] Amer Univ Cairo, Dept Math, Cairo, Egypt
Keywords
logistic regression; covariates; high leverage points; masking; swamping; group deletion; robust regression; deletion median distance from the median; Monte Carlo simulation; LINEAR-REGRESSION; INFLUENTIAL OBSERVATIONS; DIAGNOSTICS;
DOI
10.1080/02664763.2013.822057
Chinese Library Classification
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];
Subject classification codes
020208; 070103; 0714;
Abstract
Leverage values are used in regression diagnostics as measures of unusual observations in the X-space. Detecting high leverage observations or points is crucial because they can mask outliers. In linear regression, high leverage points (HLP) are those that stand far apart from the center (mean) of the data, and hence the most extreme points in the covariate space get the highest leverage. But Hosmer and Lemeshow [Applied logistic regression, Wiley, New York, 1980] pointed out that in logistic regression the leverage measure contains a component which can make the leverage values of genuine HLP misleadingly small, which creates problems in correctly identifying these cases. Attempts have been made to identify the HLP based on the median distances from the mean, but since these methods are designed to identify a single high leverage point, they may not be very effective in the presence of multiple HLP because of masking (false-negative) and swamping (false-positive) effects. In this paper we propose a new method for the identification of multiple HLP in logistic regression, in which the suspect cases are identified by a robust group deletion technique and then confirmed using diagnostic techniques. The usefulness of the proposed method is investigated through several well-known examples and a Monte Carlo simulation.
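Illustrative note: the contrast drawn in the abstract between linear and logistic leverage can be sketched numerically. The snippet below is not the authors' proposed method; it only computes the usual hat-matrix diagonals, assuming the standard weighted form h_i = w_i x_i' (X'WX)^{-1} x_i with w_i = p_i(1 - p_i) for the logistic case. The data, the coefficient vector `beta`, and the function names are hypothetical choices made for illustration.

```python
import numpy as np

def linear_leverage(X):
    """Diagonal of the linear-regression hat matrix X (X'X)^{-1} X'."""
    XtX_inv = np.linalg.inv(X.T @ X)
    return np.einsum("ij,jk,ik->i", X, XtX_inv, X)

def logistic_leverage(X, beta):
    """Diagonal of the weighted hat matrix W^{1/2} X (X'WX)^{-1} X' W^{1/2},
    where W = diag(p_i (1 - p_i)) and p_i is the fitted probability."""
    p = 1.0 / (1.0 + np.exp(-(X @ beta)))
    w = p * (1.0 - p)  # this weight shrinks toward 0 when p_i is near 0 or 1
    XtWX_inv = np.linalg.inv(X.T @ (w[:, None] * X))
    return w * np.einsum("ij,jk,ik->i", X, XtWX_inv, X)

# Hypothetical data: 20 ordinary covariate values plus one extreme point.
x = np.append(np.linspace(-2.0, 2.0, 20), 15.0)
X = np.column_stack([np.ones_like(x), x])  # intercept + one covariate
beta = np.array([0.0, 1.0])                # assumed coefficients, not estimated

h_lin = linear_leverage(X)
h_logit = logistic_leverage(X, beta)

print("linear leverage of the extreme point:  ", h_lin[-1])
print("logistic leverage of the extreme point:", h_logit[-1])
```

Under these assumptions the extreme point has a fitted probability very close to 1, so its weight is nearly zero and its logistic leverage is tiny even though its linear leverage is the largest in the sample.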
Pages: 2601-2616
Page count: 16
Related papers
50 in total
  • [1] Identification of High Leverage Points in Binary Logistic Regression
    Fitrianto, Anwar
    Wendy, Tham
    [J]. 4TH INTERNATIONAL CONFERENCE ON QUANTITATIVE SCIENCES AND ITS APPLICATIONS (ICOQSIA 2016), 2016, 1782
  • [2] Identification and classification of multiple outliers, high leverage points and influential observations in linear regression
    Nurunnabi, A. A. M.
    Nasser, M.
    Imon, A. H. M. R.
    [J]. JOURNAL OF APPLIED STATISTICS, 2016, 43 (03) : 509 - 525
  • [3] The Effect of High Leverage Points on the Logistic Ridge Regression Estimator Having Multicollinearity
    Ariffin, Syaiba Balqish
    Midi, Habshah
    [J]. PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON MATHEMATICAL SCIENCES, 2014, 1602 : 1105 - 1111
  • [4] The Effect of High Leverage Points on the Maximum Estimated Likelihood for Separation in Logistic Regression
    Ariffin, Syaiba Balqish
    Midi, Habshah
    Arasan, Jayanthi
    Rana, Md Sohel
    [J]. 2ND ISM INTERNATIONAL STATISTICAL CONFERENCE 2014 (ISM-II): EMPOWERING THE APPLICATIONS OF STATISTICAL AND MATHEMATICAL SCIENCES, 2015, 1643 : 402 - 408
  • [5] Fast improvised diagnostic robust measure for the identification of high leverage points in multiple linear regression
    Midi, Habshah
    Ismaeel, Shelan Saied
    [J]. JOURNAL OF STATISTICS & MANAGEMENT SYSTEMS, 2018, 21 (06) : 1003 - 1019
  • [6] The performance of diagnostic-robust generalized potentials for the identification of multiple high leverage points in linear regression
    Habshah, M.
    Norazan, M. R.
    Imon, A. H. M. Rahmatullah
    [J]. JOURNAL OF APPLIED STATISTICS, 2009, 36 (05) : 507 - 520
  • [7] A Remedial Measure of Multicollinearity in Multiple Linear Regression in the Presence of High Leverage Points
    Ismaeel, Shelan Saied
    Midi, Habshah
    Omar, Kurdistan M. Taher
    [J]. SAINS MALAYSIANA, 2024, 53 (04) : 907 - 920
  • [8] Identification of multiple outliers in logistic regression
    Imon, A. H. M. Rahmatullah
    Hadi, Ali S.
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2008, 37 (11) : 1697 - 1709
  • [9] Robust Bootstrap Procedure for Estimation of Binary Logistic Regression Model in the Presence of High Leverage Points with Medical Applications
    Habshah, M.
    Ariffin, S. B.
    Imon, A. H. M. R.
    [J]. INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS & STATISTICS, 2016, 54 (01) : 10 - 32
  • [10] ROBUST JACKKNIFE RIDGE REGRESSION TO COMBAT MULTICOLLINEARITY AND HIGH LEVERAGE POINTS IN MULTIPLE LINEAR REGRESSIONS
    Alguraibawi, Mohammed
    Midi, Habshah
    Rana, Sohel
    [J]. ECONOMIC COMPUTATION AND ECONOMIC CYBERNETICS STUDIES AND RESEARCH, 2015, 49 (04) : 305 - 322