Procedures for the identification of multiple influential observations in linear regression

被引:20
|
作者
Nurunnabi, A. A. M. [1 ]
Hadi, Ali S. [2 ]
Imon, A. H. M. R. [3 ]
机构
[1] Rajshahi Univ, Dept Stat, SLG, Rajshahi 6205, Bangladesh
[2] Amer Univ Cairo, Dept Math & Actuarial Sci, New Cairo, Egypt
[3] Ball State Univ, Dept Math Sci, Muncie, IN 47306 USA
关键词
leverage values; outliers; regression diagnostics; Cook's distance; swamping; masking; BACON; least trimmed squares; least median of squares; OUTLIER; UNMASKING;
D O I
10.1080/02664763.2013.868418
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Since the seminal paper by Cook (1977) in which he introduced Cook's distance, the identification of influential observations has received a great deal of interest and extensive investigation in linear regression. It is well documented that most of the popular diagnostic measures that are based on single-case deletion can mislead the analysis in the presence of multiple influential observations because of the well-known masking and/or swamping phenomena. Atkinson (1981) proposed a modification of Cook's distance. In this paper we propose a further modification of the Cook's distance for the identification of a single influential observation. We then propose new measures for the identification of multiple influential observations, which are not affected by the masking and swamping problems. The efficiency of the new statistics is presented through several well-known data sets and a simulation study.
引用
收藏
页码:1315 / 1331
页数:17
相关论文
共 50 条
  • [1] Measures and procedures for the identification of locally influential observations in linear regression
    Rancel, MMS
    Sierra, MAG
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 1999, 28 (02) : 343 - 366
  • [2] Fast Improvised Influential Distance for the Identification of Influential Observations in Multiple Linear Regression
    Midi, Habshah
    Sani, Muhammad
    Ismaeel, Shelan Saied
    Arasan, Jayanthi
    [J]. SAINS MALAYSIANA, 2021, 50 (07): : 2085 - 2094
  • [3] Identifying multiple influential observations in linear regression
    Imon, AHMR
    [J]. JOURNAL OF APPLIED STATISTICS, 2005, 32 (09) : 929 - 946
  • [4] Identification of multiple influential observations in logistic regression
    Nurunnabi, A. A. M.
    Imon, A. H. M. Rahmatullah
    Nasser, M.
    [J]. JOURNAL OF APPLIED STATISTICS, 2010, 37 (10) : 1605 - 1624
  • [5] Identification and classification of multiple outliers, high leverage points and influential observations in linear regression
    Nurunnabi, A. A. M.
    Nasser, M.
    Imon, A. H. M. R.
    [J]. JOURNAL OF APPLIED STATISTICS, 2016, 43 (03) : 509 - 525
  • [6] Detecting Multiple Influential Observations in High Dimensional Linear Regression
    Zhao, Junlong
    Zhang, Ying
    Niu, Lu
    [J]. ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, ICIC 2015, PT III, 2015, 9227 : 55 - 64
  • [7] An Efficient Method of Identification of Influential Observations in Multiple Linear Regression and Its Application to Real Data
    Midi, Habshah
    Hendi, Hasan Talib
    Uraibi, Hassan
    Arasan, Jayanthi
    Ismaeel, Shelan Saied
    [J]. SAINS MALAYSIANA, 2023, 52 (12): : 3589 - 3602
  • [8] INFLUENTIAL OBSERVATIONS IN LINEAR-REGRESSION
    COOK, RD
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1979, 74 (365) : 169 - 174
  • [9] Identifying Influential Observations in Multiple Regression
    Camilleri, Carmel
    Alter, Udi
    Cribbie, Robert A.
    [J]. QUANTITATIVE METHODS FOR PSYCHOLOGY, 2024, 20 (02): : 96 - 105
  • [10] A Diagnostic Measure for Influential Observations in Linear Regression
    Nurunnabi, A. A. M.
    Imon, A. H. M. Rahmatullah
    Nasser, M.
    [J]. COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2011, 40 (07) : 1169 - 1183