A practical method to account for outliers in simple linear regression using the median of slopes

被引:0
|
作者
Tedeschi, Luis O. [1 ]
Galyean, Michael L. [2 ]
机构
[1] Texas A&M Univ, Dept Anim Sci, College Stn, TX 77843 USA
[2] Texas Tech Univ, Dept Vet Sci, Lubbock, TX 79409 USA
来源
SCIENTIA AGRICOLA | 2024年 / 81卷
关键词
estimation; methods; relationship; robust; statistics; EFFICIENCY; ENERGY; GAIN;
D O I
10.1590/1678-992X-2022-0209
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
The ordinary least squares (OLS) can be affected by errors associated with heteroscedasticity and outliers, and extreme points can influence the regression parameters. Methods based on the median rather than on the mean and variance are more resistant to outliers and extreme points. These methods could be used to obtain regression parameter estimates that reflect more accurately the genuine relationship between the Y and X variables, leading to better identification of outliers and extreme points by comparing the slopes and intercepts of both methods. The Theil-Sen (TS) regression computes all possible pairwise slopes and determines the median of slopes as the regression slope. Here, we illustrated the potential use of TS and frequently used robust regression (RR) techniques to single linear regression using synthetic datasets and a practical problem in animal science. Three synthetic datasets were created assuming the normal distribution of Y and X values: one was free of outliers, while the other two had one or two clusters of outliers but the same X values. The TS, OLS, and RR had nearly identical regression parameter estimates for the dataset without synthetic outliers. However, the intercept and slope estimates by the OLS method differed considerably from the TS and RR methods when one or two clusters of outliers were included. The TS approach could be used to indirectly determine the presence of outliers or extreme points by comparing the 95 % confidence interval of the TS and OLS parameter estimates.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Proposal of a bootstrap procedure using measures of influence in non-linear regression models with outliers
    de Andrade, Larissa Ribeiro
    Cirillo, Marcelo Angelo
    Beijo, Luiz Alberto
    ACTA SCIENTIARUM-TECHNOLOGY, 2014, 36 (01) : 93 - 99
  • [32] Some Exponential Estimators in Sample Survey using Robust Regression Method in the Presence of Outliers
    Kumar Yadav, Vinay
    Prasad, Shakti
    LOBACHEVSKII JOURNAL OF MATHEMATICS, 2024, 45 (04) : 1674 - 1690
  • [33] Identifying Multiple Outliers in Linear Functional Relationship Model Using a Robust Method
    Ghapor, Adilah Abdul
    Zubairi, Yong Zulina
    Al Mamun, Sayed Md.
    Hassan, Siti Fatimah
    Aruchunan, Elayaraja
    Mokhtar, Nurkhairany Amyra
    SAINS MALAYSIANA, 2023, 52 (05): : 1595 - 1606
  • [34] USING RECURSIVE RESIDUALS, CALCULATED ON ADAPTIVELY-ORDERED OBSERVATIONS, TO IDENTIFY OUTLIERS IN LINEAR-REGRESSION
    KIANIFARD, F
    SWALLOW, WH
    BIOMETRICS, 1989, 45 (02) : 571 - 585
  • [35] Fuzzy linear regression using rank transform method
    Jung, Hye-Young
    Yoon, Jin Hee
    Choi, Seung Hoe
    FUZZY SETS AND SYSTEMS, 2015, 274 : 97 - 108
  • [36] Robust linear regression using Theil's method
    Glaister, P
    JOURNAL OF CHEMICAL EDUCATION, 2005, 82 (10) : 1472 - 1473
  • [37] Characters recognition method based on vector field and simple linear regression model
    Izumi, T
    Hattori, T
    Kitajima, H
    Yamasaki, T
    IEEE INTERNATIONAL SYMPOSIUM ON COMMUNICATIONS AND INFORMATION TECHNOLOGIES 2004 (ISCIT 2004), PROCEEDINGS, VOLS 1 AND 2: SMART INFO-MEDIA SYSTEMS, 2004, : 498 - 503
  • [38] A Simple and Convenient Method of Multiple Linear Regression To Calculate Iodine Molecular Constants
    Cooper, Paul D.
    JOURNAL OF CHEMICAL EDUCATION, 2010, 87 (07) : 687 - 690
  • [39] Improving the reliability of RPM measurement of UAV propeller using outliers removal technique A practical method to detect and substitute the outliers in the RPM measurement
    Muliadi, J.
    Setiyoko, A.
    Kusumoputro, B.
    2018 INTERNATIONAL CONFERENCE ON CONTROL, ELECTRONICS, RENEWABLE ENERGY AND COMMUNICATIONS (ICCEREC), 2018, : 110 - 114
  • [40] The method of purging applied to repeated cross-sectional data - Practical applications using logistic and linear regression analysis
    Te Grotenhuis, M
    Eisinga, R
    Scheepers, P
    QUALITY & QUANTITY, 2004, 38 (01) : 1 - 16