Combating outliers and multicollinearity in linear regression model using robust Kibria-Lukman mixed with principal component estimator, simulation and computation

被引:10
|
作者
Arum, K. C. [1 ]
Ugwuowo, F. I. [1 ]
Oranye, H. E. [1 ]
Alakija, T. O. [2 ]
Ugah, T. E. [1 ]
Asogwa, O. C. [3 ]
机构
[1] Univ Nigeria, Dept Stat, Nsukka, Nigeria
[2] Yaba Coll Technol, Dept Stat, Yaba Lagos, Nigeria
[3] Fed Univ, Dept Stat, Ikwo, Ebonyi, Nigeria
关键词
Ridge estimator; Kibria-Lukman estimator; M-estimator; Principal component; Multicollinearity; Outliers; RIDGE;
D O I
10.1016/j.sciaf.2023.e01566
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Scholars usually adopt the method of least squared to model the relationship between a response variable and two or more explanatory variables. Ordinary least squares estima-tor's performance is good when there is no outliers and multicollinearity in the regres-sion model dataset. Outliers and multicollinearity can occur together in a regression model dataset and least squares estimator suffers setback when both problems subsist. This study considers developing a new estimator to address both problems. We combined the princi-pal component estimator (PCE), M-estimator and Kibria-Lukman estimator (KLE) to derive new estimator called robust PC-KL. Robust PC-KL estimator inherits the characteristics of M-estimator, KLE, and PCE which makes it efficient in handling both problems individually and jointly. We examined the performance of the robust PC-KL estimator with other exist-ing estimators using mean squared error (MSE) as performance evaluation criteria through simulation design and real life application. Robust PC-KL estimator outperformed other es-timators compared with in this study based on theoretical comparison, simulation design and real life application by having the smallest MSE.(c) 2023 The Author(s). Published by Elsevier B.V. on behalf of African Institute of Mathematical Sciences / Next Einstein Initiative. This is an open access article under the CC BY license ( http://creativecommons.org/licenses/by/4.0/ )
引用
收藏
页数:17
相关论文
共 38 条
  • [31] Principal Component Regression and Linear Mixed Model in Association Analysis of Structured Samples: Competitors or Complements?
    Zhang, Yiwei
    Pan, Wei
    GENETIC EPIDEMIOLOGY, 2015, 39 (03) : 149 - 155
  • [32] Application of Linear Mixed-Effects Model, Principal Component Analysis, and Clustering to Direct Energy Deposition Fabricated Parts Using FEM Simulation Data
    Pazireh, Syamak
    Mirazimzadeh, Seyedeh Elnaz
    Urbanic, Jill
    MATERIALS, 2024, 17 (20)
  • [33] Robust Liu Estimator Used to Combat Some Challenges in Partially Linear Regression Model by Improving LTS Algorithm Using Semidefinite Programming
    Altukhaes, Waleed B.
    Roozbeh, Mahdi
    Mohamed, Nur A.
    MATHEMATICS, 2024, 12 (17)
  • [34] Development of a model to reproduce observed suspended sediment distributions in the southern North Sea using Principal Component Analysis and Multiple Linear Regression
    McManus, JP
    Prandle, D
    CONTINENTAL SHELF RESEARCH, 1997, 17 (07) : 761 - &
  • [35] Source apportionment of soil heavy metals using robust absolute principal component scores-robust geographically weighted regression (RAPCS-RGWR) receptor model
    Qu, Mingkai
    Wang, Yan
    Huang, Biao
    Zhao, Yongcun
    SCIENCE OF THE TOTAL ENVIRONMENT, 2018, 626 : 203 - 210
  • [36] Using a Local Framework Combining Principal Component Regression and Monte Carlo Simulation for Uncertainty and Sensitivity Analysis of a Domestic Energy Model in Sub-City Areas
    Urquizo, Javier
    Calderon, Carlos
    James, Philip
    ENERGIES, 2017, 10 (12)
  • [37] Source apportionment of water pollutants in Poyang Lake Basin in China using absolute principal component score-multiple linear regression model combined with land-use parameters
    Hou, Xikang
    Gao, Wei
    Zhang, Meng
    Xia, Rui
    Zhang, Yuan
    Zhang, Kai
    Wang, Xiao
    Yang, Kai
    Chen, Xiaofei
    Deng, Yu
    FRONTIERS IN ENVIRONMENTAL SCIENCE, 2022, 10
  • [38] Robust principal component analysis-multivariate adaptive regression splines (rPCA-MARS) model for determining total acid number (TAN) and total base number (TBN) of crude oil samples using attenuated total reflectance fourier transform infrared (ATR-FTIR) spectroscopy
    Mohammadi, Mahsa
    Khorrami, Mohammadreza Khanmohammadi
    Rezaei, Arezoo
    Vatanparast, Hamid
    Khorrami, Mohammad Mahdi Khanmohammadi
    VIBRATIONAL SPECTROSCOPY, 2023, 129