Multivariate Spatial Outlier Detection Using Robust Geographically Weighted Methods

被引:42
|
作者
Harris, Paul [1 ]
Brunsdon, Chris [2 ]
Charlton, Martin [1 ]
Juggins, Steve [3 ]
Clarke, Annemarie [4 ]
机构
[1] Natl Univ Ireland Maynooth, Natl Ctr Geocomputat, Maynooth, Kildare, Ireland
[2] Univ Liverpool, Liverpool L69 3BX, Merseyside, England
[3] Newcastle Univ, Sch Geog Polit & Sociol, Newcastle Upon Tyne NE1 7RU, Tyne & Wear, England
[4] APEM Ltd, Llantrisant, Wales
基金
爱尔兰科学基金会;
关键词
Non-stationarity; Mahalanobis distance; Principal components analysis; Co-kriging cross-validation; Freshwater acidification; Anomaly detection; DATA TRANSFORMATION; REGRESSION; VARIOGRAM; VISUALIZATION; GEOSTATISTICS; ASSOCIATION; MODELS; LOADS;
D O I
10.1007/s11004-013-9491-0
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
Outlier detection is often a key task in a statistical analysis and helps guard against poor decision-making based on results that have been influenced by anomalous observations. For multivariate data sets, large Mahalanobis distances in raw data space or large Mahalanobis distances in principal components analysis, transformed data space, are routinely used to detect outliers. Detection in principal components analysis space can also utilise goodness of fit distances. For spatial applications, however, these global forms can only detect outliers in a non-spatial manner. This can result in false positive detections, such as when an observation's spatial neighbours are similar, or false negative detections such as when its spatial neighbours are dissimilar. To avoid mis-classifications, we demonstrate that a local adaptation of various global methods can be used to detect multivariate spatial outliers. In particular, we account for local spatial effects via the use of geographically weighted data with either Mahalanobis distances or principal components analysis. Detection performance is assessed using simulated data as well as freshwater chemistry data collected over all of Great Britain. Results clearly show value in both geographically weighted methods to outlier detection.
引用
收藏
页码:1 / 31
页数:31
相关论文
共 50 条
  • [41] Robust outlier detection using the instability factor
    Ha, Jihyun
    Seok, Seulgi
    Lee, Jong-Seok
    KNOWLEDGE-BASED SYSTEMS, 2014, 63 : 15 - 23
  • [42] OUTLIER DETECTION USING ROBUST MEASURES OF SCALE
    IGLEWICZ, B
    MARTINEZ, J
    BIOMETRICS, 1981, 37 (03) : 614 - 614
  • [43] Finding hyperspectral anomalies using multivariate outlier detection
    Smetek, Timothy E.
    Bauer, Kenneth W.
    2007 IEEE AEROSPACE CONFERENCE, VOLS 1-9, 2007, : 2095 - 2118
  • [44] Production Multivariate Outlier Detection Using Principal Components
    O'Neill, Peter M.
    2008 IEEE INTERNATIONAL TEST CONFERENCE, VOLS 1 AND 2, PROCEEDINGS, 2008, : 11 - 20
  • [45] Modelling urban spatial structure using Geographically Weighted Regression
    Noresah, M. S.
    Ruslan, R.
    18TH WORLD IMACS CONGRESS AND MODSIM09 INTERNATIONAL CONGRESS ON MODELLING AND SIMULATION: INTERFACING MODELLING AND SIMULATION WITH MATHEMATICAL AND COMPUTATIONAL SCIENCES, 2009, : 1950 - 1956
  • [46] Robust Multivariate Control Chart for Outlier Detection Using Hierarchical Cluster Tree in SW2
    Fan, Shu-Kai S.
    Huang, Hui-Kuo
    Chang, Yuan-Jung
    QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2013, 29 (07) : 971 - 985
  • [47] Spatial Analysis of the 2018 Logistics Performance Index Using Multivariate Kernel Function to Improve Geographically Weighted Regression Models
    Xiao, Ivan Runhua
    Jaller, Miguel
    Phong, David
    Zhu, Haihao
    TRANSPORTATION RESEARCH RECORD, 2022, 2676 (02) : 44 - 58
  • [48] Multivariate outlier detection in Stata
    Verardi, Vincenzo
    Dehon, Catherine
    STATA JOURNAL, 2010, 10 (02): : 259 - 266
  • [49] OUTLIER DETECTION IN MULTIVARIATE CALIBRATION
    WALCZAK, B
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1995, 28 (02) : 259 - 272
  • [50] Robust Fitting of a Wrapped Normal Model to Multivariate Circular Data and Outlier Detection
    Greco, Luca
    Saraceno, Giovanni
    Agostinelli, Claudio
    STATS, 2021, 4 (02): : 454 - 471