Multiple Outlier Detection: Hypothesis Tests versus Model Selection by Information Criteria

被引:45
|
作者
Lehmann, Ruediger [1 ]
Loesler, Michael [2 ]
机构
[1] Univ Appl Sci Dresden, Fac Spatial Informat, Friedrich List Pl 1, D-01069 Dresden, Germany
[2] Frankfurt Univ Appl Sci, Fac Architecture Civil Engn & Geomat, Lab Ind Metrol, Nibelungenpl 1, D-60318 Frankfurt, Germany
关键词
Least-squares adjustment; Outlier detection; Hypothesis test; Information criterion; Akaike information criterion (AIC); Data snooping; Model selection; REGRESSION;
D O I
10.1061/(ASCE)SU.1943-5428.0000189
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
The detection of multiple outliers can be interpreted as a model selection problem. Models that can be selected are the null model, which indicates an outlier free set of observations, or a class of alternative models, which contain a set of additional bias parameters. A common way to select the right model is by using a statistical hypothesis test. In geodesy data snooping is most popular. Another approach arises from information theory. Here, the Akaike information criterion (AIC) is used to select an appropriate model for a given set of observations. The AIC is based on the Kullback-Leibler divergence, which describes the discrepancy between the model candidates. Both approaches are discussed and applied to test problems: the fitting of a straight line and a geodetic network. Some relationships between data snooping and information criteria are discussed. When compared, it turns out that the information criteria approach is more simple and elegant. Along with AIC there are many alternative information criteria for selecting different outliers, and it is not clear which one is optimal. (C) 2016 American Society of Civil Engineers.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] On measures of information and divergence and model selection criteria
    Karagrigoriou, Alex
    Papaioannou, Takis
    STATISTICAL MODELS AND METHODS FOR BIOMEDICAL AND TECHNICAL SYSTEMS, 2008, : 503 - +
  • [22] An assessment of information criteria for motion model selection
    Torr, PHS
    1997 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, PROCEEDINGS, 1997, : 47 - 52
  • [23] Model selection rates of information based criteria
    Chaurasia, Ashok
    Harel, Ofer
    ELECTRONIC JOURNAL OF STATISTICS, 2013, 7 : 2762 - 2793
  • [24] A comparative study of information criteria for model selection
    Nakamura, Tomomichi
    Judd, Kevin
    Mees, Alistair I.
    Small, Michael
    INTERNATIONAL JOURNAL OF BIFURCATION AND CHAOS, 2006, 16 (08): : 2153 - 2175
  • [25] Information Criteria in Model Selection for Mixing Processes
    Masayuki Uchida
    Nakahiro Yoshida
    Statistical Inference for Stochastic Processes, 2001, 4 (1) : 73 - 98
  • [26] A multi-source information fusion model for outlier detection
    Zhang, Pengfei
    Li, Tianrui
    Wang, Guoqiang
    Wang, Dexian
    Lai, Pei
    Zhang, Fan
    INFORMATION FUSION, 2023, 93 : 192 - 208
  • [27] Consistency of detection of the number of signals using multiple hypothesis tests
    Chung, Pei-Jung
    2007 IEEE International Conference on Acoustics, Speech, and Signal Processing, Vol II, Pts 1-3, 2007, : 1125 - 1128
  • [28] Nonexistence of Rigorous Tests for Multiple Outlier Detection in Least-Squares Adjustment
    Baselga, Sergio
    JOURNAL OF SURVEYING ENGINEERING, 2011, 137 (03) : 109 - 112
  • [29] Unsupervised Feature Selection for Outlier Detection in Categorical Data using Mutual Information
    Suri, N. N. R. Ranga
    Murty, M. Narasimha
    Athithan, G.
    2012 12TH INTERNATIONAL CONFERENCE ON HYBRID INTELLIGENT SYSTEMS (HIS), 2012, : 253 - 258
  • [30] Enhancing Detection Model for Multiple Hypothesis Tracking
    Chen, Jiahui
    Sheng, Hao
    Zhang, Yang
    Xiong, Zhang
    2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 2143 - 2152