Multiple Outlier Detection: Hypothesis Tests versus Model Selection by Information Criteria

被引:45
|
作者
Lehmann, Ruediger [1 ]
Loesler, Michael [2 ]
机构
[1] Univ Appl Sci Dresden, Fac Spatial Informat, Friedrich List Pl 1, D-01069 Dresden, Germany
[2] Frankfurt Univ Appl Sci, Fac Architecture Civil Engn & Geomat, Lab Ind Metrol, Nibelungenpl 1, D-60318 Frankfurt, Germany
关键词
Least-squares adjustment; Outlier detection; Hypothesis test; Information criterion; Akaike information criterion (AIC); Data snooping; Model selection; REGRESSION;
D O I
10.1061/(ASCE)SU.1943-5428.0000189
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
The detection of multiple outliers can be interpreted as a model selection problem. Models that can be selected are the null model, which indicates an outlier free set of observations, or a class of alternative models, which contain a set of additional bias parameters. A common way to select the right model is by using a statistical hypothesis test. In geodesy data snooping is most popular. Another approach arises from information theory. Here, the Akaike information criterion (AIC) is used to select an appropriate model for a given set of observations. The AIC is based on the Kullback-Leibler divergence, which describes the discrepancy between the model candidates. Both approaches are discussed and applied to test problems: the fitting of a straight line and a geodetic network. Some relationships between data snooping and information criteria are discussed. When compared, it turns out that the information criteria approach is more simple and elegant. Along with AIC there are many alternative information criteria for selecting different outliers, and it is not clear which one is optimal. (C) 2016 American Society of Civil Engineers.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] MODEL SELECTION CRITERIA AND GRANGER CAUSALITY TESTS - AN EMPIRICAL NOTE
    URBAIN, JP
    ECONOMICS LETTERS, 1989, 29 (04) : 317 - 320
  • [42] Multiple criteria linear programming model for portfolio selection
    Włodzimierz Ogryczak
    Annals of Operations Research, 2000, 97 : 143 - 162
  • [43] Exact posterior distributions and model selection criteria for multiple change-point detection problems
    Rigaill, G.
    Lebarbier, E.
    Robin, S.
    STATISTICS AND COMPUTING, 2012, 22 (04) : 917 - 929
  • [45] Exact posterior distributions and model selection criteria for multiple change-point detection problems
    G. Rigaill
    E. Lebarbier
    S. Robin
    Statistics and Computing, 2012, 22 : 917 - 929
  • [46] Model selection using information criteria, but is the "best" model any good?
    Mac Nally, Ralph
    Duncan, Richard P.
    Thomson, James R.
    Yen, Jian D. L.
    JOURNAL OF APPLIED ECOLOGY, 2018, 55 (03) : 1441 - 1444
  • [47] Extended Bayesian information criteria for model selection with large model spaces
    Chen, Jiahua
    Chen, Zehua
    BIOMETRIKA, 2008, 95 (03) : 759 - 771
  • [48] Nonlinear predictive model selection and model averaging using information criteria
    Gu, Yuanlin
    Wei, Hua-Liang
    Balikhin, Michael M.
    SYSTEMS SCIENCE & CONTROL ENGINEERING, 2018, 6 (01) : 319 - 328
  • [49] Outlier Detection in Multiple Circular Regression Model using DFFITc Statistic
    Alkasadi, Najla Ahmed
    Ibrahim, Safwati
    Abuzaid, Au H. M.
    Yusoff, Mohd Irwan
    Hamid, Hashibah
    Zhe, Leow Wai
    Abd Razak, Amelia B. T.
    SAINS MALAYSIANA, 2019, 48 (07): : 1557 - 1563
  • [50] A Comparative analysis of multiple outlier detection procedures in the linear regression model
    Wisnowski, JW
    Montgomery, DC
    Simpson, JR
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2001, 36 (03) : 351 - 382