Statistical Analysis of Nearest Neighbor Methods for Anomaly Detection

被引:0
|
作者
Gu, Xiaoyi [1 ]
Akoglu, Leman [2 ]
Rinaldo, Alessandro [1 ]
机构
[1] Carnegie Mellon Univ, Dept Stat & Data Sci, Pittsburgh, PA 15213 USA
[2] Carnegie Mellon Univ, Heinz Coll Informat Syst & Publ Policy, Pittsburgh, PA 15213 USA
关键词
SUPPORT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nearest-neighbor (NN) procedures are well studied and widely used in both supervised and unsupervised learning problems. In this paper we are concerned with investigating the performance of NN-based methods for anomaly detection. We first show through extensive simulations that NN methods compare favorably to some of the other state-of-the-art algorithms for anomaly detection based on a set of benchmark synthetic datasets. We further consider the performance of NN methods on real datasets, and relate it to the dimensionality of the problem. Next, we analyze the theoretical properties of NN-methods for anomaly detection by studying a more general quantity called distance-to-measure (DTM), originally developed in the literature on robust geometric and topological inference. We provide finite-sample uniform guarantees for the empirical DTM and use them to derive misclassification rates for anomalous observations under various settings. In our analysis we rely on Huber's contamination model and formulate mild geometric regularity assumptions on the underlying distribution of the data.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] An Anomaly Detection Framework Based on Autoencoder and Nearest Neighbor
    Guo, Jia
    Liu, Guannan
    Zuo, Yuan
    Wu, Junjie
    [J]. 2018 15TH INTERNATIONAL CONFERENCE ON SERVICE SYSTEMS AND SERVICE MANAGEMENT (ICSSSM), 2018,
  • [2] A Review of Anomaly Detection Techniques Based on Nearest Neighbor
    Zhao, Ming
    Chen, Jingchao
    Li, Yang
    [J]. PROCEEDINGS OF THE 2018 INTERNATIONAL CONFERENCE ON COMPUTER MODELING, SIMULATION AND ALGORITHM (CMSA 2018), 2018, 151 : 290 - 292
  • [3] CANF: Clustering and anomaly detection method using nearest and farthest neighbor
    Faroughi, Azadeh
    Javidan, Reza
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 89 : 166 - 177
  • [4] Anomaly detection based on improved k-nearest neighbor rough sets
    Chen, Xiwen
    Yuan, Zhong
    Feng, Shan
    [J]. International Journal of Approximate Reasoning, 1600, Elsevier Inc. (176):
  • [5] Isolation-based anomaly detection using nearest-neighbor ensembles
    Bandaragoda, Tharindu R.
    Ting, Kai Ming
    Albrecht, David
    Liu, Fei Tony
    Zhu, Ye
    Wells, Jonathan R.
    [J]. COMPUTATIONAL INTELLIGENCE, 2018, 34 (04) : 968 - 998
  • [6] Anomaly Detection Based on Local Nearest Neighbor Distance Descriptor in Crowded Scenes
    Hu, Xing
    Hu, Shiqiang
    Zhang, Xiaoyu
    Zhang, Huanlong
    Luo, Lingkun
    [J]. SCIENTIFIC WORLD JOURNAL, 2014,
  • [7] Log-Based Anomaly Detection with the Improved K-Nearest Neighbor
    Wang, Bingming
    Ying, Shi
    Cheng, Guoli
    Wang, Rui
    Yang, Zhe
    Dong, Bo
    [J]. INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2020, 30 (02) : 239 - 262
  • [8] A BIBLIOGRAPHY OF NEAREST NEIGHBOR METHODS IN DESIGN AND ANALYSIS OF EXPERIMENTS
    GILL, PS
    [J]. BIOMETRICAL JOURNAL, 1991, 33 (04) : 455 - 459
  • [9] STATISTICAL ANALYSIS OF k-NEAREST NEIGHBOR COLLABORATIVE RECOMMENDATION
    Biau, Gerard
    Cadre, Benoit
    Rouviere, Laurent
    [J]. ANNALS OF STATISTICS, 2010, 38 (03): : 1568 - 1592
  • [10] USE OF NEAREST NEIGHBOR METHODS
    DEVOS, S
    [J]. TIJDSCHRIFT VOOR ECONOMISCHE EN SOCIALE GEOGRAFIE, 1973, 64 (05) : 307 - 319