Cluster-based outlier detection

被引:144
|
作者
Duan, Lian [1 ]
Xu, Lida [2 ,3 ]
Liu, Ying [4 ]
Lee, Jun [5 ]
机构
[1] Univ Iowa, Dept Management Sci, Iowa City, IA 52242 USA
[2] Beijing Jiaotong Univ, Coll Econ & Management, Beijing 100044, Peoples R China
[3] Old Dominion Univ, Dept Informat Technol & Decis Sci, Norfolk, VA 23529 USA
[4] Chinese Acad Sci, Res Ctr Fictitious Econ & Data Sci, Beijing, Peoples R China
[5] Chinese Acad Sci, China Sci & Technol Network, Beijing, Peoples R China
关键词
Outlier detection; Cluster-based outlier; LDBSCAN; Local outlier factor; FEATURE SPACE THEORY;
D O I
10.1007/s10479-008-0371-9
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Outlier detection has important applications in the field of data mining, such as fraud detection, customer behavior analysis, and intrusion detection. Outlier detection is the process of detecting the data objects which are grossly different from or inconsistent with the remaining set of data. Outliers are traditionally considered as single points; however, there is a key observation that many abnormal events have both temporal and spatial locality, which might form small clusters that also need to be deemed as outliers. In other words, not only a single point but also a small cluster can probably be an outlier. In this paper, we present a new definition for outliers: cluster-based outlier, which is meaningful and provides importance to the local data behavior, and how to detect outliers by the clustering algorithm LDBSCAN (Duan et al. in Inf. Syst. 32(7):978-986, 2007) which is capable of finding clusters and assigning LOF (Breunig et al. in Proceedings of the 2000 ACM SIG MOD International Conference on Manegement of Data, ACM Press, pp. 93-104, 2000) to single points.
引用
收藏
页码:151 / 168
页数:18
相关论文
共 50 条
  • [21] Cluster-based multivariate outlier identification and re-weighted regression in linear models
    Alih, Ekele
    Ong, Hong Choon
    JOURNAL OF APPLIED STATISTICS, 2015, 42 (05) : 938 - 955
  • [22] Cluster-based Intrusion Detection Method for Internet of Things
    Choudhary, Sarika
    Kesswani, Nishtha
    2019 IEEE/ACS 16TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA 2019), 2019,
  • [23] UNSUPERVISED PARSIMONIOUS CLUSTER-BASED ANOMALY DETECTION (PCAD)
    Miller, David J.
    Kesidis, George
    Qiu, Zhicong
    2018 IEEE 28TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2018,
  • [24] A fast and noise resilient cluster-based anomaly detection
    Bigdeli, Elnaz
    Mohammadi, Mahdi
    Raahemi, Bijan
    Matwin, Stan
    PATTERN ANALYSIS AND APPLICATIONS, 2017, 20 (01) : 183 - 199
  • [25] Forward vehicle detection using cluster-based AdaBoost
    Baek, Yeul-Min
    Kim, Whoi-Yul
    OPTICAL ENGINEERING, 2014, 53 (10)
  • [26] Cluster-based Voice Activity Detection for Mobile Devices
    Park, Sangjun
    Lee, Seunghyung
    Park, Jinuk
    Hahn, Minsoo
    2016 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2016,
  • [27] An adaptive approach for cluster-based intrusion detection in VANET
    Muthumeenakshi, R.
    Katharine, A. Vanitha
    INTERNATIONAL JOURNAL OF BIO-INSPIRED COMPUTATION, 2022, 20 (01) : 58 - +
  • [28] Cluster-based Sorted Neighborhood for Efficient Duplicate Detection
    Samiei, Ahmad
    Naumann, Felix
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2016, : 202 - 209
  • [29] A fast and noise resilient cluster-based anomaly detection
    Elnaz Bigdeli
    Mahdi Mohammadi
    Bijan Raahemi
    Stan Matwin
    Pattern Analysis and Applications, 2017, 20 : 183 - 199
  • [30] Cluster-Based Boosting
    Miller, L. Dee
    Soh, Leen-Kiat
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2015, 27 (06) : 1491 - 1504