Cluster-based outlier detection

被引:133
|
作者
Duan, Lian [1 ]
Xu, Lida [2 ,3 ]
Liu, Ying [4 ]
Lee, Jun [5 ]
机构
[1] Univ Iowa, Dept Management Sci, Iowa City, IA 52242 USA
[2] Beijing Jiaotong Univ, Coll Econ & Management, Beijing 100044, Peoples R China
[3] Old Dominion Univ, Dept Informat Technol & Decis Sci, Norfolk, VA 23529 USA
[4] Chinese Acad Sci, Res Ctr Fictitious Econ & Data Sci, Beijing, Peoples R China
[5] Chinese Acad Sci, China Sci & Technol Network, Beijing, Peoples R China
关键词
Outlier detection; Cluster-based outlier; LDBSCAN; Local outlier factor; FEATURE SPACE THEORY;
D O I
10.1007/s10479-008-0371-9
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Outlier detection has important applications in the field of data mining, such as fraud detection, customer behavior analysis, and intrusion detection. Outlier detection is the process of detecting the data objects which are grossly different from or inconsistent with the remaining set of data. Outliers are traditionally considered as single points; however, there is a key observation that many abnormal events have both temporal and spatial locality, which might form small clusters that also need to be deemed as outliers. In other words, not only a single point but also a small cluster can probably be an outlier. In this paper, we present a new definition for outliers: cluster-based outlier, which is meaningful and provides importance to the local data behavior, and how to detect outliers by the clustering algorithm LDBSCAN (Duan et al. in Inf. Syst. 32(7):978-986, 2007) which is capable of finding clusters and assigning LOF (Breunig et al. in Proceedings of the 2000 ACM SIG MOD International Conference on Manegement of Data, ACM Press, pp. 93-104, 2000) to single points.
引用
收藏
页码:151 / 168
页数:18
相关论文
共 50 条
  • [1] Cluster-based outlier detection
    Lian Duan
    Lida Xu
    Ying Liu
    Jun Lee
    [J]. Annals of Operations Research, 2009, 168 : 151 - 168
  • [2] A Cluster-Based Outlier Detection Scheme for Multivariate Data
    Jobe, J. Marcus
    Pokojovy, Michael
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2015, 110 (512) : 1543 - 1551
  • [3] Cluster-Based Outlier Detection Using Unsupervised Extreme Learning Machines
    Wang, Xite
    Shen, Derong
    Bai, Mei
    Nie, Tiezheng
    Kou, Yue
    Yu, Ge
    [J]. PROCEEDINGS OF ELM-2015, VOL 1: THEORY, ALGORITHMS AND APPLICATIONS (I), 2016, 6 : 135 - 146
  • [4] A cluster-based Outlier detection method without pre-clustering
    Ren, DM
    Wang, BY
    Perrizo, W
    [J]. COMPUTER APPLICATIONS IN INDUSTRY AND ENGINEERING, 2004, : 177 - 180
  • [5] From Cluster-Based Outlier Detection to Time Series Discord Discovery
    Nguyen Huy Kha
    Duong Tuan Anh
    [J]. TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2015, 2015, 9441 : 16 - 28
  • [6] A Distributed Algorithm for the Cluster-Based Outlier Detection Using Unsupervised Extreme Learning Machines
    Wang, Xite
    Bai, Mei
    Shen, Derong
    Nie, Tiezheng
    Kou, Yue
    Yu, Ge
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2017, 2017
  • [7] A Comparative Study of Cluster Based Outlier Detection, Distance Based Outlier Detection and Density Based Outlier Detection Techniques
    Mandhare, Harshada C.
    Idate, S. R.
    [J]. 2017 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2017, : 931 - 935
  • [8] Outlier detection based on cluster outlier factor and mutual density
    Zhang, Zhongping
    Zhu, Mengfan
    Qiu, Jingyang
    Liu, Cong
    Zhang, Debin
    Qi, Jie
    [J]. International Journal of Intelligent Information and Database Systems, 2019, 12 (1-2) : 91 - 108
  • [9] Outlier detection based on cluster outlier factor and mutual density
    Zhang, Zhongping
    Qiu, Jingyang
    Liu, Cong
    Zhu, Mengfan
    Zhang, Debin
    [J]. Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2019, 25 (09): : 2314 - 2323
  • [10] Residues Cluster-Based Segmentation and Outlier-Detection Method for Large-Scale Phase Unwrapping
    Yu, Hanwen
    Li, Zhenfang
    Bao, Zheng
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2011, 20 (10) : 2865 - 2875