Two-phase clustering process for outliers detection

Cited by: 227
Authors
Jiang, MF [1]
Tseng, SS [1]
Su, CM [1]
Affiliation
[1] Natl Chiao Tung Univ, Dept Comp & Informat Sci, Hsinchu 30050, Taiwan
Keywords
outliers; k-means clustering; two-phase clustering; MST;
DOI
10.1016/S0167-8655(00)00131-8
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, a two-phase clustering algorithm for outlier detection is proposed. In Phase 1, we first modify the traditional k-means algorithm by using the heuristic "if a new input pattern is far enough away from all cluster centers, then assign it as a new cluster center". As a result, the data points in the same cluster are most likely either all outliers or all non-outliers. In Phase 2, we then construct a minimum spanning tree (MST) and remove its longest edge. The small clusters, that is, the trees with fewer nodes, are selected and regarded as outliers. The experimental results show that our process works well. (C) 2001 Elsevier Science B.V. All rights reserved.
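The two phases described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the function names, the fixed distance threshold `new_center_dist`, the single-pass running-mean center update, the choice of Prim's algorithm, and building the MST over the Phase 1 cluster centers are all assumptions made here for the example.

```python
import math

def phase1_modified_kmeans(points, new_center_dist):
    """Phase 1 sketch: a point farther than new_center_dist (assumed fixed
    threshold) from every current center becomes a new cluster center;
    otherwise it joins its nearest cluster."""
    centers, counts, labels = [], [], []
    for p in points:
        dists = [math.dist(p, c) for c in centers]
        if not dists or min(dists) > new_center_dist:
            centers.append(list(p))
            counts.append(1)
            labels.append(len(centers) - 1)
        else:
            j = dists.index(min(dists))
            counts[j] += 1
            # Running-mean update of the nearest center (an assumption).
            centers[j] = [c + (x - c) / counts[j] for c, x in zip(centers[j], p)]
            labels.append(j)
    return centers, labels

def phase2_mst_split(centers):
    """Phase 2 sketch: build an MST over the centers with Prim's algorithm,
    remove the longest edge, and return the two node sets, smaller first.
    Requires at least two centers."""
    n = len(centers)
    in_tree, edges = {0}, []
    while len(in_tree) < n:
        # Cheapest edge leaving the current tree.
        w, u, v = min(
            (math.dist(centers[a], centers[b]), a, b)
            for a in in_tree for b in range(n) if b not in in_tree
        )
        edges.append((w, u, v))
        in_tree.add(v)
    edges.remove(max(edges))  # drop the longest MST edge
    adj = {i: [] for i in range(n)}
    for _, u, v in edges:
        adj[u].append(v)
        adj[v].append(u)
    # Collect the subtree containing node 0; the rest is the other subtree.
    seen, stack = {0}, [0]
    while stack:
        for nb in adj[stack.pop()]:
            if nb not in seen:
                seen.add(nb)
                stack.append(nb)
    return sorted([seen, set(range(n)) - seen], key=len)

def detect_outliers(points, new_center_dist):
    """Return indices of points whose cluster lies in the smaller subtree."""
    centers, labels = phase1_modified_kmeans(points, new_center_dist)
    if len(centers) < 2:
        return []
    small, _ = phase2_mst_split(centers)
    return [i for i, lab in enumerate(labels) if lab in small]

points = [(0, 0), (0.1, 0), (3, 0), (3.1, 0), (6, 0), (6.1, 0), (50, 50)]
print(detect_outliers(points, new_center_dist=1.0))
```

In this toy input the isolated point at (50, 50) forms its own cluster in Phase 1 and ends up alone in the smaller subtree after the longest MST edge is removed. When both subtrees have the same number of nodes the sketch picks one arbitrarily; the abstract does not say how such ties are handled.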
Pages: 691 - 700
Page count: 10
Related papers
50 in total
  • [41] Two-Phase Multiobjective Genetic Algorithm for Constrained Circuit Clustering on FPGAs
    Wang, Yuan
    Walker, James Alfred
    Bale, Simon J.
    Trefzer, Martin A.
    Tyrrell, Andy M.
    2015 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2015, : 1183 - 1190
  • [42] TPICDS: A Two-Phase Parallel Approach for Incremental Clustering of Data Streams
    Alazeez, Ammar Al Abd
    Jassim, Sabah
    Du, Hongbo
    EURO-PAR 2018: PARALLEL PROCESSING WORKSHOPS, 2019, 11339 : 5 - 16
  • [43] WCDS: A Two-Phase Weightless Neural System for Data Stream Clustering
    Cardoso, Douglas O.
    Franca, Felipe M. G.
    Gama, Joao
    NEW GENERATION COMPUTING, 2017, 35 (04) : 391 - 416
  • [44] A two-phase heuristic for the bottleneck k-hyperplane clustering problem
    Amaldi, Edoardo
    Dhyani, Kanika
    Liberti, Leo
    COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2013, 56 (03) : 619 - 633
  • [45] Website Community Mining from Query Logs with Two-Phase Clustering
    Bing, Lidong
    Lam, Wai
    Jameel, Shoaib
    Lu, Chunliang
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, CICLING 2014, PART II, 2014, 8404 : 201 - 212
  • [46] A two-phase heuristic for the bottleneck k-hyperplane clustering problem
    Edoardo Amaldi
    Kanika Dhyani
    Leo Liberti
    Computational Optimization and Applications, 2013, 56 : 619 - 633
  • [47] A Two-Phase Clustering Algorithm to Tackle Mobility in Mobile Sensor Networks
    Khoshkholghi, Mohammad Ali
    Abdullah, Azizol
    Subramaniam, Shamala
    Othman, Mohamed
    PROCEEDINGS OF THE 2013 ASIA-PACIFIC COMPUTATIONAL INTELLIGENCE AND INFORMATION TECHNOLOGY CONFERENCE, 2013, : 121 - 128
  • [48] Two-phase collaborative filtering algorithm based on co-clustering
    Wu H.
    Wang Y.-J.
    Wang Z.
    Wang X.-L.
    Du S.-Z.
    Ruan Jian Xue Bao/Journal of Software, 2010, 21 (05): : 1042 - 1054
  • [49] A Two-Phase Object Detection Solution for Aerial Images
    Xing, Chen
    Liang, Xi
    Zhang, Pengliang
    2020 IEEE 8TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2020, : 119 - 122
  • [50] A rotationally invariant two-phase scheme for corner detection
    Sheu, HT
    Hu, WC
    PATTERN RECOGNITION, 1996, 29 (05) : 819 - 828