Two-phase clustering process for outliers detection

被引:227
|
作者
Jiang, MF [1 ]
Tseng, SS [1 ]
Su, CM [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Comp & Informat Sci, Hsinchu 30050, Taiwan
关键词
outliers; k-means clustering; two-phase clustering; MST;
D O I
10.1016/S0167-8655(00)00131-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a two-phase clustering algorithm for outliers detection is proposed. Tn;e first modify the traditional k-means algorithm in Phase 1 by using a heuristic "if one new input pattern is far enough away from all clusters centers, then assign it as a new cluster center". It results that the data points in the same cluster may be most likely all outliers or all non-outliers. And then we construct a minimum spanning tree (MST) in Phase 2 and remove the longest edge. The small clusters, the tree with less number of nodes, are selected and regarded as outlier. The experimental results show that our process works well. (C) 2001 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:691 / 700
页数:10
相关论文
共 50 条
  • [21] Outliers Detection Strategy for a Curve Clustering Algorithm
    Antonio, Balzanella
    Romano, Elvira
    Verde, Rosanna
    DATA ANALYSIS AND CLASSIFICATION, 2010, : 391 - +
  • [22] The timeline of mentalization: Distinguishing a two-phase process from mind detection to mind attribution
    Ruzzante, Daniela
    Vaes, Jeroen
    NEUROPSYCHOLOGIA, 2021, 160
  • [23] A two-phase clustering algorithm based on artificial immune network
    Zhong, J
    Wu, ZF
    Wu, KG
    Ou, L
    Zhu, ZZ
    Zhou, Y
    ADVANCES IN NATURAL COMPUTATION, PT 2, PROCEEDINGS, 2005, 3611 : 814 - 821
  • [24] TPACC: A novel two-phase ant colony clustering algorithm
    School of Computer Science and Technology, Beijing Institute of Technology, Beijing 100081, China
    不详
    J. Comput. Inf. Syst., 2006, 4 (1211-1218):
  • [25] Two-phase clustering algorithm with density exploring distance measure
    Ma, Jingjing
    Jiang, Xiangming
    Gong, Maoguo
    CAAI TRANSACTIONS ON INTELLIGENCE TECHNOLOGY, 2018, 3 (01) : 59 - 64
  • [26] Two-Phase Genetic Algorithm for Social Network Graphs Clustering
    Kohout, Jan
    Neruda, Roman
    2013 IEEE 27TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS (WAINA), 2013, : 197 - 202
  • [27] Mining entity latent semantic relationships by two-phase clustering
    Zhao, Ke
    Li, Qingzhong
    Yan, Zhongmin
    Li, Hui
    Chen, Zhiyong
    Journal of Computational Information Systems, 2015, 11 (21): : 7731 - 7739
  • [28] Analysis on the superiority of phase separation in two-phase anerobic process
    Harbin Jianzhu Daxue Xuebao/Journal of Harbin University of Civil Engineering and Architecture, 31 (02): : 50 - 56
  • [29] Phase detection aided thermometry (PDaT) for two-phase flow
    Takeyama, Mao
    Kunugi, Tomoaki
    Yokomine, Takehiko
    Kawara, Zensaku
    INTERNATIONAL JOURNAL OF HEAT AND MASS TRANSFER, 2018, 118 : 492 - 497
  • [30] RUL prediction based on two-phase wiener process
    Liu, Kai
    Zou, Tian-Ji
    Xin, Min-Cheng
    Lv, Cong-Min
    QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2022, 38 (07) : 3829 - 3843