In this paper, a two-phase clustering algorithm for outliers detection is proposed. Tn;e first modify the traditional k-means algorithm in Phase 1 by using a heuristic "if one new input pattern is far enough away from all clusters centers, then assign it as a new cluster center". It results that the data points in the same cluster may be most likely all outliers or all non-outliers. And then we construct a minimum spanning tree (MST) in Phase 2 and remove the longest edge. The small clusters, the tree with less number of nodes, are selected and regarded as outlier. The experimental results show that our process works well. (C) 2001 Elsevier Science B.V. All rights reserved.
机构:
China Univ Min & Technol, Sch Environm & Spatial Informat, Xuzhou 221116, Jiangsu, Peoples R ChinaChina Univ Min & Technol, Sch Environm & Spatial Informat, Xuzhou 221116, Jiangsu, Peoples R China
Li, Feng
Shi, Wenzhong
论文数: 0引用数: 0
h-index: 0
机构:
Hong Kong Polytech Univ, Smart Cities Res Inst, Dept Land Surveying & Geoinformat, Hung Hom,Kowloon, Hong Kong, Peoples R ChinaChina Univ Min & Technol, Sch Environm & Spatial Informat, Xuzhou 221116, Jiangsu, Peoples R China
Shi, Wenzhong
Zhang, Hua
论文数: 0引用数: 0
h-index: 0
机构:
China Univ Min & Technol, Sch Environm & Spatial Informat, Xuzhou 221116, Jiangsu, Peoples R ChinaChina Univ Min & Technol, Sch Environm & Spatial Informat, Xuzhou 221116, Jiangsu, Peoples R China