Detecting Clusters and Outliers for Multi-dimensional Data

Cited by: 4
Author
Shi, Yong [1]
Affiliation
[1] Kennesaw State Univ, Dept Comp Sci & Informat Syst, Kennesaw, GA 30144 USA
DOI
10.1109/MUE.2008.19
Chinese Library Classification
TP31 [Computer Software]
Discipline classification code
081202; 0835
Abstract
Many data mining algorithms focus on clustering, and many approaches have also been designed for outlier detection. We observe that, in many situations, clusters and outliers are concepts whose meanings are inseparable from each other, especially for data sets with noise. It is therefore necessary to treat clusters and outliers as concepts of equal importance in data analysis. In this paper we present a cluster-outlier iterative detection algorithm that detects the clusters and outliers in noisy data sets from a different perspective. In this algorithm, clusters are detected and adjusted according to the intra-relationship within clusters and the inter-relationship between clusters and outliers, and vice versa. The adjustment and modification of the clusters and outliers are performed iteratively until a termination condition is reached. This data processing algorithm can be applied in many fields, such as pattern recognition, data clustering, and signal processing.
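The abstract gives no pseudocode, so the following Python sketch only illustrates the general idea of alternating between cluster adjustment and outlier adjustment until nothing changes. It is not the paper's algorithm: it swaps in a k-means-style centre update and a simple distance rule (a point is an outlier if it lies farther from its nearest centre than `cutoff` times the median inlier distance). The function name, the `cutoff` parameter, and the choice of the first `k` rows as initial centres are all assumptions made for illustration.

```python
import numpy as np

def detect_clusters_and_outliers(X, k, max_iter=50, cutoff=3.0):
    """Illustrative sketch only (not the paper's method).

    Alternates between (a) re-flagging outliers from their distance to the
    current cluster centres and (b) re-fitting the centres on inliers only.
    Returns (labels, centers); outliers get label -1.
    """
    X = np.asarray(X, dtype=float)
    n = len(X)
    centers = X[:k].copy()              # assumption: first k rows seed the centres
    outlier = np.zeros(n, dtype=bool)

    def nearest(centers):
        # Distance from every point to every centre, shape (n, k).
        dist = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        labels = dist.argmin(axis=1)
        return labels, dist[np.arange(n), labels]

    for _ in range(max_iter):
        labels, d = nearest(centers)
        # Outlier step: threshold derived from the current inliers only.
        threshold = cutoff * max(np.median(d[~outlier]), 1e-12)
        new_outlier = d > threshold
        # Cluster step: recompute each centre from its inlier members.
        new_centers = centers.copy()
        for j in range(k):
            members = X[(labels == j) & ~new_outlier]
            if len(members):
                new_centers[j] = members.mean(axis=0)
        # Termination condition: neither the outlier set nor the centres moved.
        if np.array_equal(new_outlier, outlier) and np.allclose(new_centers, centers):
            break
        outlier, centers = new_outlier, new_centers

    labels, _ = nearest(centers)
    return np.where(outlier, -1, labels), centers

# Two tight groups plus two stray points (hand-made toy data).
X = np.array([
    [0.0, 0.0], [5.0, 5.0],
    [0.1, 0.0], [0.0, 0.1], [-0.1, 0.0], [0.0, -0.1],
    [5.1, 5.0], [5.0, 5.1], [4.9, 5.0], [5.0, 4.9],
    [2.5, 10.0], [10.0, -3.0],
])
labels, centers = detect_clusters_and_outliers(X, k=2)
```

On this toy input the two stray points end up with label -1 and are excluded from the centre estimates, so the centres stay near (0, 0) and (5, 5). A fuller treatment would replace the distance rule with whatever intra-cluster and cluster-outlier relationship measures the paper defines.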
Pages: 429-432
Page count: 4