Visual analytics for the clustering capability of data

被引:0
|
作者
ZhiMao Lu
Chen Liu
Qi Zhang
ChunXiang Zhang
DongMei Fan
Peng Yang
机构
[1] Harbin Engineering University,Pattern Recognition and Natural Computation Laboratory
[2] Dalian University of Technology,School of Computer Science and Technology
[3] Harbin University of Science and Technology,School of Software
来源
关键词
data mining; clustering analysis; visual analysis; minimum distance spectrum; nearest neighbor spectrum; outliers;
D O I
暂无
中图分类号
学科分类号
摘要
Clustering analysis is an unsupervised method to find hidden structures in datasets and has been widely used in various fields. However, it is always difficult for users to understand, evaluate, and explain the clustering results in the spaces with dimension greater than three. Although high-dimensional visualization of clustering technology can express clustering results well, it still has significant limitations. In this paper, a visualization cluster analysis method based on the minimum distance spectrum (MinDS) is proposed, aimed at reducing the problems of clustering multidimensional datasets. First, the concept of MinDS is defined based on the distance between high-dimensional data. MinDS can map any dataset from high-dimensional space to a lower dimension to determine whether the data set is separable. Next, a clustering method which can automatically determine the number of categories is designed based on MinDS. This method is not only able to cluster a dataset with clear boundaries, but can also cluster the dataset with fuzzy boundaries through the edge corrosion strategy based on the energy of each data point. In addition, strategies for removing noise and identifying outliers are designed to clean datasets according to the characteristics of MinDS. The experimental results presented validate the feasibility and effectiveness of the proposed schemes and show that the proposed approach is simple, stable, and efficient, and can achieve multidimensional visualization cluster analysis of complex datasets.
引用
下载
收藏
页码:1 / 14
页数:13
相关论文
共 50 条
  • [1] Visual analytics for the clustering capability of data
    LU ZhiMao
    LIU Chen
    ZHANG Qi
    ZHANG ChunXiang
    FAN DongMei
    YANG Peng
    Science China(Information Sciences), 2013, 56 (05) : 131 - 144
  • [2] Visual analytics for the clustering capability of data
    Lu ZhiMao
    Liu Chen
    Zhang Qi
    Zhang ChunXiang
    Fan DongMei
    Yang Peng
    SCIENCE CHINA-INFORMATION SCIENCES, 2013, 56 (05) : 1 - 14
  • [3] A Visual Analytics Framework for Interactively Clustering Scent Data
    Huang L.
    Zhang J.
    Wu H.
    Lu Q.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2020, 32 (07): : 1026 - 1041
  • [4] Clustering and Classification for Time Series Data in Visual Analytics: A Survey
    Ali, Mohammed
    Alqahtani, Ali
    Jones, Mark W.
    Xie, Xianghua
    IEEE ACCESS, 2019, 7 : 181314 - 181338
  • [5] Automated Clustering for Data Analytics
    Byrnes, Paul E.
    JOURNAL OF EMERGING TECHNOLOGIES IN ACCOUNTING, 2019, 16 (02) : 43 - 58
  • [6] A Novel Visual analytics Approach for Clustering Large-Scale Social Data
    Wang, Zhangye
    Zhou, Juanxia
    Chen, Wei
    Chen, Chang
    Liao, Jiyuan
    Maciejewski, Ross
    2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA, 2013,
  • [7] A Visual Analytics Approach for Radar Signal Clustering
    Zhao Y.
    Qian W.
    Li Y.
    Zhang R.
    Wu Q.
    Chen B.
    Zhou F.
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2019, 31 (10): : 1653 - 1665
  • [8] A Visual Analytics Approach for Interactive Document Clustering
    Sherkat, Ehsan
    Milios, Evangelos E.
    Minghim, Rosane
    ACM TRANSACTIONS ON INTERACTIVE INTELLIGENT SYSTEMS, 2020, 10 (01)
  • [9] XCluSim: a visual analytics tool for interactively comparing multiple clustering results of bioinformatics data
    Sehi L'Yi
    Bongkyung Ko
    DongHwa Shin
    Young-Joon Cho
    Jaeyong Lee
    Bohyoung Kim
    Jinwook Seo
    BMC Bioinformatics, 16
  • [10] XCluSim: a visual analytics tool for interactively comparing multiple clustering results of bioinformatics data
    L'Yi, Sehi
    Ko, Bongkyung
    Shin, DongHwa
    Cho, Young-Joon
    Lee, Jaeyong
    Kim, Bohyoung
    Seo, Jinwook
    BMC BIOINFORMATICS, 2015, 16