Visual analytics for the clustering capability of data

Cited by: 0
Authors
ZhiMao Lu
Chen Liu
Qi Zhang
ChunXiang Zhang
DongMei Fan
Peng Yang
Affiliations
[1] Harbin Engineering University,Pattern Recognition and Natural Computation Laboratory
[2] Dalian University of Technology,School of Computer Science and Technology
[3] Harbin University of Science and Technology,School of Software
Source
Keywords
data mining; clustering analysis; visual analysis; minimum distance spectrum; nearest neighbor spectrum; outliers
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Clustering analysis is an unsupervised method for finding hidden structure in datasets and is widely used in many fields. However, users often find it difficult to understand, evaluate, and explain clustering results in spaces with more than three dimensions. Although high-dimensional visualization techniques can express clustering results well, they still have significant limitations. This paper proposes a visual cluster analysis method based on the minimum distance spectrum (MinDS), aimed at reducing the difficulty of clustering multidimensional datasets. First, the concept of MinDS is defined based on the distances between high-dimensional data points. MinDS can map any dataset from a high-dimensional space to a lower-dimensional one to determine whether the dataset is separable. Next, a clustering method that automatically determines the number of categories is designed based on MinDS. This method can cluster not only datasets with clear boundaries but also datasets with fuzzy boundaries, using an edge corrosion strategy based on the energy of each data point. In addition, strategies for removing noise and identifying outliers are designed to clean datasets according to the characteristics of MinDS. The experimental results validate the feasibility and effectiveness of the proposed schemes and show that the approach is simple, stable, and efficient, and can achieve multidimensional visual cluster analysis of complex datasets.
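The abstract does not give the formal definition of MinDS, so the following is only an illustrative sketch of the general idea of reducing a high-dimensional dataset to a one-dimensional sequence of minimum distances. It assumes a Prim-style visiting order in which each newly visited point contributes its distance to the nearest already-visited point; the function names `min_distance_spectrum` and `count_groups`, the threshold value, and the sample points are all hypothetical and are not taken from the paper.

```python
import math


def min_distance_spectrum(points, start=0):
    """Visit points in nearest-to-visited-set order (Prim-style).

    Returns (order, spectrum). spectrum[k] is the Euclidean distance from
    the k-th visited point to its nearest previously visited point, with
    spectrum[0] = 0.0. This is an assumed stand-in, not the paper's MinDS.
    """
    n = len(points)
    # best[i]: distance from point i to the closest visited point so far
    best = [math.dist(points[start], p) for p in points]
    visited = [False] * n
    visited[start] = True
    order, spectrum = [start], [0.0]
    for _ in range(n - 1):
        # pick the unvisited point closest to the visited set
        nxt = min((i for i in range(n) if not visited[i]),
                  key=best.__getitem__)
        order.append(nxt)
        spectrum.append(best[nxt])
        visited[nxt] = True
        # refresh distances of the remaining points against the new point
        for i in range(n):
            if not visited[i]:
                best[i] = min(best[i], math.dist(points[nxt], points[i]))
    return order, spectrum


def count_groups(spectrum, threshold):
    """Count groups as one plus the number of spikes above the threshold."""
    return 1 + sum(1 for d in spectrum if d > threshold)


# Two well-separated groups in 4-D (made-up sample data)
points = [
    (0.0, 0.0, 0.0, 0.0), (0.1, 0.0, 0.0, 0.0), (0.0, 0.2, 0.0, 0.0),
    (5.0, 5.0, 5.0, 5.0), (5.1, 5.0, 5.0, 5.0), (5.0, 5.0, 5.2, 5.0),
]
order, spectrum = min_distance_spectrum(points)
print(order)
print([round(d, 2) for d in spectrum])
print(count_groups(spectrum, threshold=1.0))
```

Plotting `spectrum` against the visiting index gives a 1-D picture in which a tall spike marks a jump between groups, which is one way a dataset of any dimension could be inspected for separability. The fixed threshold is a simplification; the abstract's edge corrosion, noise removal, and outlier strategies are not modeled here.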
Pages: 1 / 14
Page count: 13
Related papers
50 in total
  • [31] Visual Analytics for High Dimensional Data
    Inselberg, Alfred
    Anthopoulos, Leonidas G.
    WWW'17 COMPANION: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2017, : 1683 - 1687
  • [32] Visual Analytics for Root DNS Data
    Krokos, Eric
    Rowden, Alexander
    Whitley, Kirsten
    Varshney, Amitabh
    2018 IEEE SYMPOSIUM ON VISUALIZATION FOR CYBER SECURITY (VIZSEC 2018), 2018,
  • [33] Continuous Clustering in Big Data Learning Analytics
    Govindarajan, Kannan
    Somasundaram, Thamarai Selvi
    Kumar, Vivekanandan S.
    Kinshuk
    2013 IEEE FIFTH INTERNATIONAL CONFERENCE ON TECHNOLOGY FOR EDUCATION (T4E 2013), 2013, : 61 - 64
  • [34] Big Data Analytics Capability Ecosystem Model for SMEs
    Falahat, Mohammad
    Cheah, Phaik Kin
    Jayabalan, Jayamalathi
    Lee, Corrinne Mei Jyin
    Kai, Sia Bik
    SUSTAINABILITY, 2023, 15 (01)
  • [35] BIG DATA ANALYTICS AS A STRATEGIC CAPABILITY: A SYSTEMATIC REVIEW
    Bogdan, Mihai
    Borza, Anca
    PROCEEDINGS OF THE 13TH INTERNATIONAL MANAGEMENT CONFERENCE: MANAGEMENT STRATEGIES FOR HIGH PERFORMANCE (IMC 2019), 2019, : 575 - 583
  • [36] Big Data Analytics: A Key Capability for Competitive Advantage
    Bedeley, Rudolph T.
    Nemati, Hamid
    AMCIS 2014 PROCEEDINGS, 2014,
  • [37] Big Data, Big Data Analytics Capability, and Sustainable Innovation Performance
    Hao, Shengbin
    Zhang, Haili
    Song, Michael
    SUSTAINABILITY, 2019, 11 (24)
  • [38] Visual analytics of genealogy with attribute-enhanced topological clustering
    Sun, Ling
    Zhang, Xiang
    Pan, Xiaan
    Liu, Yuhua
    Yu, Wanghao
    Xu, Ting
    Liu, Fang
    Chen, Weifeng
    Wang, Yigang
    Su, Weihua
    Zhou, Zhiguang
    JOURNAL OF VISUALIZATION, 2022, 25 (02) : 361 - 377
  • [39] Visual analytics of genealogy with attribute-enhanced topological clustering
    Ling Sun
    Xiang Zhang
    Xiaan Pan
    Yuhua Liu
    Wanghao Yu
    Ting Xu
    Fang Liu
    Weifeng Chen
    Yigang Wang
    Weihua Su
    Zhiguang Zhou
    Journal of Visualization, 2022, 25 : 361 - 377
  • [40] Towards a Systematic Combination of Dimension Reduction and Clustering in Visual Analytics
    Wenskovitch, John
    Crandell, Ian
    Ramakrishnan, Naren
    House, Leanna
    Leman, Scotland
    North, Chris
    IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2018, 24 (01) : 131 - 141