Interactive Exploration of Subspace Clusters for High Dimensional Data

被引:0
|
作者
Kristensen, Jesper [1 ]
Mai, Son T. [1 ,3 ]
Assent, Ira [1 ]
Jacobsen, Jon [1 ]
Bay Vo [2 ]
Anh Le [3 ]
机构
[1] Aarhus Univ, Dept Comp Sci, Aarhus, Denmark
[2] Ho Chi Minh City Univ Technol, Fac Informat Technol, Ho Chi Minh City, Vietnam
[3] Univ Transport, Dept Comp Sci, Ho Chi Minh City, Vietnam
关键词
Subspace clustering; Interactive clustering;
D O I
10.1007/978-3-319-64468-4_25
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
PreDeCon is a fundamental clustering algorithm for finding arbitrarily shaped clusters hidden in high-dimensional feature spaces of data, which is an important research topic and has many potential applications. However, it suffers from very high runtime as well as lack of interactions with users. Our algorithm, called AnyPDC, introduces a novel approach to cope with these problems by casting PreDeCon into an anytime algorithm. It quickly produces an approximate result and iteratively refines it toward the result of PreDeCon at the end. This scheme not only significantly speeds up the algorithm but also provides interactions with users during its execution. Experiments conducted on real large datasets show that AnyPDC acquires good approximate results very early, leading to an order of magnitude speedup factor compared to PreDeCon. More interestingly, while anytime techniques usually end up slower than batch ones, AnyPDC is faster than PreDeCon even if it run to the end.
引用
收藏
页码:327 / 342
页数:16
相关论文
共 50 条
  • [1] Dimension Reconstruction for Visual Exploration of Subspace Clusters in High-dimensional Data
    Zhou, Fangfang
    Li, Juncai
    Huang, Wei
    Zhao, Ying
    Yuan, Xiaoru
    Liang, Xing
    Shi, Yang
    [J]. 2016 IEEE PACIFIC VISUALIZATION SYMPOSIUM (PACIFICVIS), 2016, : 128 - 135
  • [2] Dimension Projection Matrix/Tree: Interactive Subspace Visual Exploration and Analysis of High Dimensional Data
    Yuan, Xiaoru
    Ren, Donghao
    Wang, Zuchao
    Guo, Cong
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2013, 19 (12) : 2625 - 2633
  • [3] Discovering the Skyline of Subspace Clusters in High-Dimensional Data
    Chen, Guanhua
    Ma, Xiuli
    Yang, Dongqing
    Tang, Shiwei
    [J]. FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 2, PROCEEDINGS, 2008, : 439 - +
  • [4] Progressive subspace skyline clusters mining on high dimensional data
    Hu, Rong
    Lu, Yansheng
    Zou, Lei
    Zhou, Chong
    [J]. EMERGING TECHNOLOGIES IN KNOWLEDGE DISCOVERY AND DATA MINING, 2007, 4819 : 268 - +
  • [5] Mapper Interactive: A Scalable, Extendable, and Interactive Toolbox for the Visual Exploration of High-Dimensional Data
    Zhou, Youjia
    Chalapathi, Nithin
    Rathore, Archit
    Zhao, Yaodong
    Wang, Bei
    [J]. 2021 IEEE 14TH PACIFIC VISUALIZATION SYMPOSIUM (PACIFICVIS 2021), 2021, : 101 - 110
  • [6] Visual Exploration of High-Dimensional Data through Subspace Analysis and Dynamic Projections
    Liu, S.
    Wang, B.
    Thiagarajan, J. J.
    Bremer, P. -T.
    Pascucci, V.
    [J]. COMPUTER GRAPHICS FORUM, 2015, 34 (03) : 271 - 280
  • [7] Subspace clustering of high dimensional data
    Domeniconi, C
    Papadopoulos, D
    Gunopulos, D
    Ma, S
    [J]. Proceedings of the Fourth SIAM International Conference on Data Mining, 2004, : 517 - 521
  • [8] Targeted projection pursuit for interactive exploration of high-dimensional data sets
    Faith, Joe
    [J]. 11TH INTERNATIONAL CONFERENCE INFORMATION VISUALIZATION, 2007, : 286 - 292
  • [9] Focused multidimensional scaling: interactive visualization for exploration of high-dimensional data
    Urpa, Lea M.
    Anders, Simon
    [J]. BMC BIOINFORMATICS, 2019, 20 (1)
  • [10] Focused multidimensional scaling: interactive visualization for exploration of high-dimensional data
    Lea M. Urpa
    Simon Anders
    [J]. BMC Bioinformatics, 20