Hub-based subspace clustering

被引:3
|
作者
Mani, Priya [1 ]
Domeniconi, Carlotta [1 ]
机构
[1] George Mason Univ, Dept Comp Sci, 4400 Univ Dr MSN 4A5, Fairfax, VA 22030 USA
基金
美国国家科学基金会;
关键词
Hubness; Subspace clustering; Graph-based meta-features; Selective sampling;
D O I
10.1016/j.neucom.2020.06.098
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data often exists in subspaces embedded within a high-dimensional space. Subspace clustering seeks to group data according to the dimensions relevant to each subspace. This requires the estimation of subspaces as well as the clustering of data. Subspace clustering becomes increasingly challenging in high dimensional spaces due to the curse of dimensionality which affects reliable estimations of distances and density. Recently, another aspect of high-dimensional spaces has been observed, known as the hubness phenomenon, whereby few data points appear frequently as nearest neighbors of the rest of the data. The distribution of neighbor occurrences becomes skewed with increasing intrinsic dimensionality of the data, and few points with high neighbor occurrences emerge as hubs. Hubs exhibit useful geometric properties and have been leveraged for clustering data in the full-dimensional space. In this paper, we study hubs in the context of subspace clustering. We present new characterizations of hubs in relation to subspaces, and design graph-based meta-features to identify a subset of hubs which are well fit to serve as seeds for the discovery of local latent subspaces and clusters. We propose and evaluate a hubness-driven algorithm to find subspace clusters, and show that our approach is superior to the baselines, and is competitive against state-of-the-art subspace clustering methods. We also identify the data characteristics that make hubs suitable for subspace clustering. Such characterization gives valuable guidelines to data mining practitioners. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:193 / 209
页数:17
相关论文
共 50 条
  • [1] On Reliability Evaluation of Hub-Based Networks
    Chen, Shin-Guang
    [J]. PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON INTELLIGENT TECHNOLOGIES AND ENGINEERING SYSTEMS (ICITES2013), 2014, 293 : 1147 - 1153
  • [2] IoT Interoperability: A Hub-based Approach
    Blackstock, Michael
    Lea, Rodger
    [J]. 2014 INTERNATIONAL CONFERENCE ON THE INTERNET OF THINGS (IOT), 2014, : 79 - 84
  • [3] Hub-based vibration control of multiple rotating airfoils
    Szász, G
    Flowers, GT
    Hartfield, RJ
    [J]. JOURNAL OF PROPULSION AND POWER, 2000, 16 (06) : 1155 - 1163
  • [4] Hub-based truck platooning: Potentials and profitability
    Larsen, Rune
    Rich, Jeppe
    Rasmussen, Thomas Kjaer
    [J]. TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2019, 127 : 249 - 264
  • [5] Mining hub-based protein complexes In massrve biological networks
    Lin, Zhijie
    Chen, Yan
    Wu, Shiwei
    Xiong, Yun
    Zhu, Yangyong
    Zheng, Guangyong
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS (BIBMW), 2012,
  • [6] A Hub-Based Labeling Algorithm for Shortest Paths in Road Networks
    Abraham, Ittai
    Delling, Daniel
    Goldberg, Andrew V.
    Werneck, Renato F.
    [J]. EXPERIMENTAL ALGORITHMS, 2011, 6630 : 230 - 241
  • [7] A Hub-based Graph Management for Efficient Repetition Path Traversing
    Kusu, Kazuma
    Hatano, Kenji
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2021), 2021, : 188 - 191
  • [8] Analysis of Human Metabolic Core Using Hub-Based Centrality
    Hui, Du
    [J]. 2009 ETP INTERNATIONAL CONFERENCE ON FUTURE COMPUTER AND COMMUNICATION (FCC 2009), 2009, : 88 - 90
  • [9] Hub-based simulation and graphics hardware accelerated visualization for nanotechnology applications
    Qiao, Wei
    McLennan, Michael
    Kennell, Rick
    Ebert, David S.
    Klimeck, Gerhard
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2006, 12 (05) : 1061 - 1068
  • [10] Strategic Hub-Based Platoon Coordination Under Uncertain Travel Times
    Johansson, Alexander
    Nekouei, Ehsan
    Johansson, Karl Henrik
    Martensson, Jonas
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) : 8277 - 8287