Self-organizing subspace clustering for high-dimensional and multi-view data

Cited by: 20
Authors
Araujo, Aluizio F. R. [1 ]
Antonino, Victor O. [1 ]
Ponce-Guevara, Karina L. [1 ]
Affiliation
[1] Univ Fed Pernambuco, Ctr Informat, BR-50740560 Recife, PE, Brazil
Keywords
Subspace clustering; Multi-view clustering; High-dimensional data; Self-organizing maps; ALGORITHM; SEGMENTATION; CANCER; MODEL;
DOI
10.1016/j.neunet.2020.06.022
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A surge in the availability of data from multiple sources and modalities is correlated with advances in how to obtain, compress, store, transfer, and process large amounts of complex high-dimensional data. Clustering becomes harder as data dimensionality grows, because higher dimensionality reduces the discriminative power of distance metrics. Subspace clustering aims to group data drawn from a union of subspaces. There are many state-of-the-art approaches to this problem, and we divide them into families according to the clustering method used. We introduce a soft subspace clustering algorithm, a Self-organizing Map (SOM) with a time-varying structure, that clusters data without any prior knowledge of the number of categories or of the neural network topology, both of which are determined during training. The model also assigns relevancies (weights) to the different dimensions, capturing from the learning process the influence of each dimension on uncovering clusters. We employ a number of real-world datasets to validate the model. The algorithm performs competitively in a diverse range of contexts, among them data mining, gene expression, multi-view, computer vision, and text clustering problems, which include high-dimensional data. Extensive experiments suggest that our method very often outperforms the state-of-the-art approaches in all types of problems considered. (C) 2020 Elsevier Ltd. All rights reserved.
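The abstract's point about per-dimension relevancies can be illustrated with a small sketch. This is not the authors' algorithm: the function name, the prototypes, the sample, and the relevance values below are all hypothetical, and the only idea borrowed from the abstract is that each dimension carries a weight that scales its contribution to the distance. It shows how irrelevant dimensions blur the contrast between a near and a far prototype, and how down-weighting them restores it.

```python
import numpy as np

def weighted_distance(x, prototype, relevance):
    """Euclidean distance with each squared difference scaled by a relevance weight."""
    diff = np.asarray(x, dtype=float) - np.asarray(prototype, dtype=float)
    return float(np.sqrt(np.sum(np.asarray(relevance, dtype=float) * diff ** 2)))

# Hypothetical setup: only dimension 0 separates the two prototypes;
# dimensions 1 to 3 carry noise.
proto_near = np.array([0.0, 0.0, 0.0, 0.0])
proto_far = np.array([1.0, 0.0, 0.0, 0.0])
x = np.array([0.1, 0.9, -0.8, 0.7])

uniform = np.ones(4)                        # every dimension counts equally
focused = np.array([1.0, 0.0, 0.0, 0.0])    # assumed relevance: only dimension 0 matters

d0_uniform = weighted_distance(x, proto_near, uniform)
d1_uniform = weighted_distance(x, proto_far, uniform)
d0_focused = weighted_distance(x, proto_near, focused)
d1_focused = weighted_distance(x, proto_far, focused)

# The far/near ratio measures how clearly the distance discriminates.
print("uniform contrast:", d1_uniform / d0_uniform)
print("focused contrast:", d1_focused / d0_focused)
```

In this toy setting the relevance vector is fixed by hand; the abstract states that the model instead captures these relevancies from the learning process.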
Pages: 253-268
Page count: 16
Related papers
50 in total
  • [1] Adaptive multi-view subspace clustering for high-dimensional data
    Yan, Fei
    Wang, Xiao-dong
    Zeng, Zhi-qiang
    Hong, Chao-qun
    [J]. PATTERN RECOGNITION LETTERS, 2020, 130 : 299 - 305
  • [2] A Self-Organizing Tensor Architecture for Multi-View Clustering
    He, Lifang
    Lu, Chun-Ta
    Chen, Yong
    Zhang, Jiawei
    Shen, Linlin
    Yu, Philip S.
    Wang, Fei
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 1007 - 1012
  • [3] Self-Organizing Map for Multi-view Text Clustering
    Fraj, Maha
    Ben Hajkacem, Mohamed Aymen
    Essoussi, Nadia
    [J]. BIG DATA ANALYTICS AND KNOWLEDGE DISCOVERY (DAWAK 2020), 2020, 12393 : 396 - 408
  • [4] Multi Self-Organizing Map (SOM) Pipeline Architecture for Multi-View Clustering
    Jamil, Saadia
    Rehman, Eid
    Shahzad, Tariq
    Ishtiaq, Muhammad
    Mazhar, Tehseen
    Yasin Ghadi, Yazeed
    Ahmed, Arfan
    [J]. IEEE ACCESS, 2024, 12 : 85806 - 85821
  • [5] Self-representation Subspace Clustering for Incomplete Multi-view Data
    Liu, Jiyuan
    Liu, Xinwang
    Zhang, Yi
    Zhang, Pei
    Tu, Wenxuan
    Wang, Siwei
    Zhou, Sihang
    Liang, Weixuan
    Wang, Siqi
    Yang, Yuexiang
    [J]. PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 2726 - 2734
  • [6] Using self-organizing maps to visualize high-dimensional data
    Penn, BS
    [J]. COMPUTERS & GEOSCIENCES, 2005, 31 (05) : 531 - 544
  • [7] Multi-View Subspace Clustering
    Gao, Hongchang
    Nie, Feiping
    Li, Xuelong
    Huang, Heng
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4238 - 4246
  • [8] Visualizing high-dimensional input data with growing self-organizing maps
    Delgado, Soledad
    Gonzalo, Consuelo
    Martinez, Estibaliz
    Arquero, Agueda
    [J]. COMPUTATIONAL AND AMBIENT INTELLIGENCE, 2007, 4507 : 580 - +
  • [9] Subspace selection for clustering high-dimensional data
    Baumgartner, C
    Plant, C
    Kailing, K
    Kriegel, HP
    Kröger, P
    [J]. FOURTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2004, : 11 - 18
  • [10] Local Adaptive Receptive Field Dimension Selective Self-Organizing Map for Multi-View Clustering
    Antonino, Victor O.
    Araujo, Aluizio F. R.
    [J]. 2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 698 - 705