Semi supervised approach towards subspace clustering

Cited by: 5
Authors
Harikumar, Sandhya [1]
Akhil, A. S. [1]
Affiliations
[1] Amrita Vishwa Vidyapeetham, Dept Comp Sci & Engn, Amritapuri, India
Keywords
Subspace clustering; semi-supervised; information gain; entropy
DOI
10.3233/JIFS-169456
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
High-dimensional data analysis has become unavoidable owing to emerging technologies in domains such as finance, healthcare, genomics, and signal processing. Although the data sets generated in these domains are high-dimensional, the intrinsic dimensions that carry meaningful information are often far fewer. Conventionally, unsupervised methods known as subspace clustering find clusters in different subspaces of high-dimensional data by identifying relevant features, irrespective of any labels associated with the instances. Available label information, if incorporated into the clustering algorithm, can bias it towards solutions more consistent with prior knowledge, leading to improved cluster quality. We therefore propose Information Gain based Semi-supervised subspace Clustering (IGSC), which identifies a subset of important attributes based on the known label of each data instance. The label information is integrated into the subspace search strategy and leveraged in a model-based clustering approach. Our experiments on 13 real-world labeled data sets demonstrate the feasibility of IGSC, and we validate the resulting clusters using an improvised Davies-Bouldin Index (DBI) for semi-supervised clusters.
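The abstract's idea of ranking attributes by how much they reveal about the known labels can be illustrated with a minimal sketch. This is not the authors' IGSC implementation: it assumes discrete attribute values, and the function names (entropy, information_gain, rank_attributes) and the toy data are hypothetical, chosen only for illustration.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())


def information_gain(values, labels):
    """Reduction in label entropy after splitting on one discrete attribute."""
    groups = {}
    for value, label in zip(values, labels):
        groups.setdefault(value, []).append(label)
    # Weighted entropy of the labels within each attribute value.
    remainder = sum(len(g) / len(labels) * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


def rank_attributes(rows, labels):
    """Return attribute indices sorted by decreasing gain, plus the gains."""
    n_attrs = len(rows[0])
    gains = [information_gain([row[j] for row in rows], labels)
             for j in range(n_attrs)]
    order = sorted(range(n_attrs), key=lambda j: gains[j], reverse=True)
    return order, gains


# Toy data (hypothetical): attribute 0 determines the label, attribute 1 does not.
rows = [("a", "x"), ("a", "y"), ("b", "x"), ("b", "y")]
labels = [0, 0, 1, 1]
order, gains = rank_attributes(rows, labels)
print(order, gains)
```

In this toy case attribute 0 separates the two classes completely while attribute 1 carries no label information, so a label-guided subspace search would retain the former and discard the latter. How IGSC thresholds the gains and feeds the selected subspace into model-based clustering is described in the paper itself, not here.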
Pages: 1619-1629
Page count: 11
Related papers
Showing 10 of 50
  • [1] SISC: A Text Classification Approach Using Semi Supervised Subspace Clustering
    Ahmed, Mohammad Salim
    Khan, Latifur
    [J]. 2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009 : 1 - 6
  • [2] Exploiting constraint inconsistence for dimension selection in subspace clustering: A semi-supervised approach
    Zhang, Xianchao
    Qiu, Yang
    Wu, Yao
    [J]. NEUROCOMPUTING, 2011, 74 (17) : 3598 - 3608
  • [3] Semi-supervised sparse subspace clustering with manifold regularization
    Xing, Zhiwei
    Peng, Jigen
    He, Xingshi
    Tian, Mengnan
    [J]. APPLIED INTELLIGENCE, 2024, 54 (9-10) : 6836 - 6845
  • [4] Unified Discriminative and Coherent Semi-Supervised Subspace Clustering
    Wang, Weiwei
    Yang, Chunyu
    Chen, Huazhu
    Feng, Xiangchu
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (05) : 2461 - 2470
  • [5] A splitting method for the locality regularized semi-supervised subspace clustering
    Liang, Renli
    Bai, Yanqin
    Lin, Hai Xiang
    [J]. OPTIMIZATION, 2020, 69 (05) : 1069 - 1096
  • [6] Spectral clustering: A semi-supervised approach
    Chen, Weifu
    Feng, Guocan
    [J]. NEUROCOMPUTING, 2012, 77 (01) : 229 - 242
  • [7] A SUPERVISORY APPROACH TO SEMI-SUPERVISED CLUSTERING
    Conroy, Bryan
    Xi, Yongxin Taylor
    Ramadge, Peter
    [J]. 2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010 : 1858 - 1861
  • [8] Towards an approach using metric learning for interactive semi-supervised clustering of images
    Viet Minh Vu
    Hien Phuong Lai
    Visani, Muriel
    [J]. 2016 EIGHTH INTERNATIONAL CONFERENCE ON KNOWLEDGE AND SYSTEMS ENGINEERING (KSE), 2016 : 357 - 362
  • [9] A new semi-supervised subspace clustering algorithm on fitting mixture models
    Kim, YB
    Gao, J
    [J]. PROCEEDINGS OF THE 2005 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2005 : 208 - 215
  • [10] Distance metric learning guided adaptive subspace semi-supervised clustering
    Yin, Xuesong
    Hu, Enliang
    [J]. FRONTIERS OF COMPUTER SCIENCE IN CHINA, 2011, 5 (01) : 100 - 108