Kernel conditional clustering and kernel conditional semi-supervised learning

被引:2
|
作者
He, Xiao [1 ,2 ]
Gumbsch, Thomas [1 ,2 ]
Roqueiro, Damian [1 ,2 ]
Borgwardt, Karsten [1 ,2 ]
机构
[1] Swiss Fed Inst Technol, Dept Biosyst Sci & Engn, Basel, Switzerland
[2] Swiss Inst Bioinformat, Basel, Switzerland
关键词
Conditional clustering; Conditional semi-supervised learning; Conditional dependence measure; Alternative clustering; Label propagation;
D O I
10.1007/s10115-019-01334-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The results of clustering are often affected by covariates that are independent of the clusters one would like to discover. Traditionally, alternative clustering algorithms can be used to solve such clustering problems. However, these suffer from at least one of the following problems: (1) Continuous covariates or nonlinearly separable clusters cannot be handled; (2) assumptions are made about the distribution of the data; (3) one or more hyper-parameters need to be set. The presence of covariates also has an effect in a different type of problem such as semi-supervised learning. To the best of our knowledge, there is no existing method addressing the semi-supervised learning setting in the presence of covariates. Here we propose two novel algorithms, named kernel conditional clustering (KCC) and kernel conditional semi-supervised learning (KCSSL), whose objectives are derived from a kernel-based conditional dependence measure. KCC is parameter-light and makes no assumptions about the cluster structure, the covariates, or the distribution of the data, while KCSSL is fully parameter-free. On both simulated and real-world datasets, the proposed KCC and KCSSL algorithms perform better than state-of-the-art methods. The former detects the ground truth cluster structures more accurately, and the latter makes more accurate predictions.
引用
收藏
页码:899 / 925
页数:27
相关论文
共 50 条
  • [1] Kernel conditional clustering and kernel conditional semi-supervised learning
    Xiao He
    Thomas Gumbsch
    Damian Roqueiro
    Karsten Borgwardt
    [J]. Knowledge and Information Systems, 2020, 62 : 899 - 925
  • [2] Scalable semi-supervised clustering by spectral kernel learning
    Baghshah, M. Soleymani
    Afsari, F.
    Shouraki, S. Bagheri
    Eslami, E.
    [J]. PATTERN RECOGNITION LETTERS, 2014, 45 : 161 - 171
  • [3] Semi-supervised clustering with metric learning: An adaptive kernel method
    Yin, Xuesong
    Chen, Songcan
    Hu, Enliang
    Zhang, Daoqiang
    [J]. PATTERN RECOGNITION, 2010, 43 (04) : 1320 - 1333
  • [4] A Novel Multiple Kernel Learning Approach for Semi-Supervised Clustering
    Zare, T.
    Sadeghi, M. T.
    Abutalebi, H. R.
    [J]. 2013 8TH IRANIAN CONFERENCE ON MACHINE VISION & IMAGE PROCESSING (MVIP 2013), 2013, : 451 - 456
  • [5] Kernel-based metric learning for semi-supervised clustering
    Baghshah, Mahdieh Soleymani
    Shouraki, Saeed Bagheri
    [J]. NEUROCOMPUTING, 2010, 73 (7-9) : 1352 - 1361
  • [6] Kernel Conditional Clustering
    He, Xiao
    Gumbsch, Thomas
    Roqueiro, Damian
    Borgwardt, Karsten
    [J]. 2017 17TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2017, : 157 - 166
  • [7] Quantum semi-supervised kernel learning
    Seyran Saeedi
    Aliakbar Panahi
    Tom Arodz
    [J]. Quantum Machine Intelligence, 2021, 3
  • [8] Semi-Supervised Kernel Mean Shift Clustering
    Anand, Saket
    Mittal, Sushil
    Tuzel, Oncel
    Meer, Peter
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2014, 36 (06) : 1201 - 1215
  • [9] Quantum semi-supervised kernel learning
    Saeedi, Seyran
    Panahi, Aliakbar
    Arodz, Tom
    [J]. QUANTUM MACHINE INTELLIGENCE, 2021, 3 (02)
  • [10] Semi-supervised graph clustering: a kernel approach
    Kulis, Brian
    Basu, Sugato
    Dhillon, Inderjit
    Mooney, Raymond
    [J]. MACHINE LEARNING, 2009, 74 (01) : 1 - 22