Kernelized evolutionary distance metric learning for semi-supervised clustering

被引:4
|
作者
Kalintha, Wasin [1 ]
Ono, Satoshi [2 ]
Numao, Masayuki [3 ]
Fukui, Ken-ichi [3 ]
机构
[1] Osaka Univ, Grad Sch Informat Sci & Technol, 8-1 Mihogaoka, Ibaraki, Osaka 5670047, Japan
[2] Kagoshima Univ, Grad Sch Sci & Engn, Kagoshima, Japan
[3] Osaka Univ, Inst Sci & Ind Res, Osaka, Japan
关键词
Clustering; neighbor graph; cluster validity index; distance metric learning; kernelization; differential evolution; ADAPTING CONTROL PARAMETERS; DIFFERENTIAL EVOLUTION; DIMENSIONALITY REDUCTION; OPTIMIZATION;
D O I
10.3233/IDA-184283
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study proposes a novel distance metric learning method called evolutionary distance metric learning (EDML) to improve clustering quality that simultaneously evaluates inter- and intra-clusters. While we also provide an extension which integrates kernelization technique to the proposed method namely kernelized evolutionary distance metric learning (K-EDML). Hence, the non-linear transformation of distance metric can be performed while maintaining all properties of EDML. The proposed methods are able to handle either class label or pairwise constraints and directly improve any clustering index as an objective function. Both can be viewed as utilization of cluster-level soft constraints, unlike other instance-level hard constraints which sometimes collapse the clustering. Also, maintaining neighbor relation of clusters can lead to better visualization of the clustering result. For multimodality problem of the objective function, an evolutionary algorithm (EA), differential evolution with self-adapting control parameters and generalized opposition-based learning (GOjDE), is employed to optimize a metric transform matrix based on the Mahalanobis distance. We empirically demonstrate the drawback of EDML in non-linearly separable input space and illustrate the benefit of kernel function to extension K-EDML method by showing its superior result benefits to other clustering algorithms in the semi-supervised clustering on various real-world datasets.
引用
收藏
页码:1271 / 1297
页数:27
相关论文
共 50 条
  • [1] Kernelized Evolutionary Distance Metric Learning for Semi-Supervised Clustering
    Kalintha, Wasin
    Ono, Satoshi
    Numao, Masayuki
    Fukui, Ken-ichi
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 4945 - 4946
  • [2] Evolutionary Distance Metric Learning Approach to Semi-Supervised Clustering with Neighbor Relations
    Fukui, Ken-ichi
    Ono, Satoshi
    Megano, Taishi
    Numao, Masayuki
    [J]. 2013 IEEE 25TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI), 2013, : 398 - 403
  • [3] Semi-supervised distributed clustering with Mahalanobis distance metric learning
    Yuecheng Y.
    Jiandong W.
    Guansheng Z.
    Bin G.
    [J]. International Journal of Digital Content Technology and its Applications, 2010, 4 (09) : 132 - 140
  • [4] Distance metric learning guided adaptive subspace semi-supervised clustering
    Yin, Xuesong
    Hu, Enliang
    [J]. FRONTIERS OF COMPUTER SCIENCE IN CHINA, 2011, 5 (01): : 100 - 108
  • [5] Semi-Supervised Distance Metric Learning for Collaborative Image Retrieval and Clustering
    Hoi, Steven C. H.
    Liu, Wei
    Chang, Shih-Fu
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2010, 6 (03)
  • [6] Distance metric learning guided adaptive subspace semi-supervised clustering
    Xuesong Yin
    Enliang Hu
    [J]. Frontiers of Computer Science in China, 2011, 5 : 100 - 108
  • [7] Semi-supervised Clustering with Deep Metric Learning
    Li, Xiaocui
    Yin, Hongzhi
    Zhou, Ke
    Chen, Hongxu
    Sadiq, Shazia
    Zhou, Xiaofang
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2019, 11448 : 383 - 386
  • [8] A semi-supervised multiview spectral clustering algorithm based on distance metric learning
    Yang J.
    Deng T.
    [J]. Sichuan Daxue Xuebao (Gongcheng Kexue Ban)/Journal of Sichuan University (Engineering Science Edition), 2016, 48 (01): : 146 - 151
  • [9] LEARNING DISTANCE METRIC FOR SEMI-SUPERVISED IMAGE SEGMENTATION
    Jia, Yangqing
    Zhang, Changshui
    [J]. 2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5, 2008, : 3204 - 3207
  • [10] Semi-supervised hybrid clustering by integrating Gaussian mixture model and distance metric learning
    Zhang, Yihao
    Wen, Junhao
    Wang, Xibin
    Jiang, Zhuo
    [J]. JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2015, 45 (01) : 113 - 130