Projected Affinity Values for Nystrom Spectral Clustering

被引:1
|
作者
He, Li [1 ]
Zhu, Haifei [1 ]
Zhang, Tao [1 ]
Yang, Honghong [2 ,3 ]
Guan, Yisheng [1 ]
机构
[1] Guangdong Univ Technol, Dept Electromech Engn, Guangzhou 510006, Guangdong, Peoples R China
[2] Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2R3, Canada
[3] Northwestern Polytech Univ, Sch Automat, Xian 710072, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Nystrom approximation; out-of-sample; empirical affinity; machine learning;
D O I
10.3390/e20070519
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
In kernel methods, Nystrom approximation is a popular way of calculating out-of-sample extensions and can be further applied to large-scale data clustering and classification tasks. Given a new data point, Nystrom employs its empirical affinity vector, k, for calculation. This vector is assumed to be a proper measurement of the similarity between the new point and the training set. In this paper, we suggest replacing the affinity vector by its projections on the leading eigenvectors learned from the training set, i.e., using k* = Sigma(c)(i=1)k(T)u(i)u(i) instead, where u(i) is the i-th eigenvector of the training set and c is the number of eigenvectors used, which is typically equal to the number of classes designed by users. Our work is motivated by the constraints that in kernel space, the kernel-mapped new point should (a) also lie on the unit sphere defined by the Gaussian kernel and (b) generate training set affinity values close to k. These two constraints define a Quadratic Optimization Over a Sphere (QOOS) problem. In this paper, we prove that the projection on the leading eigenvectors, rather than the original affinity vector, is the solution to the QOOS problem. The experimental results show that the proposed replacement of k by k* slightly improves the performance of the Nystrom approximation. Compared with other affinity matrix modification methods, our k* obtains comparable or higher clustering performance in terms of accuracy and Normalized Mutual Information (NMI).
引用
收藏
页数:17
相关论文
共 50 条
  • [41] A NYSTROM-BASED SUBTRACTIVE CLUSTERING METHOD
    Sun, Zhi-Hai
    Wei, Xue-Hui
    Zhou, Wen-Hui
    [J]. 2012 INTERNATIONAL CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (LCWAMTIP), 2012, : 74 - 77
  • [42] Dynamic Affinity Graph Construction for Spectral Clustering Using Multiple Features
    Li, Zhihui
    Nie, Feiping
    Chang, Xiaojun
    Yang, Yi
    Zhang, Chengqi
    Sebe, Nicu
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (12) : 6323 - 6332
  • [43] Research on Spectral Clustering Algorithms Based on Building Different Affinity Matrix
    Xu Degang
    Zhao Panlei
    Gui Weihua
    Yang Chunhua
    Xie Yongfang
    [J]. 2013 25TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2013, : 3160 - 3165
  • [44] Affinity Learning Via Self-Supervised Diffusion for Spectral Clustering
    Ye, Jianfeng
    Li, Qilin
    Yu, Jinlong
    Wang, Xincheng
    Wang, Huaming
    [J]. IEEE ACCESS, 2021, 9 : 7170 - 7182
  • [45] Automatic Scale Parameters in Affinity Matrix Construction for Improved Spectral Clustering
    Mohanavalli, S.
    Jaisakthi, S. M.
    Aravindan, Chandrabose
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2016, 30 (10)
  • [46] Nystrom-based spectral clustering using airborne LiDAR point cloud data for individual tree segmentation
    Pang, Yong
    Wang, Weiwei
    Du, Liming
    Zhang, Zhongjun
    Liang, Xiaojun
    Li, Yongning
    Wang, Zuyuan
    [J]. INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2021, 14 (10) : 1452 - 1476
  • [47] An improved Nystrom spectral graph clustering using k-core decomposition as a sampling strategy for large networks
    Tu, Jingzhi
    Mei, Gang
    Piccialli, Francesco
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (06) : 3673 - 3684
  • [48] Projected Gustafson Kessel Clustering
    Kumar, Naveen
    Puri, Charu
    [J]. ROUGH SETS, FUZZY SETS, DATA MINING AND GRANULAR COMPUTING, PROCEEDINGS, 2009, 5908 : 431 - 438
  • [49] Projected clustering for categorical datasets
    Kim, Minho
    Ramakrishna, R. S.
    [J]. PATTERN RECOGNITION LETTERS, 2006, 27 (12) : 1405 - 1417
  • [50] Fast algorithms for projected clustering
    Aggarwal, CC
    Procopiuc, C
    Wolf, JL
    Yu, PS
    Park, JS
    [J]. SIGMOD RECORD, VOL 28, NO 2 - JUNE 1999: SIGMOD99: PROCEEDINGS OF THE 1999 ACM SIGMOD - INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 1999, : 61 - 72