Correntropy-Based Hypergraph Regularized NMF for Clustering and Feature Selection on Multi-Cancer Integrated Data

被引:87
|
作者
Yu, Na [1 ]
Wu, Ming-Juan [1 ]
Liu, Jin-Xing [1 ]
Zheng, Chun-Hou [1 ]
Xu, Yong [2 ]
机构
[1] Qufu Normal Univ, Sch Informat Sci & Engn, Rizhao 276826, Peoples R China
[2] Harbin Inst Technol, Biocomp Res Ctr, Shenzhen 518055, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Robustness; Cancer; Linear programming; Gene expression; Manifolds; Optimization; Correntropy; clustering; feature selection; hypergraph regularization; non-negative matrix factorization (NMF); NONNEGATIVE MATRIX FACTORIZATION; EXPRESSION; STAT3; HEAD; ANGIOGENESIS; MINIMIZATION; ACTIVATION; CARCINOMA; PROTEIN; GENES;
D O I
10.1109/TCYB.2020.3000799
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Non-negative matrix factorization (NMF) has become one of the most powerful methods for clustering and feature selection. However, the performance of the traditional NMF method severely degrades when the data contain noises and outliers or the manifold structure of the data is not taken into account. In this article, a novel method called correntropy-based hypergraph regularized NMF (CHNMF) is proposed to solve the above problem. Specifically, we use the correntropy instead of the Euclidean norm in the loss term of CHNMF, which will improve the robustness of the algorithm. And the hypergraph regularization term is also applied to the objective function, which can explore the high-order geometric information in more sample points. Then, the half-quadratic (HQ) optimization technique is adopted to solve the complex optimization problem of CHNMF. Finally, extensive experimental results on multi-cancer integrated data indicate that the proposed CHNMF method is superior to other state-of-the-art methods for clustering and feature selection.
引用
收藏
页码:3952 / 3963
页数:12
相关论文
共 50 条
  • [31] Time Series Clustering: A Complex Network-Based Approach for Feature Selection in Multi-Sensor Data
    Bonacina, Fabrizio
    Miele, Eric Stefan
    Corsini, Alessandro
    MODELLING, 2020, 1 (01):
  • [32] Embedded Feature Selection on Graph-Based Multi-View Clustering
    Zhao, Wenhui
    Li, Guangfei
    Yang, Haizhou
    Gao, Quanxue
    Wang, Qianqian
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 15, 2024, : 17016 - 17023
  • [33] Band Selection Algorithm Based on Multi-Feature and Affinity Propagation Clustering
    Zhuang, Junbin
    Chen, Wenying
    Huang, Xunan
    Yan, Yunyi
    REMOTE SENSING, 2025, 17 (02)
  • [34] Query-based Multi-document Summarization using Non-negative Semantic Feature and NMF Clustering
    Park, Sun
    Cha, ByungRae
    NCM 2008: 4TH INTERNATIONAL CONFERENCE ON NETWORKED COMPUTING AND ADVANCED INFORMATION MANAGEMENT, VOL 2, PROCEEDINGS, 2008, : 609 - 614
  • [35] A fast gene selection method for multi-cancer classification using multiple support vector data description
    Cao, Jin
    Zhang, Li
    Wang, Bangjun
    Li, Fanzhang
    Yang, Jiwen
    JOURNAL OF BIOMEDICAL INFORMATICS, 2015, 53 : 381 - 389
  • [36] Credit Risk Evaluation Based on Data Mining and Integrated Feature Selection
    Deng, Yuanjie
    Wei, Ying
    Li, Yujun
    2020 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (IEEE ICSPCC 2020), 2020,
  • [37] Graph-based unsupervised feature selection and multiview clustering for microarray data
    Swarnkar, Tripti
    Mitra, Pabitra
    JOURNAL OF BIOSCIENCES, 2015, 40 (04) : 755 - 767
  • [38] A GA-based Feature Selection for High-dimensional Data Clustering
    Sun, Mei
    Xiong, Langhuan
    Sun, Haojun
    Jiang, Dazhi
    THIRD INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTING, 2009, : 769 - 772
  • [39] Clustering Mining Method of College Students' Employment Data Based on Feature Selection
    Qi, Mei-bin
    ADVANCED HYBRID INFORMATION PROCESSING, PT I, 2022, 416 : 105 - 115
  • [40] Graph-based unsupervised feature selection and multiview clustering for microarray data
    Tripti Swarnkar
    Pabitra Mitra
    Journal of Biosciences, 2015, 40 : 755 - 767