HTRPCA: Hypergraph Regularized Tensor Robust Principal Component Analysis for Sample Clustering in Tumor Omics Data

被引:7
|
作者
Zhao, Yu-Ying [1 ]
Jiao, Cui-Na [1 ]
Wang, Mao-Li [1 ]
Liu, Jin-Xing [1 ,2 ]
Wang, Juan [1 ]
Zheng, Chun-Hou [1 ]
机构
[1] Qufu Normal Univ, Sch Comp Sci, Rizhao, Peoples R China
[2] Rizhao Huilian Zhongchuang Inst Intelligent Techn, Rizhao 276826, Peoples R China
基金
中国国家自然科学基金;
关键词
Low-rank tensor; Hypergraph; Sample clustering; Tensor robust principal component analysis; FACTORIZATION;
D O I
10.1007/s12539-021-00441-8
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In recent years, clustering analysis of cancer genomics data has gained widespread attention. However, limited by the dimensions of the matrix, the traditional methods cannot fully mine the underlying geometric structure information in the data. Besides, noise and outliers inevitably exist in the data. To solve the above two problems, we come up with a new method which uses tensor to represent cancer omics data and applies hypergraph to save the geometric structure information in original data. This model is called hypergraph regularized tensor robust principal component analysis (HTRPCA). The data processed by HTRPCA becomes two parts, one of which is a low-rank component that contains pure underlying structure information between samples, and the other is some sparse interference points. So we can use the low-rank component for clustering. This model can retain complex geometric information between more sample points due to the addition of the hypergraph regularization. Through clustering, we can demonstrate the effectiveness of HTRPCA, and the experimental results on TCGA datasets demonstrate that HTRPCA precedes other advanced methods. [GRAPHICS]
引用
收藏
页码:22 / 33
页数:12
相关论文
共 50 条
  • [21] Approximate Bayesian Algorithm for Tensor Robust Principal Component Analysis
    Srakar, Andrej
    [J]. NEW FRONTIERS IN BAYESIAN STATISTICS, BAYSM 2021, 2022, 405 : 1 - 9
  • [22] Latent graph-regularized inductive robust principal component analysis
    Wei, Lai
    Zhou, Rigui
    Yin, Jun
    Zhu, Changming
    Zhang, Xiafen
    Liu, Hao
    [J]. KNOWLEDGE-BASED SYSTEMS, 2019, 177 : 68 - 81
  • [23] Robust principal component analysis for functional data
    N. Locantore
    J. S. Marron
    D. G. Simpson
    N. Tripoli
    J. T. Zhang
    K. L. Cohen
    Graciela Boente
    Ricardo Fraiman
    Babette Brumback
    Christophe Croux
    Jianqing Fan
    Alois Kneip
    John I. Marden
    Daniel Peña
    Javier Prieto
    Jim O. Ramsay
    Mariano J. Valderrama
    Ana M. Aguilera
    N. Locantore
    J. S. Marron
    D. G. Simpson
    N. Tripoli
    J. T. Zhang
    K. L. Cohen
    [J]. Test, 1999, 8 (1) : 1 - 73
  • [24] Robust principal component analysis for functional data
    Peña, D
    Prieto, J
    [J]. TEST, 1999, 8 (01) : 56 - 60
  • [25] Integrative and regularized principal component analysis of multiple sources of data
    Liu, Binghui
    Shen, Xiaotong
    Pan, Wei
    [J]. STATISTICS IN MEDICINE, 2016, 35 (13) : 2235 - 2250
  • [26] Recovery of Corrupted Data in Wireless Sensor Networks Using Tensor Robust Principal Component Analysis
    Zhang, Xiaoyue
    He, Jingfei
    Li, Yunpei
    Chi, Yue
    Zhou, Yatong
    [J]. IEEE COMMUNICATIONS LETTERS, 2021, 25 (10) : 3389 - 3393
  • [27] Tensor Robust Principal Component Analysis via Tensor Fibered Rank and lp Minimization
    Gao, Kaixin
    Huang, Zheng-Hai
    [J]. SIAM JOURNAL ON IMAGING SCIENCES, 2023, 16 (01): : 423 - 460
  • [28] Robust hypergraph regularized non-negative matrix factorization for sample clustering and feature selection in multi-view gene expression data
    Yu, Na
    Gao, Ying-Lian
    Liu, Jin-Xing
    Wang, Juan
    Shang, Junliang
    [J]. HUMAN GENOMICS, 2019, 13 (Suppl 1) : 46
  • [29] Robust hypergraph regularized non-negative matrix factorization for sample clustering and feature selection in multi-view gene expression data
    Na Yu
    Ying-Lian Gao
    Jin-Xing Liu
    Juan Wang
    Junliang Shang
    [J]. Human Genomics, 13
  • [30] A random version of principal component analysis in data clustering
    Palese, Luigi Leonardo
    [J]. COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2018, 73 : 57 - 64