Biclustering by sparse canonical correlation analysis

被引:5
|
作者
Pimentel, Harold [1 ]
Hu, Zhiyue [2 ]
Huang, Haiyan [2 ]
机构
[1] Univ Calif Berkeley, Dept Comp Sci, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
关键词
biclustering; SCCA; gene clusters;
D O I
10.1007/s40484-017-0127-0
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
BackgroundDeveloping appropriate computational tools to distill biological insights from large-scale gene expression data has been an important part of systems biology. Considering that gene relationships may change or only exist in a subset of collected samples, biclustering that involves clustering both genes and samples has become increasingly important, especially when the samples are pooled from a wide range of experimental conditions.MethodsIn this paper, we introduce a new biclustering algorithm to find subsets of genomic expression features (EFs) (e.g., genes, isoforms, exon inclusion) that show strong "group interactions" under certain subsets of samples. Group interactions are defined by strong partial correlations, or equivalently, conditional dependencies between EFs after removing the influences of a set of other functionally related EFs. Our new biclustering method, named SCCA-BC, extends an existing method for group interaction inference, which is based on sparse canonical correlation analysis (SCCA) coupled with repeated random partitioning of the gene expression data set.ResultsSCCA-BC gives sensible results on real data sets and outperforms most existing methods in simulations. Software is available at https://github.com/pimentel/scca-bc.ConclusionsSCCA-BC seems to work in numerous conditions and the results seem promising for future extensions. SCCA-BC has the ability to find different types of bicluster patterns, and it is especially advantageous in identifying a bicluster whose elements share the same progressive and multivariate normal distribution with a dense covariance matrix.
引用
下载
收藏
页码:56 / 67
页数:12
相关论文
共 50 条
  • [1] Biclustering by sparse canonical correlation analysis
    Harold Pimentel
    Zhiyue Hu
    Haiyan Huang
    Quantitative Biology, 2018, 6 (01) : 56 - 67
  • [2] Sparse canonical correlation analysis
    David R. Hardoon
    John Shawe-Taylor
    Machine Learning, 2011, 83 : 331 - 353
  • [3] Sparse canonical correlation analysis
    Hardoon, David R.
    Shawe-Taylor, John
    MACHINE LEARNING, 2011, 83 (03) : 331 - 353
  • [4] Robust sparse canonical correlation analysis
    Wilms, Ines
    Croux, Christophe
    BMC SYSTEMS BIOLOGY, 2016, 10
  • [5] Sparse Weighted Canonical Correlation Analysis
    MIN Wenwen
    LIU Juan
    ZHANG Shihua
    Chinese Journal of Electronics, 2018, 27 (03) : 459 - 466
  • [6] Sparse Weighted Canonical Correlation Analysis
    Min Wenwen
    Liu Juan
    Zhang Shihua
    CHINESE JOURNAL OF ELECTRONICS, 2018, 27 (03) : 459 - 466
  • [7] Sparse Generalized Canonical Correlation Analysis (DSGCCA)
    Guo, Chenfeng
    Wu, Dongrui
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 1959 - 1964
  • [8] MINIMAX ESTIMATION IN SPARSE CANONICAL CORRELATION ANALYSIS
    Gao, Chao
    Ma, Zongming
    Ren, Zhao
    Zhou, Harrison H.
    ANNALS OF STATISTICS, 2015, 43 (05): : 2168 - 2197
  • [9] Comparison of penalty functions for sparse canonical correlation analysis
    Chalise, Prabhakar
    Fridley, Brooke L.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2012, 56 (02) : 245 - 254
  • [10] A Mathematical Programming Approach to Sparse Canonical Correlation Analysis
    Amorosi, Lavinia
    Padellini, Tullia
    Puerto, Justo
    Valverde, Carlos
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 237