Biclustering by sparse canonical correlation analysis

被引:5
|
作者
Pimentel, Harold [1 ]
Hu, Zhiyue [2 ]
Huang, Haiyan [2 ]
机构
[1] Univ Calif Berkeley, Dept Comp Sci, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
关键词
biclustering; SCCA; gene clusters;
D O I
10.1007/s40484-017-0127-0
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
BackgroundDeveloping appropriate computational tools to distill biological insights from large-scale gene expression data has been an important part of systems biology. Considering that gene relationships may change or only exist in a subset of collected samples, biclustering that involves clustering both genes and samples has become increasingly important, especially when the samples are pooled from a wide range of experimental conditions.MethodsIn this paper, we introduce a new biclustering algorithm to find subsets of genomic expression features (EFs) (e.g., genes, isoforms, exon inclusion) that show strong "group interactions" under certain subsets of samples. Group interactions are defined by strong partial correlations, or equivalently, conditional dependencies between EFs after removing the influences of a set of other functionally related EFs. Our new biclustering method, named SCCA-BC, extends an existing method for group interaction inference, which is based on sparse canonical correlation analysis (SCCA) coupled with repeated random partitioning of the gene expression data set.ResultsSCCA-BC gives sensible results on real data sets and outperforms most existing methods in simulations. Software is available at https://github.com/pimentel/scca-bc.ConclusionsSCCA-BC seems to work in numerous conditions and the results seem promising for future extensions. SCCA-BC has the ability to find different types of bicluster patterns, and it is especially advantageous in identifying a bicluster whose elements share the same progressive and multivariate normal distribution with a dense covariance matrix.
引用
收藏
页码:56 / 67
页数:12
相关论文
共 50 条
  • [21] Distributed Sparse Canonical Correlation Analysis in Clustering Sensor Data
    Chen, Jia
    Schizas, Ioannis D.
    2013 ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, 2013, : 639 - 643
  • [22] Extensions of Sparse Canonical Correlation Analysis with Applications to Genomic Data
    Witten, Daniela M.
    Tibshirani, Robert J.
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2009, 8 (01)
  • [23] Image retrieval approach based on sparse canonical correlation analysis
    Zhuang, Ling
    Zhuang, Yue-Ting
    Wu, Jiang-Qin
    Ye, Zhen-Chao
    Wu, Fei
    Ruan Jian Xue Bao/Journal of Software, 2012, 23 (05): : 1295 - 1304
  • [24] Resistant multiple sparse canonical correlation
    Coleman, Jacob
    Replogle, Joseph
    Chandler, Gabriel
    Hardin, Johanna
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2016, 15 (02) : 123 - 138
  • [25] Canonical correlation analysis based on local sparse representation and linear discriminative analysis
    Xia, J.-M. (jianmingeei@163.com), 1600, Northeast University (29):
  • [26] Spatial Correlation Analysis Using Canonical Correlation Decomposition for Sparse Sonar Array Processing
    Zhao, Yinghui
    Azimi-Sadjadi, Mahmood R.
    Wachowski, Neil
    Klausner, Nick
    2009 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC 2009), VOLS 1-9, 2009, : 2739 - 2744
  • [27] Sparse Canonical Correlation Analysis Applied to fMRI and Genetic Data Fusion
    Boutte, David
    Liu, Jingyu
    2010 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2010, : 422 - 426
  • [28] A fault detection method based on sparse dynamic canonical correlation analysis
    Hu, Xuguang
    Wu, Ping
    Pan, Haipeng
    He, Yuchen
    CANADIAN JOURNAL OF CHEMICAL ENGINEERING, 2024, 102 (03): : 1188 - 1202
  • [29] An iterative penalized least squares approach to sparse canonical correlation analysis
    Mai, Qing
    Zhang, Xin
    BIOMETRICS, 2019, 75 (03) : 734 - 744
  • [30] The group sparse canonical correlation analysis method in the imaging genetics research
    Wu, Jie
    Xu, Jiawei
    Chen, Wei
    Sun, Deyan
    2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 2554 - 2557