Finding Correlated Biclusters from Gene Expression Data

Cited by: 33
Authors
Yang, Wen-Hui [1, 2]
Dai, Dao-Qing [1, 2]
Yan, Hong [3, 4]
Affiliations
[1] Sun Yat Sen Zhongshan Univ, Ctr Comp Vis, Guangzhou 510275, Guangdong, Peoples R China
[2] Sun Yat Sen Zhongshan Univ, Dept Math, Fac Math & Comp, Guangzhou 510275, Guangdong, Peoples R China
[3] City Univ Hong Kong, Dept Elect Engn, Kowloon, Hong Kong, Peoples R China
[4] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW 2006, Australia
Keywords
Biclustering; pattern classification; gene expression data; singular-value decomposition; data mining; biology computing; SINGULAR-VALUE DECOMPOSITION; MICROARRAY DATA; DISCRIMINANT-ANALYSIS; CLUSTER-ANALYSIS; PATTERNS; MODELS
DOI
10.1109/TKDE.2010.150
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Extracting biologically relevant information from DNA microarrays is an important task in drug development and testing, function annotation, and cancer diagnosis. Various clustering methods have been proposed for the analysis of gene expression data, but conventional clustering algorithms often cannot produce a satisfactory solution on large and heterogeneous collections of such data. Biclustering algorithms have been presented as an alternative to standard clustering techniques for identifying local structures in gene expression data sets. These patterns may provide clues about the main biological processes associated with different physiological states. In this paper, we first introduce a pattern that is more general than existing bicluster patterns: the correlated bicluster, which has an intuitive biological interpretation. Then, we propose a novel transform technique based on singular value decomposition so that the problem of identifying correlated biclusters in a gene expression matrix is transformed into two global clustering problems. The Mixed-Clustering algorithm and the Lift algorithm are devised to efficiently produce delta-corBiclusters. The biclusters obtained using our method from gene expression data sets of multiple human organs and the yeast Saccharomyces cerevisiae demonstrate clear biological meanings.
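The abstract does not give the formal definition of a delta-corBicluster, so the following is only a rough illustration of the general idea, not the authors' Mixed-Clustering or Lift algorithm. It assumes, hypothetically, that a submatrix counts as a correlated bicluster when every pair of its rows, restricted to the selected columns, satisfies 1 - |rho| <= delta, where rho is the Pearson correlation. The function names, the toy matrix, and the choice to check rows only are all assumptions made for this sketch.

```python
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0  # constant vector: correlation is undefined, treated as 0 here
    return cov / sqrt(vx * vy)


def is_correlated_bicluster(matrix, rows, cols, delta):
    """Assumed criterion (not taken from the paper): every pair of selected
    rows, restricted to the selected columns, satisfies 1 - |rho| <= delta."""
    sub = [[matrix[r][c] for c in cols] for r in rows]
    return all(1 - abs(pearson(a, b)) <= delta for a, b in combinations(sub, 2))


# Toy expression matrix: genes are rows, conditions are columns.
expr = [
    [1.0, 2.0, 3.0, 9.0],  # gene 0
    [2.0, 4.0, 6.0, 1.0],  # gene 1: scaled copy of gene 0 on conditions 0-2
    [5.0, 3.0, 1.0, 4.0],  # gene 2: negatively correlated with gene 0 on conditions 0-2
    [7.0, 1.0, 8.0, 2.0],  # gene 3: unrelated to the others
]

# Genes 0-2 are strongly correlated on conditions 0-2 only (a local pattern).
local = is_correlated_bicluster(expr, rows=[0, 1, 2], cols=[0, 1, 2], delta=0.01)
# Across all four conditions the same genes no longer correlate.
all_conditions = is_correlated_bicluster(expr, rows=[0, 1, 2], cols=[0, 1, 2, 3], delta=0.01)
# Swapping in the unrelated gene 3 also breaks the pattern.
with_unrelated = is_correlated_bicluster(expr, rows=[0, 1, 3], cols=[0, 1, 2], delta=0.01)
```

In this toy case the three genes form a pattern on a subset of conditions that a clustering over all conditions would miss, which is the kind of local structure the abstract says biclustering targets. Using the absolute value of the correlation, so that negatively correlated genes also qualify, is a design choice of this sketch.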
Pages: 568-584
Number of pages: 17