Finding Correlated Biclusters from Gene Expression Data

被引:33
|
作者
Yang, Wen-Hui [1 ,2 ]
Dai, Dao-Qing [1 ,2 ]
Yan, Hong [3 ,4 ]
机构
[1] Sun Yat Sen Zhongshan Univ, Ctr Comp Vis, Guangzhou 510275, Guangdong, Peoples R China
[2] Sun Yat Sen Zhongshan Univ, Dept Math, Fac Math & Comp, Guangzhou 510275, Guangdong, Peoples R China
[3] City Univ Hong Kong, Dept Elect Engn, Kowloon, Hong Kong, Peoples R China
[4] Univ Sydney, Sch Elect & Informat Engn, Sydney, NSW 2006, Australia
关键词
Biclustering; pattern classification; gene expression data; singular-value decomposition; data mining; biology computing; SINGULAR-VALUE DECOMPOSITION; MICROARRAY DATA; DISCRIMINANT-ANALYSIS; CLUSTER-ANALYSIS; PATTERNS; MODELS;
D O I
10.1109/TKDE.2010.150
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Extracting biologically relevant information from DNA microarrays is a very important task for drug development and test, function annotation, and cancer diagnosis. Various clustering methods have been proposed for the analysis of gene expression data, but when analyzing the large and heterogeneous collections of gene expression data, conventional clustering algorithms often cannot produce a satisfactory solution. Biclustering algorithm has been presented as an alternative approach to standard clustering techniques to identify local structures from gene expression data set. These patterns may provide clues about the main biological processes associated with different physiological states. In this paper, different from existing bicluster patterns, we first introduce a more general pattern: correlated bicluster, which has intuitive biological interpretation. Then, we propose a novel transform technique based on singular value decomposition so that identifying correlated-bicluster problem from gene expression matrix is transformed into two global clustering problems. The Mixed-Clustering algorithm and the Lift algorithm are devised to efficiently produce delta-corBiclusters. The biclusters obtained using our method from gene expression data sets of multiple human organs and the yeast Saccharomyces cerevisiae demonstrate clear biological meanings.
引用
收藏
页码:568 / 584
页数:17
相关论文
共 50 条
  • [1] Finding k-Biclusters from Gene Expression Data
    Xu, Xiaohua
    He, Ping
    Lu, Lin
    Xi, Yanqiu
    Pan, Zhoujin
    INTELLIGENT COMPUTING THEORIES AND APPLICATIONS, ICIC 2012, 2012, 7390 : 433 - 439
  • [2] Finding distinct biclusters from background in gene expression matrices
    Wu, Zhengpeng
    Ao, Jiangni
    Zhang, Xuegong
    BIOINFORMATION, 2007, 2 (05) : 207 - 215
  • [3] Extraction of Optimal Biclusters from Gene Expression Data
    Bagyamani, J.
    Thangavel, K.
    Rathipriya, R.
    INFORMATION AND COMMUNICATION TECHNOLOGIES, 2010, 101 : 380 - +
  • [4] Discovering significant biclusters in gene expression data
    Zalik, Krista Rizman
    WSEAS Transactions on Information Science and Applications, 2005, 2 (09): : 1454 - 1461
  • [5] Statistical identification of biclusters in gene expression data
    Chakraborty, A
    Proceedings of the 8th Joint Conference on Information Sciences, Vols 1-3, 2005, : 1185 - 1190
  • [6] Mining deterministic biclusters in gene expression data
    Zhang, ZH
    Teo, A
    Ooi, BC
    Tan, KL
    BIBE 2004: FOURTH IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, PROCEEDINGS, 2004, : 283 - 290
  • [7] Discovering maximal size coherent biclusters from gene expression data
    Bagyamani, J.
    Thangavel, K.
    INTERNATIONAL JOURNAL OF HEALTHCARE TECHNOLOGY AND MANAGEMENT, 2011, 12 (5-6) : 405 - 421
  • [8] Constructing gene network based on biclusters of expression data
    Liu, F.
    Yang, L.
    Tian, Z. Z.
    Wu, P.
    Sun, S. L.
    GENETICS AND MOLECULAR RESEARCH, 2016, 15 (02)
  • [9] Exhaustive search of maximal biclusters in gene expression data
    Okada, Yoshifumi
    Fujibuchi, Wataru
    Horton, Paul
    IMECS 2007: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2007, : 307 - +
  • [10] Discovery of error-tolerant biclusters from noisy gene expression data
    Rohit Gupta
    Navneet Rao
    Vipin Kumar
    BMC Bioinformatics, 12