Stable Biclustering of Gene Expression Data with Nonnegative Matrix Factorizations

Cited by: 0
Authors
Badea, Liviu
Tilivea, Doina
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Although clustering is probably the most frequently used tool for data mining gene expression data, existing clustering approaches face at least one of the following problems in this domain: a huge number of variables (genes) as compared to the number of samples, high noise levels, the inability to naturally deal with overlapping clusters, the instability of the resulting clusters with respect to the initialization of the algorithm, as well as the difficulty in clustering genes and samples simultaneously. In this paper we show that all of these problems can be elegantly dealt with by using nonnegative matrix factorizations to cluster genes and samples simultaneously while allowing for bicluster overlaps and by employing Positive Tensor Factorization to perform a two-way meta-clustering of the biclusters produced in several different clustering runs (thereby addressing the above-mentioned instability). The application of our approach to a large lung cancer dataset proved computationally tractable and was able to recover the histological classification of the various cancer subtypes represented in the dataset.
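The first step named in the abstract, using a nonnegative matrix factorization to group genes and samples at once while allowing overlaps, can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a samples-by-genes matrix `X`, uses the standard Lee-Seung multiplicative updates for the Frobenius objective, and reads off biclusters with a simple threshold. The names `nmf`, `biclusters`, `A`, `S`, and the `frac` parameter are hypothetical choices made for illustration. The Positive Tensor Factorization meta-clustering step is not shown.

```python
import numpy as np


def nmf(X, k, n_iter=200, seed=0, eps=1e-9):
    """Approximate a nonnegative matrix X (samples x genes) as A @ S.

    Uses Lee-Seung multiplicative updates for the Frobenius norm objective.
    Different seeds give different factorizations, which is the instability
    the abstract refers to.
    """
    rng = np.random.default_rng(seed)
    n, m = X.shape
    A = rng.random((n, k)) + eps  # sample loadings, shape (n, k)
    S = rng.random((k, m)) + eps  # gene loadings, shape (k, m)
    for _ in range(n_iter):
        S *= (A.T @ X) / (A.T @ A @ S + eps)
        A *= (X @ S.T) / (A @ S @ S.T + eps)
    return A, S


def biclusters(A, S, frac=0.5):
    """Return one (sample_indices, gene_indices) pair per factor.

    A sample or gene joins bicluster c when its loading exceeds `frac` times
    the largest loading on factor c (an illustrative rule, assumed here).
    A sample or gene may belong to several biclusters, so overlaps are allowed.
    """
    result = []
    for c in range(A.shape[1]):
        samples = np.flatnonzero(A[:, c] > frac * A[:, c].max())
        genes = np.flatnonzero(S[c] > frac * S[c].max())
        result.append((samples, genes))
    return result


# Synthetic example: two planted blocks that share genes 15..19.
rng = np.random.default_rng(42)
X = 0.1 * rng.random((20, 40))
X[:10, :20] += 1.0
X[10:, 15:35] += 1.0

A, S = nmf(X, k=2)
for samples, genes in biclusters(A, S):
    print("samples:", samples, "genes:", genes)
```

Running `nmf` with several values of `seed` yields several sets of biclusters; the abstract's meta-clustering step is what would reconcile such runs into a stable result.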
Pages: 2651-2656
Page count: 6