Nonnegative Decompositions with Resampling for Improving Gene Expression Data Biclustering Stability

Cited by: 0
Authors
Badea, Liviu [1]
Tilivea, Doina [1]
Affiliations
[1] Natl Inst Res Informat, Bucharest, Romania
Source
ECAI 2008, PROCEEDINGS | 2008 / Vol. 178
DOI
10.3233/978-1-58603-891-5-152
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The small sample sizes and high dimensionality of gene expression datasets pose significant problems for unsupervised subgroup discovery. While the stability of unidimensional clustering algorithms has been addressed previously, generalizing existing approaches to biclustering has proved extremely difficult. Nevertheless, a stable biclustering algorithm is essential for analyzing gene expression data, where genes tend to be co-expressed only for subsets of samples, in specific biological contexts, so that the gene and sample dimensions have to be taken into account simultaneously. In this paper, we describe an elegant approach for ensuring bicluster stability that combines three ideas. First, a slight modification of nonnegative matrix factorization that allows intercepts for genes, which has proved superior to other biclustering methods, is used for base-level clustering. Second, a continuous-weight resampling method for samples generates slight perturbations of the dataset without sacrificing data. Third, a positive tensor factorization extracts the biclusters that are common to the various runs. Finally, we present an application to a large colon cancer dataset, for which we find 5 stable subclasses.
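The first two ideas in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's update rules or its weight distribution, so everything here is an assumption: the sketch uses standard Lee-Seung style multiplicative updates on a sample-weighted squared error, stores the gene intercepts as an extra column of W paired with a fixed row of ones in H, and draws the continuous sample weights from an exponential distribution rescaled to mean 1. The function names (`weighted_nmf_with_intercept`, `continuous_weights`, `weighted_error`) and the toy data are hypothetical. The positive tensor factorization step that merges the runs is not shown.

```python
import random

def matmul(A, B):
    """Plain-Python matrix product of two lists of rows."""
    Bt = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in Bt] for row in A]

def continuous_weights(m, rng):
    """Assumed scheme: nonnegative exponential draws rescaled to mean 1."""
    raw = [rng.expovariate(1.0) for _ in range(m)]
    total = sum(raw)
    return [m * r / total for r in raw]

def weighted_nmf_with_intercept(X, k, weights, iters=200, seed=0, eps=1e-9):
    """Approximate X (genes x samples) by W H with nonnegative factors.

    The last column of W holds the gene intercepts; the matching last row
    of H is fixed to ones. The objective is the squared error of each
    sample (column) multiplied by that sample's weight.
    """
    rng = random.Random(seed)
    n, m = len(X), len(X[0])
    W = [[rng.random() + 0.1 for _ in range(k + 1)] for _ in range(n)]
    H = [[rng.random() + 0.1 for _ in range(m)] for _ in range(k)] + [[1.0] * m]
    for _ in range(iters):
        # W update: the sample weights enter through D H^T (m x (k+1)).
        DHt = [[H[c][j] * weights[j] for c in range(k + 1)] for j in range(m)]
        num = matmul(X, DHt)
        den = matmul(matmul(W, H), DHt)
        W = [[W[i][c] * num[i][c] / (den[i][c] + eps) for c in range(k + 1)]
             for i in range(n)]
        # H update: a positive per-sample weight cancels column by column,
        # so the unweighted rule applies. The row of ones is left untouched.
        Wt = [list(col) for col in zip(*W)]
        num = matmul(Wt, X)
        den = matmul(matmul(Wt, W), H)
        for c in range(k):
            H[c] = [H[c][j] * num[c][j] / (den[c][j] + eps) for j in range(m)]
    return W, H

def weighted_error(X, W, H, weights):
    """Sample-weighted squared reconstruction error."""
    R = matmul(W, H)
    return sum(weights[j] * (X[i][j] - R[i][j]) ** 2
               for i in range(len(X)) for j in range(len(X[0])))

# Toy data: 6 genes x 8 samples. Genes 0-2 are elevated in samples 0-3,
# genes 3-5 in samples 4-7, on top of a gene-specific baseline.
X = [[(2.0 if (i < 3) == (j < 4) else 0.0) + 1.0 + 0.1 * i for j in range(8)]
     for i in range(6)]

# Each run perturbs the dataset by reweighting samples instead of dropping them.
rng = random.Random(42)
runs = []
for r in range(5):
    w = continuous_weights(8, rng)
    W, H = weighted_nmf_with_intercept(X, k=2, weights=w, seed=r)
    runs.append((w, W, H))
```

Because every sample keeps a nonzero weight (barring a degenerate draw), no column of the data matrix is discarded in any run, which is the sense in which continuous-weight resampling perturbs the dataset "without sacrificing data". Stacking the factors from `runs` and factorizing the result would be the place where a consensus step such as the positive tensor factorization mentioned in the abstract comes in.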
Pages: 152 / +
Page count: 2