Nonnegative Decompositions with Resampling for Improving Gene Expression Data Biclustering Stability

Cited by: 0
Authors
Badea, Liviu [1]
Tilivea, Doina [1]
Affiliations
[1] Natl Inst Res Informat, Tokyo, Tokyo, Japan
Source
ECAI 2008, PROCEEDINGS | 2008 / Vol. 178
DOI
10.3233/978-1-58603-891-5-152
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The small sample sizes and high dimensionality of gene expression datasets pose significant problems for unsupervised subgroup discovery. While the stability of unidimensional clustering algorithms has been previously addressed, generalizing existing approaches to biclustering has proved extremely difficult. Despite these difficulties, developing a stable biclustering algorithm is essential for analyzing gene expression data, where genes tend to be co-expressed only for subsets of samples, in certain specific biological contexts, so that both gene and sample dimensions have to be taken into account simultaneously. In this paper, we describe an elegant approach for ensuring bicluster stability that combines three ideas. First, a slight modification of nonnegative matrix factorization that allows intercepts for genes has proved to be superior to other biclustering methods and is used for base-level clustering. Second, a continuous-weight resampling method for samples is employed to generate slight perturbations of the dataset without sacrificing data. Third, a positive tensor factorization is used to extract the biclusters that are common to the various runs. Finally, we present an application to a large colon cancer dataset for which we find 5 stable subclasses.
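The first two ingredients of the abstract can be illustrated with a minimal Python sketch; it is not the authors' implementation. It assumes a nonnegative genes × samples matrix `X`, models it as `X ≈ W @ H + b` with a nonnegative per-gene intercept `b`, and fits the model with standard multiplicative updates. The function names (`offset_nmf`, `weighted_error`, `resampled_runs`) are hypothetical, and the exponential sample weights are only one plausible choice of continuous-weight resampling; the record does not state which distribution the paper uses. The final step, a positive tensor factorization across runs, is not shown.

```python
import numpy as np


def offset_nmf(X, k, weights, n_iter=200, seed=0, eps=1e-9):
    """Sample-weighted NMF with a per-gene intercept: X ~ W @ H + b.

    X is genes x samples and nonnegative; weights holds one positive
    weight per sample (column). Assumed update scheme, not the paper's.
    """
    rng = np.random.default_rng(seed)
    n_genes, n_samples = X.shape
    # The last column of W_aug is the intercept b; the matching last
    # row of H_aug is held fixed at 1 so it acts as a constant offset.
    W_aug = rng.random((n_genes, k + 1)) + eps
    H_aug = np.vstack([rng.random((k, n_samples)) + eps,
                       np.ones((1, n_samples))])
    for _ in range(n_iter):
        R = W_aug @ H_aug
        # Sample weights scale columns; they affect only the gene-side update.
        W_aug *= ((X * weights) @ H_aug.T) / ((R * weights) @ H_aug.T + eps)
        R = W_aug @ H_aug
        # Per-sample weights cancel in the H update; the ones row is untouched.
        H_aug[:k] *= (W_aug[:, :k].T @ X) / (W_aug[:, :k].T @ R + eps)
    return W_aug[:, :k], H_aug[:k], W_aug[:, k]


def weighted_error(X, W, H, b, weights):
    """Weighted squared reconstruction error of the offset model."""
    residual = X - W @ H - b[:, None]
    return float(np.sum(weights * residual ** 2))


def resampled_runs(X, k, n_runs=10, seed=0):
    """Refit the model under random positive sample weights (assumed scheme)."""
    rng = np.random.default_rng(seed)
    n_samples = X.shape[1]
    runs = []
    for r in range(n_runs):
        # Strictly positive weights with mean 1: every sample is perturbed,
        # none is dropped, unlike discrete bootstrap resampling.
        w = rng.exponential(1.0, size=n_samples)
        w *= n_samples / w.sum()
        runs.append(offset_nmf(X, k, w, seed=seed + r + 1))
    return runs


if __name__ == "__main__":
    X = np.random.default_rng(42).random((50, 20))  # synthetic stand-in data
    runs = resampled_runs(X, k=3, n_runs=5)
    print(len(runs), runs[0][0].shape, runs[0][1].shape)
```

Each run returns a triple `(W, H, b)`. Stacking the per-run factors would give the multi-run array that a consensus step such as the positive tensor factorization mentioned in the abstract would take as input.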
Pages: 152 / +
Page count: 2
Related papers
50 items in total
  • [31] Biclustering of gene expression data based on hybrid genetic algorithm
    Bagyamani, J.
    Thangavel, K.
    Rathipriya, R.
    [J]. INTERNATIONAL JOURNAL OF DATA MINING MODELLING AND MANAGEMENT, 2013, 5 (04) : 333 - 350
  • [32] MIB: Using mutual information for biclustering gene expression data
    Gupta, Neelima
    Aggarwal, Seema
    [J]. PATTERN RECOGNITION, 2010, 43 (08) : 2692 - 2697
  • [33] Biclustering of high-throughput gene expression data with BiclusterMiner
    Kiraly, Andras
    Abonyi, Janos
    Laiho, Asta
    Gyenesei, Attila
    [J]. 12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2012), 2012, : 131 - 138
  • [34] Cuckoo Search with Mutation for Biclustering of Microarray Gene Expression Data
    Rengeswaran, Balamurugan
    Mathaiyan, Natarajan
    Kandasamy, Premalatha
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2017, 14 (03) : 300 - 306
  • [35] Biclustering gene expression data by random projection based on bucketing
    Liu, Juan
    Liu, Feng
    [J]. 2008 INTERNATIONAL SPECIAL TOPIC CONFERENCE ON INFORMATION TECHNOLOGY AND APPLICATIONS IN BIOMEDICINE, VOLS 1 AND 2, 2008, : 322 - 325
  • [36] A systematic comparison and evaluation of biclustering methods for gene expression data
    Prelic, A
    Bleuler, S
    Zimmermann, P
    Wille, A
    Bühlmann, P
    Gruissem, W
    Hennig, L
    Thiele, L
    Zitzler, E
    [J]. BIOINFORMATICS, 2006, 22 (09) : 1122 - 1129
  • [37] Biclustering Gene Expression Data using MSR Difference Threshold
    Das, Shyama
    Idicula, Sumam Mary
    [J]. 2009 ANNUAL IEEE INDIA CONFERENCE (INDICON 2009), 2009, : 430 - +
  • [38] Gibbs Sampling Based Bayesian Biclustering of Gene Expression Data
    Chen, Daoyuan
    Liu, Qinyi
    Meng, Jia
    Su, Jionglong
    [J]. 2020 13TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2020), 2020, : 790 - 795
  • [39] QUBIC: a qualitative biclustering algorithm for analyses of gene expression data
    Li, Guojun
    Ma, Qin
    Tang, Haibao
    Paterson, Andrew H.
    Xu, Ying
    [J]. NUCLEIC ACIDS RESEARCH, 2009, 37 (15)
  • [40] Biclustering Analysis on Class Discovery From Gene Expression Data
    Anitha, S.
    Chandran, C. P.
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATIONS TECHNOLOGIES (ICCCT 15), 2015, : 55 - 60