Nonnegative Decompositions with Resampling for Improving Gene Expression Data Biclustering Stability

被引:0
|
作者
Badea, Liviu [1 ]
Tilivea, Doina [1 ]
机构
[1] Natl Inst Res Informat, Tokyo, Tokyo, Japan
来源
ECAI 2008, PROCEEDINGS | 2008年 / 178卷
关键词
D O I
10.3233/978-1-58603-891-5-152
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The small sample sizes and high dimensionality of gene expression datasets pose significant problems for unsupervised subgroup discovery. While the stability of unidimensional clustering algorithms has been previously addressed, generalizing existing approaches to biclustering has proved extremely difficult. Despite these difficulties, developing a stable biclustering algorithm is essential for analyzing gene expression data, where genes tend to be co-expressed only for subsets of samples, in certain specific biological contexts, so that both gene and sample dimensions have to be taken into account simultaneously. In this paper, we describe an elegant approach for ensuring bicluster stability that combines three ideas. A slight modification of nonnegative matrix factorization that allows intercepts for genes has proved to be superior to other biclustering methods and is used for base-level clustering. A continuous-weight resampling method for samples is employed to generate slight perturbations of the dataset without sacrificing data and a positive tensor factorization is used to extract the biclusters that are common to the various runs. Finally, we present an application to a large colon cancer dataset for which we find 5 stable subclasses.
引用
收藏
页码:152 / +
页数:2
相关论文
共 50 条
  • [21] An Efficient Weighted Biclustering Algorithm for Gene Expression Data
    Jia, Yankun
    Li, Yidong
    Liu, Wenhua
    Dong, Hairong
    [J]. 2016 17TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES (PDCAT), 2016, : 336 - 341
  • [22] A comparative analysis of biclustering algorithms for gene expression data
    Eren, Kemal
    Deveci, Mehmet
    Kucuktunc, Onur
    Catalyurek, Umit V.
    [J]. BRIEFINGS IN BIOINFORMATICS, 2013, 14 (03) : 279 - 292
  • [23] Quick hierarchical biclustering on microarray gene expression data
    Ji, Liping
    Mock, Kenneth Wei-Liang
    Tan, Kian-Lee
    [J]. BIBE 2006: SIXTH IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, PROCEEDINGS, 2006, : 110 - +
  • [24] A Study of Biclustering Coherence Measures for Gene Expression Data
    Padilha, Victor A.
    de Carvalho, Andre C. P. L. F.
    [J]. 2018 7TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 2018, : 546 - 551
  • [25] Using the bagging approach for biclustering of gene expression data
    Hanczar, B.
    Nadif, M.
    [J]. NEUROCOMPUTING, 2011, 74 (10) : 1595 - 1605
  • [26] Application of simulated annealing to the biclustering of gene expression data
    Bryan, Kenneth
    Cunningham, Padraig
    Bolshakova, Nadia
    [J]. IEEE TRANSACTIONS ON INFORMATION TECHNOLOGY IN BIOMEDICINE, 2006, 10 (03): : 519 - 525
  • [27] Biclustering gene expression data by an improved optimal algorithm
    Wang, MingQian
    Tian, Wei
    Kang, Hao
    Gao, WenJu
    [J]. MECHATRONICS AND INDUSTRIAL INFORMATICS, PTS 1-4, 2013, 321-324 : 2223 - 2226
  • [28] Biclustering of gene expression data using genetic algorithm
    Chakraborty, A
    Maka, H
    [J]. PROCEEDINGS OF THE 2005 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2005, : 17 - 24
  • [29] Evaluation of Plaid Models in Biclustering of Gene Expression Data
    Majd, Hamid Alavi
    Shahsavari, Soodeh
    Baghestani, Ahmad Reza
    Tabatabaei, Seyyed Mohammad
    Bashi, Naghme Khadem
    Tavirani, Mostafa Rezaei
    Hamidpour, Mohsen
    [J]. SCIENTIFICA, 2016, 2016
  • [30] Randomized Algorithmic Approach for Biclustering of Gene Expression Data
    Nayak, Sradhanjali
    Mishra, Debahuti
    Das, Satyabrata
    Rath, Amiya Kumar
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2010, 1 (06) : 80 - 86