Stable Biclustering of Gene Expression Data with Nonnegative Matrix Factorizations

Cited by: 0
Authors
Badea, Liviu
Tilivea, Doina
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Although clustering is probably the most frequently used tool for mining gene expression data, existing clustering approaches face at least one of the following problems in this domain: a huge number of variables (genes) compared to the number of samples, high noise levels, the inability to deal naturally with overlapping clusters, the instability of the resulting clusters with respect to the initialization of the algorithm, and the difficulty of clustering genes and samples simultaneously. In this paper we show that all of these problems can be dealt with elegantly by using nonnegative matrix factorizations to cluster genes and samples simultaneously while allowing for bicluster overlaps, and by employing Positive Tensor Factorization to perform a two-way meta-clustering of the biclusters produced in several different clustering runs (thereby addressing the above-mentioned instability). Applied to a large lung cancer dataset, our approach proved computationally tractable and recovered the histological classification of the various cancer subtypes represented in the dataset.
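As a rough illustration of the first idea in the abstract (reading biclusters off a nonnegative factorization of the genes-by-samples matrix), here is a minimal sketch. It is not the authors' implementation: the Lee-Seung multiplicative updates, the toy matrix, the function names (`nmf`, `biclusters`), and the 0.5 relative threshold are all assumptions chosen for illustration. The Positive Tensor Factorization meta-clustering step is not shown.

```python
import random

def matmul(A, B):
    """Multiply two matrices stored as lists of rows."""
    Bt = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in Bt] for row in A]

def transpose(A):
    return [list(col) for col in zip(*A)]

def nmf(X, k, iters=1000, seed=0, eps=1e-9):
    """Approximate nonnegative X (genes x samples) as W * H with k factors.

    Uses multiplicative updates for the squared error (an assumed choice;
    the abstract does not say which NMF variant is used).
    """
    rng = random.Random(seed)
    n, m = len(X), len(X[0])
    W = [[rng.random() + 0.1 for _ in range(k)] for _ in range(n)]
    H = [[rng.random() + 0.1 for _ in range(m)] for _ in range(k)]
    for _ in range(iters):
        Wt = transpose(W)
        num, den = matmul(Wt, X), matmul(matmul(Wt, W), H)
        H = [[H[i][j] * num[i][j] / (den[i][j] + eps) for j in range(m)]
             for i in range(k)]
        Ht = transpose(H)
        num, den = matmul(X, Ht), matmul(W, matmul(H, Ht))
        W = [[W[i][j] * num[i][j] / (den[i][j] + eps) for j in range(k)]
             for i in range(n)]
    return W, H

def biclusters(W, H, frac=0.5):
    """For each factor, keep genes and samples whose loading reaches
    frac times that factor's largest loading (hypothetical threshold).
    A gene or sample may pass the threshold for several factors,
    which is how overlapping biclusters can arise."""
    found = []
    for c in range(len(H)):
        col = [row[c] for row in W]
        genes = [g for g, v in enumerate(col) if v >= frac * max(col)]
        samples = [s for s, v in enumerate(H[c]) if v >= frac * max(H[c])]
        found.append((genes, samples))
    return found

# Toy data: genes 0-2 are expressed in samples 0-1, genes 3-5 in samples 2-3.
X = [[5.0, 5.0, 0.1, 0.1]] * 3 + [[0.1, 0.1, 5.0, 5.0]] * 3
W, H = nmf(X, k=2)
found = biclusters(W, H)
for genes, samples in sorted(found):
    print("genes", genes, "samples", samples)
```

Rerunning `nmf` with a different `seed` can return the factors in a different order or, on noisier data, with different memberships. That run-to-run variation is the instability the abstract proposes to resolve by meta-clustering the biclusters from several runs.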
Pages: 2651-2656
Page count: 6