A Distributed Constrained Non-negative Matrix Factorization Algorithm for Time-Series Gene Expression Data

被引:0
|
作者
Dyer, Matthew [1 ]
Dymacek, Julian [1 ]
机构
[1] Longwood Univ, Dept Math & Comp Sci, Farmville, VA 23909 USA
关键词
Non-negative matrix factorization; distributed computing; time-series data; PULMONARY; IDENTIFICATION; RESPONSES; EXPOSURE;
D O I
10.1145/3233547.3233579
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We present a new distributed computing algorithm, Parallel Pattern Discovery (PPD), for constrained Non-negative Matrix Factorization (NMF). Our implementation offers the ability to constrain a specific pattern for optimization of the data while minimizing reconstruction error. Parallel Pattern Discovery operates within a distributed environment using a message passing interface. Distribution of the PPD algorithm provides better scalability and allows operation in single- or multiple-system environments. The algorithm was tested on a set of time-series, dose-dependent mRNA gene expression data. Parallel Pattern Discovery was found to accurately identify patterns within the data and reconstruct the original matrices. Our NMF algorithm found a smaller reconstruction error when compared against standard NMF algorithms. Development focused on running PPD as part of a system which identifies significantly contributing genes. Parallel Pattern Discovery is first run to find patterns from biological data. It is followed by Gene Set Enrichment (GSE) which takes the pattern data and relates it back to genetic pathways.
引用
收藏
页码:97 / 102
页数:6
相关论文
共 50 条
  • [1] Decomposing Time Series Data by a Non-negative Matrix Factorization Algorithm with Temporally Constrained Coefficients
    Cheung, Vincent C. K.
    Devarajan, Karthik
    Severini, Giacomo
    Turolla, Andrea
    Bonato, Paolo
    [J]. 2015 37TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2015, : 3496 - 3499
  • [2] Gene Expression Data Classification Based on Non-negative Matrix Factorization
    Zheng, Chun-Hou
    Zhang, Ping
    Zhang, Lei
    Liu, Xin-Xin
    Han, Ju
    [J]. IJCNN: 2009 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1- 6, 2009, : 194 - +
  • [3] Convex Hull Convolutive Non-negative Matrix Factorization for Uncovering Temporal Patterns in Multivariate Time-Series Data
    Vaz, Colin
    Toutios, Asterios
    Narayanan, Shrikanth
    [J]. 17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 963 - 967
  • [4] Biclustering of gene expression data by non-smooth non-negative matrix factorization
    Carmona-Saez, P
    Pascual-Marqui, RD
    Tirado, F
    Carazo, JM
    Pascual-Montano, A
    [J]. BMC BIOINFORMATICS, 2006, 7 (1)
  • [5] Biclustering of gene expression data by non-smooth non-negative matrix factorization
    Pedro Carmona-Saez
    Roberto D Pascual-Marqui
    F Tirado
    Jose M Carazo
    Alberto Pascual-Montano
    [J]. BMC Bioinformatics, 7
  • [6] General subspace constrained non-negative matrix factorization for data representation
    Liu, Yong
    Liao, Yiyi
    Tang, Liang
    Tang, Feng
    Liu, Weicong
    [J]. NEUROCOMPUTING, 2016, 173 : 224 - 232
  • [7] A Fast Distributed Non-Negative Matrix Factorization Algorithm Based on DSGD
    Gao, Yan
    Zhou, Lingjun
    Chen, Baifan
    Xing, Xiaobing
    [J]. INTERNATIONAL JOURNAL OF DISTRIBUTED SYSTEMS AND TECHNOLOGIES, 2018, 9 (03) : 24 - 38
  • [8] Novel Algorithm for Non-Negative Matrix Factorization
    Tran Dang Hien
    Do Van Tuan
    Pham Van At
    Le Hung Son
    [J]. NEW MATHEMATICS AND NATURAL COMPUTATION, 2015, 11 (02) : 121 - 133
  • [9] Vocal Separation by Constrained Non-Negative Matrix Factorization
    Ochiai, Eri
    Fujisawa, Takanori
    Ikehara, Masaaki
    [J]. 2015 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA), 2015, : 480 - 483
  • [10] A Framework for Regularized Non-Negative Matrix Factorization, with Application to the Analysis of Gene Expression Data
    Taslaman, Leo
    Nilsson, Bjorn
    [J]. PLOS ONE, 2012, 7 (11):