A Distributed Constrained Non-negative Matrix Factorization Algorithm for Time-Series Gene Expression Data

Cited by: 0
Authors
Dyer, Matthew [1]
Dymacek, Julian [1]
Affiliations
[1] Longwood Univ, Dept Math & Comp Sci, Farmville, VA 23909 USA
Keywords
Non-negative matrix factorization; distributed computing; time-series data; PULMONARY; IDENTIFICATION; RESPONSES; EXPOSURE
DOI
10.1145/3233547.3233579
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
We present a new distributed computing algorithm, Parallel Pattern Discovery (PPD), for constrained Non-negative Matrix Factorization (NMF). Our implementation offers the ability to constrain a specific pattern for optimization of the data while minimizing reconstruction error. Parallel Pattern Discovery operates within a distributed environment using a message passing interface. Distribution of the PPD algorithm provides better scalability and allows operation in single- or multiple-system environments. The algorithm was tested on a set of time-series, dose-dependent mRNA gene expression data. Parallel Pattern Discovery was found to accurately identify patterns within the data and reconstruct the original matrices. Our NMF algorithm found a smaller reconstruction error when compared against standard NMF algorithms. Development focused on running PPD as part of a system which identifies significantly contributing genes. Parallel Pattern Discovery is first run to find patterns from biological data. It is followed by Gene Set Enrichment (GSE) which takes the pattern data and relates it back to genetic pathways.
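The idea of constraining one pattern while minimizing reconstruction error can be illustrated with a minimal single-process sketch. This is not the authors' PPD algorithm and it does not use a message passing interface: it swaps in the standard Lee-Seung multiplicative updates and simply holds one row of the pattern matrix fixed. The names constrained_nmf, reconstruction_error, fixed_pattern, and all parameters are hypothetical and chosen only for illustration.

```python
import numpy as np


def constrained_nmf(V, k, fixed_pattern, n_iter=500, eps=1e-9, seed=0):
    """Factor V (genes x conditions) as W @ H with W, H >= 0.

    Assumption for illustration: the constraint is that the first row of H
    is held equal to `fixed_pattern`. Updates are the standard
    multiplicative rules for the Frobenius-norm objective, not PPD.
    """
    rng = np.random.default_rng(seed)
    m, n = V.shape
    W = rng.random((m, k)) + eps   # gene weights for each pattern
    H = rng.random((k, n)) + eps   # patterns over time/dose conditions
    H[0] = fixed_pattern           # impose the constrained pattern
    for _ in range(n_iter):
        # Update all patterns, then restore the constrained one.
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        H[0] = fixed_pattern
        # Update the gene weights with the patterns held fixed.
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H


def reconstruction_error(V, W, H):
    """Frobenius norm of the residual V - W @ H."""
    return np.linalg.norm(V - W @ H)


if __name__ == "__main__":
    # Synthetic non-negative data that contains a known linear-ramp pattern.
    rng = np.random.default_rng(1)
    pattern = np.linspace(0.1, 1.0, 12)
    H_true = rng.random((3, 12))
    H_true[0] = pattern
    V = rng.random((40, 3)) @ H_true

    W, H = constrained_nmf(V, k=3, fixed_pattern=pattern)
    print("relative error:", reconstruction_error(V, W, H) / np.linalg.norm(V))
```

Because only the unconstrained rows of H are free to move, the weights in the first column of W indicate how strongly each gene follows the imposed pattern, which is the kind of output a downstream enrichment step could consume.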
Pages: 97-102
Page count: 6