Gene Ranking of RNA-Seq Data via Discriminant Non-Negative Matrix Factorization

被引:21
|
作者
Jia, Zhilong [1 ,2 ]
Zhang, Xiang [3 ]
Guan, Naiyang [3 ]
Bo, Xiaochen [4 ]
Barnes, Michael R. [2 ]
Luo, Zhigang [3 ]
机构
[1] Natl Univ Def Technol, Coll Sci, Dept Biol & Chem, Changsha, Hunan, Peoples R China
[2] Queen Mary Univ London, William Harvey Res Inst, Barts & London Sch Med & Dent, London, England
[3] Natl Univ Def Technol, Sci & Technol Parallel & Distributed Proc Lab, Coll Comp, Changsha, Hunan, Peoples R China
[4] Inst Radiat Med, Beijing, Peoples R China
来源
PLOS ONE | 2015年 / 10卷 / 09期
基金
中国国家自然科学基金;
关键词
DIFFERENTIAL EXPRESSION ANALYSIS; ENRICHMENT ANALYSIS; PACKAGE; INFORMATION; ALGORITHM;
D O I
10.1371/journal.pone.0137782
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
RNA-sequencing is rapidly becoming the method of choice for studying the full complexity of transcriptomes, however with increasing dimensionality, accurate gene ranking is becoming increasingly challenging. This paper proposes an accurate and sensitive gene ranking method that implements discriminant non-negative matrix factorization (DNMF) for RNA-seq data. To the best of our knowledge, this is the first work to explore the utility of DNMF for gene ranking. When incorporating Fisher's discriminant criteria and setting the reduced dimension as two, DNMF learns two factors to approximate the original gene expression data, abstracting the up-regulated or down-regulated metagene by using the sample label information. The first factor denotes all the genes' weights of two metagenes as the additive combination of all genes, while the second learned factor represents the expression values of two metagenes. In the gene ranking stage, all the genes are ranked as a descending sequence according to the differential values of the metagene weights. Leveraging the nature of NMF and Fisher's criterion, DNMF can robustly boost the gene ranking performance. The Area Under the Curve analysis of differential expression analysis on two benchmarking tests of four RNA-seq data sets with similar phenotypes showed that our proposed DNMF-based gene ranking method outperforms other widely used methods. Moreover, the Gene Set Enrichment Analysis also showed DNMF outweighs others. DNMF is also computationally efficient, substantially outperforming all other benchmarked methods. Consequently, we suggest DNMF is an effective method for the analysis of differential gene expression and gene ranking for RNA-seq data.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Non-negative Matrix Factorization for EEG
    Jahan, Ibrahim Salem
    Snasel, Vaclav
    [J]. 2013 INTERNATIONAL CONFERENCE ON TECHNOLOGICAL ADVANCES IN ELECTRICAL, ELECTRONICS AND COMPUTER ENGINEERING (TAEECE), 2013, : 183 - 187
  • [42] Collaborative Non-negative Matrix Factorization
    Benlamine, Kaoutar
    Grozavu, Nistor
    Bennani, Younes
    Matei, Basarab
    [J]. ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: TEXT AND TIME SERIES, PT IV, 2019, 11730 : 655 - 666
  • [43] Algorithms for non-negative matrix factorization
    Lee, DD
    Seung, HS
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 13, 2001, 13 : 556 - 562
  • [44] Non-negative matrix factorization with α-divergence
    Cichocki, Andrzej
    Lee, Hyekyoung
    Kim, Yong-Deok
    Choi, Seungjin
    [J]. PATTERN RECOGNITION LETTERS, 2008, 29 (09) : 1433 - 1440
  • [45] Dropout non-negative matrix factorization
    He, Zhicheng
    Liu, Jie
    Liu, Caihua
    Wang, Yuan
    Yin, Airu
    Huang, Yalou
    [J]. KNOWLEDGE AND INFORMATION SYSTEMS, 2019, 60 (02) : 781 - 806
  • [46] Non-Negative Matrix Factorization with Constraints
    Liu, Haifeng
    Wu, Zhaohui
    [J]. PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010, : 506 - 511
  • [47] Non-negative Matrix Factorization on Manifold
    Cai, Deng
    He, Xiaofei
    Wu, Xiaoyun
    Han, Jiawei
    [J]. ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 63 - +
  • [48] A Framework for Regularized Non-Negative Matrix Factorization, with Application to the Analysis of Gene Expression Data
    Taslaman, Leo
    Nilsson, Bjorn
    [J]. PLOS ONE, 2012, 7 (11):
  • [49] Stretched non-negative matrix factorization
    Gu, Ran
    Rakita, Yevgeny
    Lan, Ling
    Thatcher, Zach
    Kamm, Gabrielle E.
    O'Nolan, Daniel
    Mcbride, Brennan
    Wustrow, Allison
    Neilson, James R.
    Chapman, Karena W.
    Du, Qiang
    Billinge, Simon J. L.
    [J]. NPJ COMPUTATIONAL MATERIALS, 2024, 10 (01)
  • [50] Uniqueness of non-negative matrix factorization
    Laurberg, Hans
    [J]. 2007 IEEE/SP 14TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING, VOLS 1 AND 2, 2007, : 44 - 48