Gene Ranking of RNA-Seq Data via Discriminant Non-Negative Matrix Factorization

被引:21
|
作者
Jia, Zhilong [1 ,2 ]
Zhang, Xiang [3 ]
Guan, Naiyang [3 ]
Bo, Xiaochen [4 ]
Barnes, Michael R. [2 ]
Luo, Zhigang [3 ]
机构
[1] Natl Univ Def Technol, Coll Sci, Dept Biol & Chem, Changsha, Hunan, Peoples R China
[2] Queen Mary Univ London, William Harvey Res Inst, Barts & London Sch Med & Dent, London, England
[3] Natl Univ Def Technol, Sci & Technol Parallel & Distributed Proc Lab, Coll Comp, Changsha, Hunan, Peoples R China
[4] Inst Radiat Med, Beijing, Peoples R China
来源
PLOS ONE | 2015年 / 10卷 / 09期
基金
中国国家自然科学基金;
关键词
DIFFERENTIAL EXPRESSION ANALYSIS; ENRICHMENT ANALYSIS; PACKAGE; INFORMATION; ALGORITHM;
D O I
10.1371/journal.pone.0137782
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
RNA-sequencing is rapidly becoming the method of choice for studying the full complexity of transcriptomes, however with increasing dimensionality, accurate gene ranking is becoming increasingly challenging. This paper proposes an accurate and sensitive gene ranking method that implements discriminant non-negative matrix factorization (DNMF) for RNA-seq data. To the best of our knowledge, this is the first work to explore the utility of DNMF for gene ranking. When incorporating Fisher's discriminant criteria and setting the reduced dimension as two, DNMF learns two factors to approximate the original gene expression data, abstracting the up-regulated or down-regulated metagene by using the sample label information. The first factor denotes all the genes' weights of two metagenes as the additive combination of all genes, while the second learned factor represents the expression values of two metagenes. In the gene ranking stage, all the genes are ranked as a descending sequence according to the differential values of the metagene weights. Leveraging the nature of NMF and Fisher's criterion, DNMF can robustly boost the gene ranking performance. The Area Under the Curve analysis of differential expression analysis on two benchmarking tests of four RNA-seq data sets with similar phenotypes showed that our proposed DNMF-based gene ranking method outperforms other widely used methods. Moreover, the Gene Set Enrichment Analysis also showed DNMF outweighs others. DNMF is also computationally efficient, substantially outperforming all other benchmarked methods. Consequently, we suggest DNMF is an effective method for the analysis of differential gene expression and gene ranking for RNA-seq data.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Detecting heterogeneity in single-cell RNA-Seq data by non-negative matrix factorization
    Zhu, Xun
    Ching, Travers
    Pan, Xinghua
    Weissman, Sherman M.
    Garmire, Lana
    [J]. PEERJ, 2017, 5
  • [2] Imputation for Single-cell RNA-seq Data with Non-negative Matrix Factorization and Transfer Learning
    Zhu, Jiadi
    Yang, Youlong
    [J]. JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2023, 21 (06)
  • [3] scRNMF: An imputation method for single-cell RNA-seq data by robust and non-negative matrix factorization
    Qian, Yuqing
    Zou, Quan
    Zhao, Mengyuan
    Liu, Yi
    Guo, Fei
    Ding, Yijie
    [J]. PLOS COMPUTATIONAL BIOLOGY, 2024, 20 (08)
  • [4] Discriminant Projective Non-Negative Matrix Factorization
    Guan, Naiyang
    Zhang, Xiang
    Luo, Zhigang
    Tao, Dacheng
    Yang, Xuejun
    [J]. PLOS ONE, 2013, 8 (12):
  • [5] IMPROVING NON-NEGATIVE MATRIX FACTORIZATION VIA RANKING ITS BASES
    Huang, Sheng
    Elhoseiny, Mohamed
    Elgammal, Ahmed
    Yang, Dan
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 5951 - 5955
  • [6] Recent Advances in Discriminant Non-negative Matrix Factorization
    Nikitidis, Symeon
    Tefas, Anastasios
    Pitas, Ioannis
    [J]. 2011 FIRST ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR), 2011, : 1 - 6
  • [7] Online Discriminant Projective Non-negative Matrix Factorization
    Zhang, Xiang
    Liao, Qing
    Luo, Zhigang
    [J]. 2017 INTERNATIONAL CONFERENCE ON SECURITY, PATTERN ANALYSIS, AND CYBERNETICS (SPAC), 2017, : 537 - 542
  • [8] Coordinate Ranking Regularized Non-negative Matrix Factorization
    Li, Yingming
    Yang, Ming
    Zhang, Zhongfei
    [J]. 2013 SECOND IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION (ACPR 2013), 2013, : 215 - 219
  • [9] NMFP: a non-negative matrix factorization based preselection method to increase accuracy of identifying mRNA isoforms from RNA-seq data
    Yuting Ye
    Jingyi Jessica Li
    [J]. BMC Genomics, 17
  • [10] A new gene-scoring method for uncovering novel glaucoma-related genes using non-negative matrix factorization based on RNA-seq data
    Huang, Xiaoqin
    Bajpai, Akhilesh K. K.
    Sun, Jian
    Xu, Fuyi
    Lu, Lu
    Yousefi, Siamak
    [J]. FRONTIERS IN GENETICS, 2023, 14