Dimensionality Reduction of RNA-Seq Data

Author
Al-Turaiki, Isra [1]
Affiliation
[1] King Saud Univ, Coll Comp & Informat Sci, Informat Technol Dept, Riyadh, Saudi Arabia
Keywords
Principal Component Analysis (PCA); Singular Value Decomposition (SVD); Self-Organizing Maps (SOM); RNA-Seq; Dimensionality Reduction
DOI
10.22937/IJCSNS.2021.21.3.4
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
RNA sequencing (RNA-Seq) is a technology that facilitates transcriptome analysis using next-generation sequencing (NGS) tools. Information on the quantity and sequences of RNA is vital for relating the genome to functional protein expression. RNA-Seq data are high-dimensional: the number of variables (i.e., transcripts) far exceeds the number of observations (e.g., experiments). Given the wide range of dimensionality reduction techniques, it is not clear which is best for RNA-Seq data analysis. In this paper, we study how three dimensionality reduction techniques affect the classification of an RNA-Seq dataset. In particular, we use PCA, SVD, and SOM to obtain a reduced feature space. We built nine classification models for a cancer dataset and compared their performance. Our experimental results indicate that better classification performance is obtained with PCA and SOM. Overall, the combinations PCA+KNN, SOM+RF, and SOM+KNN yield the preferred results.
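As a rough illustration of the PCA+KNN combination named in the abstract, the sketch below reduces a synthetic "samples x transcripts" matrix with PCA (computed through an SVD of the centered data) and then classifies the reduced samples with k-nearest neighbours. Everything specific here is an assumption made for illustration: the record does not give the paper's dataset, preprocessing, number of components, or value of k, so the data are randomly generated and the helper names (`pca_fit`, `pca_transform`, `knn_predict`) are hypothetical. SOM and RF are not shown.

```python
import numpy as np


def pca_fit(X, n_components):
    """Learn the feature means and top principal axes from training data."""
    mean = X.mean(axis=0)
    # SVD of the centered matrix: rows of Vt are the principal axes,
    # ordered by decreasing singular value.
    _, _, Vt = np.linalg.svd(X - mean, full_matrices=False)
    return mean, Vt[:n_components]


def pca_transform(X, mean, components):
    """Project samples onto the learned principal axes."""
    return (X - mean) @ components.T


def knn_predict(Z_train, y_train, Z_test, k=3):
    """Majority vote among the k nearest training samples (Euclidean)."""
    dists = np.linalg.norm(Z_test[:, None, :] - Z_train[None, :, :], axis=2)
    nearest = np.argsort(dists, axis=1)[:, :k]
    votes = y_train[nearest]
    return np.array([np.bincount(row).argmax() for row in votes])


# Synthetic stand-in for an RNA-Seq matrix (assumption, not the paper's data):
# 60 samples, 2000 "transcripts", so features far outnumber observations.
rng = np.random.default_rng(0)
n_per_class, n_genes = 30, 2000
X0 = rng.normal(0.0, 1.0, (n_per_class, n_genes))
X1 = rng.normal(0.0, 1.0, (n_per_class, n_genes))
X1[:, :50] += 3.0  # class 1 differs in the first 50 features only
X = np.vstack([X0, X1])
y = np.array([0] * n_per_class + [1] * n_per_class)

# Hold out a third of the samples; fit PCA on the training split only.
idx = rng.permutation(len(y))
train, test = idx[:40], idx[40:]
mean, components = pca_fit(X[train], n_components=2)
Z_train = pca_transform(X[train], mean, components)
Z_test = pca_transform(X[test], mean, components)

y_pred = knn_predict(Z_train, y[train], Z_test, k=3)
accuracy = float((y_pred == y[test]).mean())
print(f"reduced shape: {Z_train.shape}, held-out accuracy: {accuracy:.2f}")
```

Fitting the projection on the training split alone keeps the held-out samples from influencing the reduced feature space, which matters when the number of transcripts is much larger than the number of samples.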
Pages: 31-36
Page count: 6