A comprehensive workflow for optimizing RNA-seq data analysis

被引:1
|
作者
Jiang, Gao [1 ]
Zheng, Juan-Yu [1 ]
Ren, Shu-Ning [2 ]
Yin, Weilun [2 ]
Xia, Xinli [2 ]
Li, Yun [1 ]
Wang, Hou-Ling [2 ]
机构
[1] Beijing Forestry Univ, Sch Artificial Intelligence, Sch Informat Sci & Technol, Beijing 100083, Peoples R China
[2] Beijing Forestry Univ, Coll Biol Sci & Technol, Natl Engn Res Ctr Tree Breeding & Ecol Restorat, State Key Lab Tree Genet & Breeding, Beijing 100083, Peoples R China
来源
BMC GENOMICS | 2024年 / 25卷 / 01期
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
RNA-seq data; Differential gene analysis; Software comparison; DIFFERENTIAL EXPRESSION; ALIGNMENT; PROGRAM; HISAT;
D O I
10.1186/s12864-024-10414-y
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background Current RNA-seq analysis software for RNA-seq data tends to use similar parameters across different species without considering species-specific differences. However, the suitability and accuracy of these tools may vary when analyzing data from different species, such as humans, animals, plants, fungi, and bacteria. For most laboratory researchers lacking a background in information science, determining how to construct an analysis workflow that meets their specific needs from the array of complex analytical tools available poses a significant challenge.Results By utilizing RNA-seq data from plants, animals, and fungi, it was observed that different analytical tools demonstrate some variations in performance when applied to different species. A comprehensive experiment was conducted specifically for analyzing plant pathogenic fungal data, focusing on differential gene analysis as the ultimate goal. In this study, 288 pipelines using different tools were applied to analyze five fungal RNA-seq datasets, and the performance of their results was evaluated based on simulation. This led to the establishment of a relatively universal and superior fungal RNA-seq analysis pipeline that can serve as a reference, and certain standards for selecting analysis tools were derived for reference. Additionally, we compared various tools for alternative splicing analysis. The results based on simulated data indicated that rMATS remained the optimal choice, although consideration could be given to supplementing with tools such as SpliceWiz.Conclusion The experimental results demonstrate that, in comparison to the default software parameter configurations, the analysis combination results after tuning can provide more accurate biological insights. It is beneficial to carefully select suitable analysis software based on the data, rather than indiscriminately choosing tools, in order to achieve high-quality analysis results more efficiently.
引用
下载
收藏
页数:21
相关论文
共 50 条
  • [31] RNA-Seq UD: A bioinformatics plattform for RNA-Seq analysis
    Ramirez, Miguel
    Alejandro Rojas-Quintero, Cristian
    Enrique Vera-Parra, Nelson
    2015 10TH IBERIAN CONFERENCE ON INFORMATION SYSTEMS AND TECHNOLOGIES (CISTI), 2015,
  • [32] Analysis of a Single Cell RNA-seq Workflow by Random Matrix Theory MethodsAnalysis of a Single Cell RNA-seq Workflow by Random Matrix...S. Leviyang
    Sivan Leviyang
    Bulletin of Mathematical Biology, 2025, 87 (1)
  • [33] Erratum to: Comprehensive evaluation of differential gene expression analysis methods for RNA-seq data
    Franck Rapaport
    Raya Khanin
    Yupu Liang
    Mono Pirun
    Azra Krek
    Paul Zumbo
    Christopher E. Mason
    Nicholas D. Socci
    Doron Betel
    Genome Biology, 16
  • [34] Comprehensive RNA-Seq Data Analysis Identifies Key mRNAs and lncRNAs in Atrial Fibrillation
    Wu, Dong-Mei
    Zhou, Zheng-Kun
    Fan, Shao-Hua
    Zheng, Zi-Hui
    Wen, Xin
    Han, Xin-Rui
    Wang, Shan
    Wang, Yong-Jian
    Zhang, Zi-Feng
    Shan, Qun
    Li, Meng-Qiu
    Hu, Bin
    Lu, Jun
    Chen, Gui-Quan
    Hong, Xiao-Wu
    Zheng, Yuan-Lin
    FRONTIERS IN GENETICS, 2019, 10
  • [35] The application of RNA-seq to the comprehensive analysis of plant mitochondrial transcriptomes
    Stone, James D.
    Storchova, Helena
    MOLECULAR GENETICS AND GENOMICS, 2015, 290 (01) : 1 - 9
  • [36] The application of RNA-seq to the comprehensive analysis of plant mitochondrial transcriptomes
    James D. Stone
    Helena Storchova
    Molecular Genetics and Genomics, 2015, 290 : 1 - 9
  • [37] Statistical Issues in the Analysis of ChIP-Seq and RNA-Seq Data
    Ghosh, Debashis
    Qin, Zhaohui S.
    GENES, 2010, 1 (02) : 317 - 334
  • [38] OneStopRNAseq: A Web Application for Comprehensive and Efficient Analyses of RNA-Seq Data
    Li, Rui
    Hu, Kai
    Liu, Haibo
    Green, Michael R.
    Zhu, Lihua Julie
    GENES, 2020, 11 (10) : 1 - 14
  • [39] Oqtans: a multifunctional workbench for RNA-seq data analysis
    Sreedharan, Vipin T.
    Schultheiss, Sebastian J.
    Jean, Geraldine
    Kahles, Andre
    Bohnert, Regina
    Drewe, Philipp
    Mudrakarta, Pramod
    Goernitz, Nico
    Zeller, Georg
    Raetsch, Gunnar
    BMC BIOINFORMATICS, 2014, 15
  • [40] CellHeap: A Workflow for Optimizing COVID-19 Single-Cell RNA-Seq Data Processing in the Santos Dumont Supercomputer
    Silva, Vanessa S.
    Costa, Maiana O. C.
    Castro, Maria Clicia S.
    Silva, Helena S.
    Walter, Maria Emilia M. T.
    Melo, Alba C. M. A.
    Ocana, Kary A. C.
    dos Santos, Marcelo T.
    Nicolas, Marisa F.
    Carvalho, Anna Cristina C.
    Henriques-Pons, Andrea
    Silva, Fabricio A. B.
    ADVANCES IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, BSB 2021, 2021, 13063 : 41 - 52