The importance of data transformation in RNA-Seq preprocessing for bladder cancer subtyping

Authors
Acedo-Terrades, Ariadna [1]
Perera-Bel, Julia [1]
Nonell, Lara [2]
Affiliations
[1] Hosp del Mar Res Inst HMRI, Barcelona, Spain
[2] Vall d'Hebron Inst Oncol, Bioinformat Unit, Barcelona, Spain
Keywords
Molecular subtypes; RNA sequencing; Preprocessing; Bladder cancer; Molecular taxonomy
DOI
10.1186/s13104-025-07138-x
Chinese Library Classification (CLC) number
Q [Biological Sciences];
Discipline classification codes
07; 0710; 09
Abstract
Objective: RNA-Seq provides accurate quantification of gene expression levels and is widely used for molecular subtype classification in cancer, with particular importance for prognosis. However, the reliability and validity of these analyses can be significantly influenced by how the data are processed. In this study, we evaluate how RNA-Seq preprocessing methods influence molecular subtype classification in bladder cancer. By benchmarking various aligners, quantifiers, and normalization and transformation methods, we stress the importance of preprocessing choices for accurate and consistent subtype classification.

Results: Our findings highlight that log-transformation plays a crucial role in centroid-based classifiers such as consensusMIBC and TCGAclas, while distribution-free algorithms like LundTax are robust to preprocessing variations. With consensusMIBC and TCGAclas, non-log-transformed data resulted in low classification rates and poor agreement with reference classifications. Additionally, LundTax consistently showed better separation among subtypes than consensusMIBC and TCGAclas, regardless of the preprocessing method. Nonetheless, the study is limited by the lack of a true reference for objectively assessing the accuracy of the assigned subtypes. Hence, future work will be necessary to determine the robustness and scalability of these results.
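The sensitivity of centroid-based classification to log-transformation can be illustrated with a toy example. The sketch below is not the consensusMIBC or TCGAclas implementation. It assumes a generic nearest-centroid rule that assigns a sample to the centroid with the highest Pearson correlation, with centroids defined on a log2 scale. The subtype names, the five-gene centroids, the sample values, and the helper functions `pearson` and `classify` are all hypothetical and chosen only for illustration.

```python
import math


def pearson(x, y):
    """Pearson correlation between two equal-length numeric sequences."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    dev_x = [a - mean_x for a in x]
    dev_y = [b - mean_y for b in y]
    numerator = sum(a * b for a, b in zip(dev_x, dev_y))
    denominator = math.sqrt(sum(a * a for a in dev_x) * sum(b * b for b in dev_y))
    return numerator / denominator


def classify(sample, centroids):
    """Return the best-correlated centroid label and all correlation scores."""
    scores = {name: pearson(sample, centroid) for name, centroid in centroids.items()}
    return max(scores, key=scores.get), scores


# Hypothetical centroids on a log2 scale (five made-up genes).
CENTROIDS = {
    "subtype_A": [8.0, 8.0, 8.0, 2.0, 9.0],
    "subtype_B": [2.0, 2.0, 2.0, 8.0, 10.0],
}

# Hypothetical sample on the raw (linear) scale; one gene dominates the range.
raw_sample = [255.0, 255.0, 255.0, 3.0, 4095.0]

# The same sample after a log2(x + 1) transformation.
log_sample = [math.log2(value + 1.0) for value in raw_sample]

raw_label, raw_scores = classify(raw_sample, CENTROIDS)
log_label, log_scores = classify(log_sample, CENTROIDS)

print("raw scale:", raw_label, {k: round(v, 3) for k, v in raw_scores.items()})
print("log scale:", log_label, {k: round(v, 3) for k, v in log_scores.items()})
```

In this toy setup, the raw-scale sample is assigned to subtype_B because a single highly expressed gene dominates the correlation. After the log2(x + 1) transformation, the same sample matches subtype_A with a much higher correlation. This is one plausible mechanism behind the low agreement reported for non-log-transformed input. Rank-based or otherwise distribution-free rules would not be affected by a monotonic transformation in the same way.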
Pages: 8