Effective methods for bulk RNA-seq deconvolution using scnRNA-seq transcriptomes

Cited by: 13
Authors
Cobos, Francisco Avila [1, 2]
Panah, Mohammad Javad Najaf [3]
Epps, Jessica [3]
Long, Xiaochen [3, 4]
Man, Tsz-Kwong [3]
Chiu, Hua-Sheng [3]
Chomsky, Elad [5]
Kiner, Evgeny [5]
Krueger, Michael J. [3]
di Bernardo, Diego [6]
Voloch, Luis [5]
Molenaar, Jan [7]
van Hooff, Sander R. [7]
Westermann, Frank [8]
Jansky, Selina [8]
Redell, Michele L. [3]
Mestdagh, Pieter [1, 2]
Sumazin, Pavel [3]
Affiliations
[1] Univ Ghent, Dept Biomol Med, Ghent, Belgium
[2] Canc Res Inst Ghent, Ghent, Belgium
[3] Texas Childrens Hosp Canc Ctr, Baylor Coll Med, Dept Pediat, Houston, TX 77030 USA
[4] Rice Univ, Dept Stat, Houston, TX 77251 USA
[5] ImmunAi, New York, NY USA
[6] Univ Naples Federico II, Telethon Inst Genet & Med, Dept Chem Mat & Ind Engn, Via Campi Flegrei 34, I-80078 Pozzuoli, Italy
[7] Princess Maxima Ctr Pediat Oncol, Utrecht, Netherlands
[8] DKFZ, German Canc Res Ctr, Heidelberg, Germany
Keywords
SINGLE-CELL; EXPRESSION; ATLAS; LANDSCAPE; EVOLUTION; ABUNDANCE; THERAPY;
DOI
10.1186/s13059-023-03016-6
Chinese Library Classification (CLC) number
Q81 [Bioengineering (Biotechnology)]; Q93 [Microbiology]
Subject classification code
071005; 0836; 090102; 100705
Abstract
Background: RNA profiling technologies at single-cell resolutions, including single-cell and single-nuclei RNA sequencing (scRNA-seq and snRNA-seq, scnRNA-seq for short), can help characterize the composition of tissues and reveal cells that influence key functions in both healthy and disease tissues. However, the use of these technologies is operationally challenging because of high costs and stringent sample-collection requirements. Computational deconvolution methods that infer the composition of bulk-profiled samples using scnRNA-seq-characterized cell types can broaden scnRNA-seq applications, but their effectiveness remains controversial.

Results: We produced the first systematic evaluation of deconvolution methods on datasets with either known or scnRNA-seq-estimated compositions. Our analyses revealed biases that are common to scnRNA-seq 10X Genomics assays and illustrated the importance of accurate and properly controlled data preprocessing and method selection and optimization. Moreover, our results suggested that concurrent RNA-seq and scnRNA-seq profiles can help improve the accuracy of both scnRNA-seq preprocessing and the deconvolution methods that employ them. Indeed, our proposed method, Single-cell RNA Quantity Informed Deconvolution (SQUID), which combines RNA-seq transformation and dampened weighted least-squares deconvolution approaches, consistently outperformed other methods in predicting the composition of cell mixtures and tissue samples.

Conclusions: We showed that analysis of concurrent RNA-seq and scnRNA-seq profiles with SQUID can produce accurate cell-type abundance estimates and that this accuracy improvement was necessary for identifying outcomes-predictive cancer cell subclones in pediatric acute myeloid leukemia and neuroblastoma datasets. These results suggest that deconvolution accuracy improvements are vital to enabling its applications in the life sciences.
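To make the idea of reference-based deconvolution concrete, the following is a minimal sketch, not the SQUID method itself. It assumes a bulk expression vector is a non-negative linear mixture of per-cell-type reference profiles and estimates the mixing fractions with plain non-negative least squares, solved here by simple multiplicative updates. SQUID's RNA-seq transformation and dampened weighting are not reproduced. The function name `deconvolve`, the toy `signature` matrix, and the chosen fractions are all hypothetical and exist only for illustration.

```python
def deconvolve(signature, bulk, n_iter=2000, eps=1e-12):
    """Estimate non-negative cell-type fractions p with signature * p ~ bulk.

    signature: list of rows, one per gene; each row holds one reference
               expression value per cell type (all values non-negative).
    bulk:      list with one bulk expression value per gene (non-negative).
    Returns fractions normalized to sum to 1.
    """
    n_genes = len(signature)
    n_types = len(signature[0])

    # Precompute S^T b and S^T S, the only quantities the updates need.
    stb = [sum(signature[g][k] * bulk[g] for g in range(n_genes))
           for k in range(n_types)]
    sts = [[sum(signature[g][i] * signature[g][j] for g in range(n_genes))
            for j in range(n_types)]
           for i in range(n_types)]

    # Start from a uniform guess; multiplicative updates keep p non-negative.
    p = [1.0 / n_types] * n_types
    for _ in range(n_iter):
        denom = [sum(sts[k][j] * p[j] for j in range(n_types))
                 for k in range(n_types)]
        p = [p[k] * stb[k] / (denom[k] + eps) for k in range(n_types)]

    total = sum(p)
    return [v / total for v in p]


# Toy reference: 4 genes (rows) x 3 hypothetical cell types (columns).
signature = [
    [10.0, 1.0, 0.0],
    [0.0, 8.0, 2.0],
    [1.0, 0.0, 9.0],
    [5.0, 5.0, 5.0],
]
true_fractions = [0.6, 0.3, 0.1]

# Simulate a noise-free bulk sample as the weighted sum of the references.
bulk = [sum(row[k] * true_fractions[k] for k in range(3)) for row in signature]

estimated = deconvolve(signature, bulk)
print([round(v, 3) for v in estimated])
```

On this noise-free toy mixture the estimate should land close to the simulated fractions. With real data, a library solver such as `scipy.optimize.nnls` would usually be a more practical choice than hand-written updates, and accuracy depends heavily on the preprocessing and method choices that the abstract highlights.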
Pages: 22
Related papers
50 in total
  • [1] Effective methods for bulk RNA-seq deconvolution using scnRNA-seq transcriptomes
    Francisco Avila Cobos
    Mohammad Javad Najaf Panah
    Jessica Epps
    Xiaochen Long
    Tsz-Kwong Man
    Hua-Sheng Chiu
    Elad Chomsky
    Evgeny Kiner
    Michael J. Krueger
    Diego di Bernardo
    Luis Voloch
    Jan Molenaar
    Sander R. van Hooff
    Frank Westermann
    Selina Jansky
    Michele L. Redell
    Pieter Mestdagh
    Pavel Sumazin
    Genome Biology, 24
  • [2] Studying bacterial transcriptomes using RNA-seq
    Croucher, Nicholas J.
    Thomson, Nicholas R.
    CURRENT OPINION IN MICROBIOLOGY, 2010, 13 (05) : 619 - 624
  • [3] Uncovering the Complexity of Transcriptomes with RNA-Seq
    Costa, Valerio
    Angelini, Claudia
    De Feis, Italia
    Ciccodicola, Alfredo
    JOURNAL OF BIOMEDICINE AND BIOTECHNOLOGY, 2010,
  • [4] Bulk Tissue Gene Expression Deconvolution Using Single Cell RNA-Seq Data
    Wang, X.
    Li, M.
    Zhang, N.
    HUMAN HEREDITY, 2017, 83 (01) : 51 - 51
  • [5] Mapping and quantifying mammalian transcriptomes by RNA-Seq
    Mortazavi, Ali
    Williams, Brian A.
    McCue, Kenneth
    Schaeffer, Lorian
    Wold, Barbara
    NATURE METHODS, 2008, 5 (07) : 621 - 628
  • [7] Precise reconstruction of the TME using bulk RNA-seq and a machine learning algorithm trained on artificial transcriptomes
    Zaitsev, Aleksandr
    Chelushkin, Maksim
    Dyikanov, Daniiar
    Cheremushkin, Ilya
    Shpak, Boris
    Nomie, Krystle
    Zyrin, Vladimir
    Nuzhdina, Ekaterina
    Lozinsky, Yaroslav
    Zotova, Anastasia
    Degryse, Sandrine
    Kotlov, Nikita
    Baisangurov, Artur
    Shatsky, Vladimir
    Afenteva, Daria
    Kuznetsov, Alexander
    Paul, Susan Raju
    Davies, Diane L.
    Reeves, Patrick M.
    Lanuti, Michael
    Goldberg, Michael F.
    Tazearslan, Cagdas
    Chasse, Madison
    Wang, Iris
    Abdou, Mary
    Aslanian, Sharon M.
    Andrewes, Samuel
    Hsieh, James J.
    Ramachandran, Akshaya
    Lyu, Yang
    Galkin, Ilia
    Svekolkin, Viktor
    Cerchietti, Leandro
    Poznansky, Mark C.
    Ataullakhanov, Ravshan
    Fowler, Nathan
    Bagaev, Alexander
    CANCER CELL, 2022, 40 (08) : 879 - +
  • [8] SCRABBLE: single-cell RNA-seq imputation constrained by bulk RNA-seq data
    Peng, Tao
    Zhu, Qin
    Yin, Penghang
    Tan, Kai
    GENOME BIOLOGY, 2019, 20 (1)
  • [9] A Survey on Methods for Predicting Polyadenylation Sites from DNA Sequences, Bulk RNA-seq, and Single-cell RNA-seq
    Ye, Wenbin
    Lian, Qiwei
    Ye, Congting
    Wu, Xiaohui
    GENOMICS PROTEOMICS & BIOINFORMATICS, 2023, 21 (01) : 67 - 83