Deconvolution from bulk gene expression by leveraging sample-wise and gene-wise similarities and single-cell RNA-Seq data

被引:1
|
作者
Wang, Chenqi [1 ]
Lin, Yifan [1 ]
Li, Shuchao [1 ]
Guan, Jinting [1 ,2 ,3 ]
机构
[1] Xiamen Univ, Dept Automat, Xiamen, Peoples R China
[2] Minist Educ, Key Lab Syst Control & Informat Proc, Shanghai, Peoples R China
[3] Xiamen Univ, Natl Inst Data Sci Hlth & Med, Xiamen, Peoples R China
来源
BMC GENOMICS | 2024年 / 25卷 / 01期
关键词
Deconvolution; Cell type abundance; Cell type-specific gene expression profile; Similarity matrix; Single-cell RNA-seq data; MOUSE; MAP; NORMALIZATION; HETEROGENEITY; DIVERSITY; ATLAS; STEM;
D O I
10.1186/s12864-024-10728-x
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
BackgroundThe widely adopted bulk RNA-seq measures the gene expression average of cells, masking cell type heterogeneity, which confounds downstream analyses. Therefore, identifying the cellular composition and cell type-specific gene expression profiles (GEPs) facilitates the study of the underlying mechanisms of various biological processes. Although single-cell RNA-seq focuses on cell type heterogeneity in gene expression, it requires specialized and expensive resources and currently is not practical for a large number of samples or a routine clinical setting. Recently, computational deconvolution methodologies have been developed, while many of them only estimate cell type composition or cell type-specific GEPs by requiring the other as input. The development of more accurate deconvolution methods to infer cell type abundance and cell type-specific GEPs is still essential.ResultsWe propose a new deconvolution algorithm, DSSC, which infers cell type-specific gene expression and cell type proportions of heterogeneous samples simultaneously by leveraging gene-gene and sample-sample similarities in bulk expression and single-cell RNA-seq data. Through comparisons with the other existing methods, we demonstrate that DSSC is effective in inferring both cell type proportions and cell type-specific GEPs across simulated pseudo-bulk data (including intra-dataset and inter-dataset simulations) and experimental bulk data (including mixture data and real experimental data). DSSC shows robustness to the change of marker gene number and sample size and also has cost and time efficiencies.ConclusionsDSSC provides a practical and promising alternative to the experimental techniques to characterize cellular composition and heterogeneity in the gene expression of heterogeneous samples.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] SCDC: bulk gene expression deconvolution by multiple single-cell RNA sequencing references
    Dong, Meichen
    Thennavan, Aatish
    Urrutia, Eugene
    Li, Yun
    Perou, Charles M.
    Zou, Fei
    Jiang, Yuchao
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (01) : 416 - 427
  • [32] Polar Gini Curve: A Technique to Discover Gene Expression Spatial Patterns from Single-cell RNA-seq Data
    Thanh Minh Nguyen
    Jacob John Jeevan
    Nuo Xu
    Jake Y.Chen
    Genomics,Proteomics & Bioinformatics, 2021, (03) : 493 - 503
  • [33] Semi-deconvolution of bulk and single-cell RNA-seq data with application to metastatic progression in breast cancer
    Lei, Haoyun
    Guo, Xiaoyan A.
    Tao, Yifeng
    Ding, Kai
    Fu, Xuecong
    Oesterreich, Steffi
    Lee, Adrian, V
    Schwartz, Russell
    BIOINFORMATICS, 2022, 38 (SUPPL 1) : 386 - 394
  • [34] Polar Gini Curve: A Technique to Discover Gene Expression Spatial Patterns from Single-cell RNA-seq Data
    Thanh Minh Nguyen
    Jeevan, Jacob John
    Xu, Nuo
    Chen, Jake Y.
    GENOMICS PROTEOMICS & BIOINFORMATICS, 2021, 19 (03) : 493 - 503
  • [35] Deciphering Tumour Microenvironment of Liver Cancer through Deconvolution of Bulk RNA-Seq Data with Single-Cell Atlas
    Zhang, Shaoshi
    Bacon, Wendi
    Peppelenbosch, Maikel P.
    van Kemenade, Folkert
    Stubbs, Andrew Peter
    CANCERS, 2023, 15 (01)
  • [36] Bubble: a fast single-cell RNA-seq imputation using an autoencoder constrained by bulk RNA-seq data
    Chen, Siqi
    Yan, Xuhua
    Zheng, Ruiqing
    Li, Min
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (01)
  • [37] Distortion-free PCA on sample space for highly variable gene detection from single-cell RNA-seq data
    MATSUDA Momo
    FUTAMURA Yasunori
    YE Xiucai
    SAKURAI Tetsuya
    Frontiers of Computer Science, 2023, 17 (01)
  • [38] Distortion-free PCA on sample space for highly variable gene detection from single-cell RNA-seq data
    Matsuda, Momo
    Futamura, Yasunori
    Ye, Xiucai
    Sakurai, Tetsuya
    FRONTIERS OF COMPUTER SCIENCE, 2023, 17 (01)
  • [39] Distortion-free PCA on sample space for highly variable gene detection from single-cell RNA-seq data
    Momo Matsuda
    Yasunori Futamura
    Xiucai Ye
    Tetsuya Sakurai
    Frontiers of Computer Science, 2023, 17
  • [40] Protocols for single-cell RNA-seq and spatial gene expression integration and interactive visualization
    Sona, Surbhi
    Bradley, Matthew
    Ting, Angela H.
    STAR PROTOCOLS, 2023, 4 (01):