HArmonized single-cell RNA-seq Cell type Assisted Deconvolution (HASCAD)

被引:0
|
作者
Chiu, Yen-Jung [1 ,2 ]
Ni, Chung-En [1 ]
Huang, Yen-Hua [1 ,3 ]
机构
[1] Natl Yang Ming Chiao Tung Univ, Inst Biomed Informat, Taipei 112, Taiwan
[2] Ming Chuan Univ, Dept Biomed Engn, Taoyuan 333, Taiwan
[3] Natl Yang Ming Chiao Tung Univ, Ctr Syst & Synthet Biol, Taipei 112, Taiwan
关键词
Harmonization; Cell composition deconvolution; RNA-seq; Deep learning; CANCER;
D O I
10.1186/s12920-023-01674-w
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
BackgroundCell composition deconvolution (CCD) is a type of bioinformatic task to estimate the cell fractions from bulk gene expression profiles, such as RNA-seq. Many CCD models were developed to perform linear regression analysis using reference gene expression signatures of distinct cell types. Reference gene expression signatures could be generated from cell-specific gene expression profiles, such as scRNA-seq. However, the batch effects and dropout events frequently observed across scRNA-seq datasets have limited the performances of CCD methods.MethodsWe developed a deep neural network (DNN) model, HASCAD, to predict the cell fractions of up to 15 immune cell types. HASCAD was trained using the bulk RNA-seq simulated from three scRNA-seq datasets that have been normalized by using a Harmony-Symphony based strategy. Mean square error and Pearson correlation coefficient were used to compare the performance of HASCAD with those of other widely used CCD methods. Two types of datasets, including a set of simulated bulk RNA-seq, and three human PBMC RNA-seq datasets, were arranged to conduct the benchmarks.ResultsHASCAD is useful for the investigation of the impacts of immune cell heterogeneity on the therapeutic effects of immune checkpoint inhibitors, since the target cell types include the ones known to play a role in anti-tumor immunity, such as three subtypes of CD8 T cells and three subtypes of CD4 T cells. We found that the removal of batch effects in the reference scRNA-seq datasets could benefit the task of CCD. Our benchmarks showed that HASCAD is more suitable for analyzing bulk RNA-seq data, compared with the two widely used CCD methods, CIBERSORTx and quanTIseq. We applied HASCAD to analyze the liver cancer samples of TCGA-LIHC, and found that there were significant associations of the predicted abundance of Treg and effector CD8 T cell with patients' overall survival.ConclusionHASCAD could predict the cell composition of the PBMC bulk RNA-seq and classify the cell type from pure bulk RNA-seq. The model of HASCAD is available at https://github.com/holiday01/HASCAD.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Distribution-Independent Cell Type Identification for Single-Cell RNA-seq Data
    Zhai, Yuyao
    Chen, Liang
    Deng, Minghua
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 6143 - 6151
  • [22] Evaluation of Cell Type Annotation R Packages on Single-cell RNA-seq Data
    Qianhui Huang
    Yu Liu
    Yuheng Du
    Lana X.Garmire
    Genomics,Proteomics & Bioinformatics, 2021, 19 (02) : 267 - 281
  • [23] Cell type diversity in scallop adductor muscles revealed by single-cell RNA-Seq
    Sun, Xiujun
    Li, Li
    Wu, Biao
    Ge, Jianlong
    Zheng, Yanxin
    Yu, Tao
    Zhou, Liqing
    Zhang, Tianshi
    Yang, Aiguo
    Liu, Zhihong
    GENOMICS, 2021, 113 (06) : 3582 - 3598
  • [24] scReClassify: post hoc cell type classification of single-cell rNA-seq data
    Taiyun Kim
    Kitty Lo
    Thomas A. Geddes
    Hani Jieun Kim
    Jean Yee Hwa Yang
    Pengyi Yang
    BMC Genomics, 20
  • [25] scReClassify: post hoc cell type classification of single-cell rNA-seq data
    Kim, Taiyun
    Lo, Kitty
    Geddes, Thomas A.
    Kim, Hani Jieun
    Yang, Jean Yee Hwa
    Yang, Pengyi
    BMC GENOMICS, 2019, 20 (Suppl 9)
  • [26] Comparative Analysis of Supervised Cell Type Detection in Single-Cell RNA-seq Data
    Vasighizaker, Akram
    Hora, Sheena
    Trivedi, Yash
    Rueda, Luis
    BIOINFORMATICS AND BIOMEDICAL ENGINEERING, PT II, 2022, : 333 - 345
  • [27] SCRABBLE: single-cell RNA-seq imputation constrained by bulk RNA-seq data
    Peng, Tao
    Zhu, Qin
    Yin, Penghang
    Tan, Kai
    GENOME BIOLOGY, 2019, 20 (1)
  • [28] Evaluation of Cell Type Annotation R Packages on Single-cell RNA-seq Data
    Qianhui Huang
    Yu Liu
    Yuheng Du
    Lana XGarmire
    Genomics,Proteomics & Bioinformatics, 2021, (02) : 267 - 281
  • [29] Application of bioinformatic tools in cell type classification for single-cell RNA-seq data
    Sujana, Shah Tania Akter
    Shahjaman, Md.
    Singha, Atul Chandra
    COMPUTATIONAL BIOLOGY AND CHEMISTRY, 2025, 115
  • [30] SCRABBLE: single-cell RNA-seq imputation constrained by bulk RNA-seq data
    Tao Peng
    Qin Zhu
    Penghang Yin
    Kai Tan
    Genome Biology, 20