EnTSSR: A Weighted Ensemble Learning Method to Impute Single-Cell RNA Sequencing Data

Cited by: 3
Authors
Lu, Fan [1, 2]
Lin, Yilong [1, 2]
Yuan, Chongbin [1, 2]
Zhang, Xiao-Fei [3, 4]
Le Ou-Yang [1, 2]
Affiliations
[1] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen Key Lab Media Secur,Guangdong Key Lab In, Guangdong Lab Artificial Intelligence & Digital E, Shenzhen 518060, Peoples R China
[2] Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen 518129, Peoples R China
[3] Cent China Normal Univ, Sch Math & Stat, Wuhan 430079, Peoples R China
[4] Cent China Normal Univ, Hubei Key Lab Math Sci, Wuhan 430079, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Sparse matrices; Sequential analysis; Data models; RNA; Mathematical model; Linear programming; Learning systems; Single-cell RNA sequencing; dropout events; ensemble learning; GENE-EXPRESSION; MOUSE; TRANSCRIPTOME;
DOI
10.1109/TCBB.2021.3110850
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
Advances in single-cell RNA sequencing (scRNA-seq) technologies have provided unprecedented opportunities to characterize cellular states and investigate the mechanisms of complex diseases. Because of technical issues such as dropout events, scRNA-seq data contain an excess of false zero counts, which substantially affects downstream analyses. Although several computational approaches have been proposed to impute dropout events in scRNA-seq data, there is no strong consensus on which approach is best. In this study, we propose a novel weighted ensemble learning method, named EnTSSR, to impute dropout events in scRNA-seq data. Using a multi-view two-side sparse self-representation framework, our model exploits the consensus similarities between genes and between cells based on the imputed results of various imputation methods. Moreover, we introduce a weighted ensemble strategy to effectively leverage the information captured by these imputation methods. Down-sampling experiments, clustering analysis, differential expression analysis, and cell trajectory inference are carried out to evaluate the performance of the proposed model. Experimental results demonstrate that EnTSSR can effectively recover the true expression pattern of scRNA-seq data.
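The weighted ensemble idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the EnTSSR algorithm: the abstract does not give the objective function or the weight update rule, so the function name `weighted_ensemble` and the fallback weighting rule (methods closer to the unweighted consensus receive larger weights) are assumptions made purely for illustration.

```python
import numpy as np


def weighted_ensemble(imputed, weights=None):
    """Combine several imputed expression matrices (genes x cells).

    `imputed` is a list of equally shaped arrays, one per imputation method.
    If `weights` is None, a hypothetical rule is used (an assumption, not the
    rule from the paper): each method is weighted by the inverse of its
    squared distance to the unweighted mean of all methods.
    """
    # Shape: (methods, genes, cells)
    stack = np.stack([np.asarray(m, dtype=float) for m in imputed])

    if weights is None:
        consensus = stack.mean(axis=0)
        err = ((stack - consensus) ** 2).sum(axis=(1, 2))
        weights = 1.0 / (err + 1e-12)  # small constant avoids division by zero

    weights = np.asarray(weights, dtype=float)
    weights = weights / weights.sum()  # normalize so the weights sum to 1

    # Weighted average over the method axis gives a (genes, cells) matrix
    combined = np.tensordot(weights, stack, axes=1)
    return combined, weights


# Toy usage with two made-up "imputation results" for 2 genes and 3 cells
method_a = np.ones((2, 3))
method_b = 3.0 * np.ones((2, 3))
combined, w = weighted_ensemble([method_a, method_b], weights=[0.25, 0.75])
```

A simple weighted average like this ignores the gene-gene and cell-cell similarity structure that the self-representation framework is described as exploiting; it only shows how per-method weights enter the combination step.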
Pages: 2781-2787
Page count: 7
Related papers
50 in total
  • [1] I-Impute: a self-consistent method to impute single cell RNA sequencing data
    Feng, Xikang
    Chen, Lingxi
    Wang, Zishuai
    Li, Shuai Cheng
    BMC GENOMICS, 2020, 21 (Suppl 10)
  • [2] I-Impute: a self-consistent method to impute single cell RNA sequencing data
    Xikang Feng
    Lingxi Chen
    Zishuai Wang
    Shuai Cheng Li
    BMC Genomics, 21
  • [3] Chord: an ensemble machine learning algorithm to identify doublets in single-cell RNA sequencing data
    Xiong, Ke-Xu
    Zhou, Han-Lin
    Lin, Cong
    Yin, Jian-Hua
    Kristiansen, Karsten
    Yang, Huan-Ming
    Li, Gui-Bo
    COMMUNICATIONS BIOLOGY, 2022, 5 (01)
  • [4] Chord: an ensemble machine learning algorithm to identify doublets in single-cell RNA sequencing data
    Ke-Xu Xiong
    Han-Lin Zhou
    Cong Lin
    Jian-Hua Yin
    Karsten Kristiansen
    Huan-Ming Yang
    Gui-Bo Li
    Communications Biology, 5
  • [5] scDEA: differential expression analysis in single-cell RNA-sequencing data via ensemble learning
    Li, Hui-Sheng
    Le Ou-Yang
    Yuan Zhu
    Hong Yan
    Zhang, Xiao-Fei
    BRIEFINGS IN BIOINFORMATICS, 2022, 23 (01)
  • [6] EnImpute: imputing dropout events in single-cell RNA-sequencing data via ensemble learning
    Zhang, Xiao-Fei
    Le Ou-Yang
    Shuo Yang
    Zhao, Xing-Ming
    Hu, Xiaohua
    Hong Yan
    BIOINFORMATICS, 2019, 35 (22) : 4827 - 4829
  • [7] PanoView: An iterative clustering method for single-cell RNA sequencing data
    Hu, Ming-Wen
    Kim, Dong Won
    Liu, Sheng
    Zack, Donald J.
    Blackshaw, Seth
    Qian, Jiang
    PLOS COMPUTATIONAL BIOLOGY, 2019, 15 (08)
  • [8] A Bayesian factorization method to recover single-cell RNA sequencing data
    Wen, Zi-Hang
    Langsam, Jeremy L.
    Zhang, Lu
    Shen, Wenjun
    Zhou, Xin
    CELL REPORTS METHODS, 2022, 2 (01)
  • [9] Evaluation of single-cell classifiers for single-cell RNA sequencing data sets
    Zhao, Xinlei
    Wu, Shuang
    Fang, Nan
    Sun, Xiao
    Fan, Jue
    BRIEFINGS IN BIOINFORMATICS, 2020, 21 (05) : 1581 - 1595
  • [10] Kernelized multiview signed graph learning for single-cell RNA sequencing data
    Abdullah Karaaslanli
    Satabdi Saha
    Tapabrata Maiti
    Selin Aviyente
    BMC Bioinformatics, 24