EnTSSR: A Weighted Ensemble Learning Method to Impute Single-Cell RNA Sequencing Data

被引:3
|
作者
Lu, Fan [1 ,2 ]
Lin, Yilong [1 ,2 ]
Yuan, Chongbin [1 ,2 ]
Zhang, Xiao-Fei [3 ,4 ]
Le Ou-Yang [1 ,2 ]
机构
[1] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen Key Lab Media Secur,Guangdong Key Lab In, Guangdong Lab Artificial Intelligence & Digital E, Shenzhen 518060, Peoples R China
[2] Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen 518129, Peoples R China
[3] Cent China Normal Univ, Sch Math & Stat, Wuhan 430079, Peoples R China
[4] Cent China Normal Univ, Hubei Key Lab Math Sci, Wuhan 430079, Peoples R China
基金
中国国家自然科学基金;
关键词
Sparse matrices; Sequential analysis; Data models; RNA; Mathematical model; Linear programming; Learning systems; Single-cell RNA sequencing; dropout events; ensemble learning; GENE-EXPRESSION; MOUSE; TRANSCRIPTOME;
D O I
10.1109/TCBB.2021.3110850
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The advancements of single-cell RNA sequencing (scRNA-seq) technologies have provided us unprecedented opportunities to characterize cellular states and investigate the mechanisms of complex diseases. Due to technical issues such as dropout events, scRNA-seq data contains excess of false zero counts, which has a substantial impact on the downstream analyses. Although several computational approaches have been proposed to impute dropout events in scRNA-seq data, there is no strong consensus on which is the best approach. In this study, we propose a novel weighted ensemble learning method, named EnTSSR, to impute dropout events in scRNA-seq data. By using a multi-view two-side sparse self-representation framework, our model can exploit the consensus similarities between genes and between cells based on the imputed results of various imputation methods. Moreover, we introduce a weighted ensemble strategy to leverage the information captured by various imputation methods effectively. Down-sampling experiments, clustering analysis, differential expression analysis and cell trajectory inference are carried out to evaluate the performance of our proposed model. Experiment results demonstrate that our EnTSSR can effectively recover the true expression pattern of scRNA-seq data.
引用
收藏
页码:2781 / 2787
页数:7
相关论文
共 50 条
  • [41] An Introduction to the Analysis of Single-Cell RNA-Sequencing Data
    AlJanahi, Aisha A.
    Danielsen, Mark
    Dunbar, Cynthia E.
    MOLECULAR THERAPY-METHODS & CLINICAL DEVELOPMENT, 2018, 10 : 189 - 196
  • [42] MISC: missing imputation for single-cell RNA sequencing data
    Yang, Mary Qu
    Weissman, Sherman M.
    Yang, William
    Zhang, Jialing
    Canaann, Allon
    Guan, Renchu
    BMC SYSTEMS BIOLOGY, 2018, 12
  • [43] SNV identification from single-cell RNA sequencing data
    Schnepp, Patricia M.
    Chen, Mengjie
    Keller, Evan T.
    Zhou, Xiang
    HUMAN MOLECULAR GENETICS, 2019, 28 (21) : 3569 - 3583
  • [44] Normalizing single-cell RNA sequencing data: Challenges and opportunities
    Vallejos C.A.
    Risso D.
    Scialdone A.
    Dudoit S.
    Marioni J.C.
    Nature Methods, 2017, 14 (6) : 565 - 571
  • [45] Analysis of single-cell RNA sequencing data based on autoencoders
    Andrea Tangherloni
    Federico Ricciuti
    Daniela Besozzi
    Pietro Liò
    Ana Cvejic
    BMC Bioinformatics, 22
  • [46] The shaky foundations of simulating single-cell RNA sequencing data
    Crowell, Helena L.
    Leonardo, Sarah X. Morillo X.
    Soneson, Charlotte
    Robinson, Mark D.
    GENOME BIOLOGY, 2023, 24 (01)
  • [47] SCRIP: an accurate simulator for single-cell RNA sequencing data
    Qin, Fei
    Luo, Xizhi
    Xiao, Feifei
    Cai, Guoshuai
    BIOINFORMATICS, 2022, 38 (05) : 1304 - 1311
  • [48] Analysis of single-cell RNA sequencing data based on autoencoders
    Tangherloni, Andrea
    Ricciuti, Federico
    Besozzi, Daniela
    Lio, Pietro
    Cvejic, Ana
    BMC BIOINFORMATICS, 2021, 22 (01)
  • [49] Normalizing single-cell RNA sequencing data: challenges and opportunities
    Vallejos, Catalina A.
    Risso, Davide
    Scialdone, Antonio
    Dudoit, Sandrine
    Marioni, John C.
    NATURE METHODS, 2017, 14 (06) : 565 - 571
  • [50] scDetect: a rank-based ensemble learning algorithm for cell type identification of single-cell RNA sequencing in cancer
    Shen, Yifei
    Chu, Qinjie
    Timko, Michael P.
    Fan, Longjiang
    BIOINFORMATICS, 2021, 37 (22) : 4115 - 4122