共 50 条
EnTSSR: A Weighted Ensemble Learning Method to Impute Single-Cell RNA Sequencing Data
被引:3
|作者:
Lu, Fan
[1
,2
]
Lin, Yilong
[1
,2
]
Yuan, Chongbin
[1
,2
]
Zhang, Xiao-Fei
[3
,4
]
Le Ou-Yang
[1
,2
]
机构:
[1] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen Key Lab Media Secur,Guangdong Key Lab In, Guangdong Lab Artificial Intelligence & Digital E, Shenzhen 518060, Peoples R China
[2] Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen 518129, Peoples R China
[3] Cent China Normal Univ, Sch Math & Stat, Wuhan 430079, Peoples R China
[4] Cent China Normal Univ, Hubei Key Lab Math Sci, Wuhan 430079, Peoples R China
基金:
中国国家自然科学基金;
关键词:
Sparse matrices;
Sequential analysis;
Data models;
RNA;
Mathematical model;
Linear programming;
Learning systems;
Single-cell RNA sequencing;
dropout events;
ensemble learning;
GENE-EXPRESSION;
MOUSE;
TRANSCRIPTOME;
D O I:
10.1109/TCBB.2021.3110850
中图分类号:
Q5 [生物化学];
学科分类号:
071010 ;
081704 ;
摘要:
The advancements of single-cell RNA sequencing (scRNA-seq) technologies have provided us unprecedented opportunities to characterize cellular states and investigate the mechanisms of complex diseases. Due to technical issues such as dropout events, scRNA-seq data contains excess of false zero counts, which has a substantial impact on the downstream analyses. Although several computational approaches have been proposed to impute dropout events in scRNA-seq data, there is no strong consensus on which is the best approach. In this study, we propose a novel weighted ensemble learning method, named EnTSSR, to impute dropout events in scRNA-seq data. By using a multi-view two-side sparse self-representation framework, our model can exploit the consensus similarities between genes and between cells based on the imputed results of various imputation methods. Moreover, we introduce a weighted ensemble strategy to leverage the information captured by various imputation methods effectively. Down-sampling experiments, clustering analysis, differential expression analysis and cell trajectory inference are carried out to evaluate the performance of our proposed model. Experiment results demonstrate that our EnTSSR can effectively recover the true expression pattern of scRNA-seq data.
引用
收藏
页码:2781 / 2787
页数:7
相关论文