EnTSSR: A Weighted Ensemble Learning Method to Impute Single-Cell RNA Sequencing Data

Cited by: 3
Authors
Lu, Fan [1, 2]
Lin, Yilong [1, 2]
Yuan, Chongbin [1, 2]
Zhang, Xiao-Fei [3, 4]
Ou-Yang, Le [1, 2]
Affiliations
[1] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen Key Lab Media Secur,Guangdong Key Lab In, Guangdong Lab Artificial Intelligence & Digital E, Shenzhen 518060, Peoples R China
[2] Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen 518129, Peoples R China
[3] Cent China Normal Univ, Sch Math & Stat, Wuhan 430079, Peoples R China
[4] Cent China Normal Univ, Hubei Key Lab Math Sci, Wuhan 430079, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Sparse matrices; Sequential analysis; Data models; RNA; Mathematical model; Linear programming; Learning systems; Single-cell RNA sequencing; dropout events; ensemble learning; GENE-EXPRESSION; MOUSE; TRANSCRIPTOME;
DOI
10.1109/TCBB.2021.3110850
Chinese Library Classification
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Advances in single-cell RNA sequencing (scRNA-seq) technologies have provided unprecedented opportunities to characterize cellular states and investigate the mechanisms of complex diseases. Due to technical issues such as dropout events, scRNA-seq data contain an excess of false zero counts, which substantially affects downstream analyses. Although several computational approaches have been proposed to impute dropout events in scRNA-seq data, there is no strong consensus on which approach is best. In this study, we propose a novel weighted ensemble learning method, named EnTSSR, to impute dropout events in scRNA-seq data. By using a multi-view two-side sparse self-representation framework, our model can exploit the consensus similarities between genes and between cells based on the imputed results of various imputation methods. Moreover, we introduce a weighted ensemble strategy to effectively leverage the information captured by the various imputation methods. Down-sampling experiments, clustering analysis, differential expression analysis and cell trajectory inference are carried out to evaluate the performance of the proposed model. Experimental results demonstrate that EnTSSR can effectively recover the true expression pattern of scRNA-seq data.
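As a rough illustration of what a weighted ensemble over imputation results can look like, the sketch below combines several imputed expression matrices using weights derived from each method's error on held-out entries. It is not the EnTSSR model: the abstract does not give the objective or the weighting rule, so the function names `ensemble_weights` and `weighted_ensemble`, the exponential weighting, and the simulated data are all assumptions made for illustration only.

```python
import numpy as np


def ensemble_weights(imputed, observed, mask):
    """Hypothetical weighting rule: lower held-out error gives a larger weight.

    imputed  : list of (genes x cells) arrays, one per base imputation method
    observed : (genes x cells) array holding the reference values
    mask     : boolean array marking the held-out entries used for scoring
    """
    errors = np.array(
        [np.mean((m[mask] - observed[mask]) ** 2) for m in imputed]
    )
    # Subtract the minimum error before exponentiating for numerical stability.
    scores = np.exp(-(errors - errors.min()))
    return scores / scores.sum()


def weighted_ensemble(imputed, weights):
    """Weighted average of the imputed matrices (weights sum to 1)."""
    stacked = np.stack(imputed)  # shape: (n_methods, genes, cells)
    return np.tensordot(weights, stacked, axes=1)


# Simulated data standing in for real scRNA-seq counts (an assumption).
rng = np.random.default_rng(0)
truth = rng.poisson(5.0, size=(20, 10)).astype(float)
mask = rng.random(truth.shape) < 0.2  # entries treated as held out

# Three stand-in "imputation methods" with increasing noise levels.
imputed = [truth + rng.normal(0.0, s, truth.shape) for s in (0.1, 1.0, 3.0)]

weights = ensemble_weights(imputed, truth, mask)
combined = weighted_ensemble(imputed, weights)
print(weights)
```

In this toy setup the least noisy stand-in method receives the largest weight. A real evaluation would score methods on entries masked by down-sampling actual count data rather than on simulated noise.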
Pages: 2781-2787
Page count: 7