High-throughput single-cell RNA-seq data imputation and characterization with surrogate-assisted automated deep learning

Cited by: 9
Authors
Li, Xiangtao [1]
Li, Shaochuan [2]
Huang, Lei [3]
Zhang, Shixiong [4]
Wong, Ka-chun [5]
Affiliations
[1] Jilin Univ, Sch Artificial Intelligence, Jilin, Jilin, Peoples R China
[2] Northeast Normal Univ, Sch Informat Sci & Technol, Jilin, Jilin, Peoples R China
[3] City Univ Hong Kong, Dept Comp Sci, Hong Kong, Peoples R China
[4] Xidian Univ, Sch Comp Sci & Technol, Xian, Peoples R China
[5] City Univ Hong Kong, Hong Kong, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
GENE-EXPRESSION; REVEALS;
DOI
10.1093/bib/bbab368
Chinese Library Classification
Q5 [Biochemistry];
Discipline classification codes
071010 ; 081704 ;
Abstract
Single-cell RNA sequencing (scRNA-seq) technologies have been extensively developed to probe gene expression profiles at single-cell resolution. Deep imputation methods have been proposed to address the related computational challenges (e.g. the gene sparsity in single-cell data). In particular, the neural architectures of those deep imputation models have proven critical for performance. However, deep imputation architectures are difficult to design and tune for those without extensive knowledge of deep neural networks and scRNA-seq. Therefore, the Surrogate-assisted Evolutionary Deep Imputation Model (SEDIM) is proposed to automatically design the architectures of deep neural networks for imputing gene expression levels in scRNA-seq data without any manual tuning. Moreover, SEDIM constructs an offline surrogate model, which accelerates the architectural search. Comprehensive studies show that SEDIM significantly improves imputation and clustering performance compared with other benchmark methods. In addition, we extensively explore the performance of SEDIM in other contexts and platforms, including mass cytometry and metabolic profiling. Marker gene detection, gene ontology enrichment and pathological analysis are conducted to provide novel insights into cell-type identification and the underlying mechanisms. The source code is available at https://github.com/li-shaochuan/SEDIM.
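The abstract's idea of an offline surrogate guiding an evolutionary architecture search can be illustrated with a minimal sketch. This is not the SEDIM implementation: the search space, the toy `expensive_loss` function (a stand-in for training an imputation network), the nearest-neighbour surrogate and all names below are assumptions chosen only to show the general pattern.

```python
import random

# Hypothetical search space: (number of hidden layers, units per layer).
LAYER_CHOICES = [1, 2, 3, 4]
UNIT_CHOICES = [32, 64, 128, 256, 512]


def expensive_loss(arch):
    """Toy stand-in for training an imputation network and measuring its error."""
    layers, units = arch
    return (layers - 3) ** 2 + ((units - 128) / 64) ** 2


def build_archive(n_samples, rng):
    """Offline step: evaluate a random sample of distinct architectures once."""
    archive = {}
    while len(archive) < n_samples:
        arch = (rng.choice(LAYER_CHOICES), rng.choice(UNIT_CHOICES))
        archive[arch] = expensive_loss(arch)
    return archive


def surrogate_loss(arch, archive, k=3):
    """Predict the loss as the mean loss of the k nearest archived architectures."""
    def distance(other):
        return abs(arch[0] - other[0]) + abs(arch[1] - other[1]) / 64

    nearest = sorted(archive, key=distance)[:k]
    return sum(archive[a] for a in nearest) / len(nearest)


def mutate(arch, rng):
    """Resample either the layer count or the unit count."""
    layers, units = arch
    if rng.random() < 0.5:
        layers = rng.choice(LAYER_CHOICES)
    else:
        units = rng.choice(UNIT_CHOICES)
    return (layers, units)


def search(archive, generations=20, pop_size=6, seed=0):
    """Evolve architectures, ranking candidates with the surrogate only."""
    rng = random.Random(seed)
    population = [
        (rng.choice(LAYER_CHOICES), rng.choice(UNIT_CHOICES))
        for _ in range(pop_size)
    ]
    for _ in range(generations):
        children = [mutate(parent, rng) for parent in population]
        candidates = set(population + children)
        ranked = sorted(candidates, key=lambda a: surrogate_loss(a, archive))
        population = ranked[:pop_size]
    return population[0]


if __name__ == "__main__":
    archive = build_archive(n_samples=8, rng=random.Random(42))
    best = search(archive)
    # Only the final candidate is scored with the expensive function.
    print(best, expensive_loss(best))
```

In this sketch the expensive function is called only while building the archive and once for the final candidate; every ranking inside the evolutionary loop uses the cheap surrogate, which is where the speed-up would come from.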
Pages: 21
Related papers
50 in total
  • [1] High-throughput spatial mapping of single-cell RNA-seq data to tissue of origin
    Kaia Achim
    Jean-Baptiste Pettit
    Luis R Saraiva
    Daria Gavriouchkina
    Tomas Larsson
    Detlev Arendt
    John C Marioni
    [J]. Nature Biotechnology, 2015, 33 : 503 - 509
  • [2] deepMc: Deep Matrix Completion for Imputation of Single-Cell RNA-seq Data
    Mongia, Aanchal
    Sengupta, Debarka
    Majumdar, Angshul
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2020, 27 (07) : 1011 - 1019
  • [3] High-throughput spatial mapping of single-cell RNA-seq data to tissue of origin
    Achim, Kaia
    Pettit, Jean-Baptiste
    Saraiva, Luis R.
    Gavriouchkina, Daria
    Larsson, Tomas
    Arendt, Detlev
    Marioni, John C.
    [J]. NATURE BIOTECHNOLOGY, 2015, 33 (05) : 503 - U215
  • [4] SCRABBLE: single-cell RNA-seq imputation constrained by bulk RNA-seq data
    Peng, Tao
    Zhu, Qin
    Yin, Penghang
    Tan, Kai
    [J]. GENOME BIOLOGY, 2019, 20 (1)
  • [5] SCRABBLE: single-cell RNA-seq imputation constrained by bulk RNA-seq data
    Tao Peng
    Qin Zhu
    Penghang Yin
    Kai Tan
    [J]. Genome Biology, 20
  • [6] Deep Learning for Clustering Single-cell RNA-seq Data
    Zhu, Yuan
    Bai, Litai
    Ning, Zilin
    Fu, Wenfei
    Liu, Jie
    Jiang, Linfeng
    Fei, Shihuang
    Gong, Shiyun
    Lu, Lulu
    Deng, Minghua
    Yi, Ming
    [J]. CURRENT BIOINFORMATICS, 2024, 19 (03) : 193 - 210
  • [7] Evaluating imputation methods for single-cell RNA-seq data
    Yi Cheng
    Xiuli Ma
    Lang Yuan
    Zhaoguo Sun
    Pingzhang Wang
    [J]. BMC Bioinformatics, 24
  • [8] Evaluating imputation methods for single-cell RNA-seq data
    Cheng, Yi
    Ma, Xiuli
    Yuan, Lang
    Sun, Zhaoguo
    Wang, Pingzhang
    [J]. BMC BIOINFORMATICS, 2023, 24 (01)
  • [9] Locality Sensitive Imputation for Single-Cell RNA-Seq Data
    Moussa, Marmar
    Mandoiu, Ion I.
    [J]. BIOINFORMATICS RESEARCH AND APPLICATIONS, ISBRA 2018, 2018, 10847 : 347 - 360
  • [10] Correlation Imputation for Single-Cell RNA-seq
    Gan, Luqin
    Vinci, Giuseppe
    Allen, Genevera I.
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2022, 29 (05) : 465 - 482