scIGANs: single-cell RNA-seq imputation using generative adversarial networks

被引:87
|
作者
Xu, Yungang [1 ,8 ,9 ]
Zhang, Zhigang [2 ,3 ]
You, Lei [1 ]
Liu, Jiajia [1 ,4 ]
Fan, Zhiwei [1 ,5 ,6 ]
Zhou, Xiaobo [1 ,7 ]
机构
[1] Univ Texas Hlth Sci Ctr Houston, Ctr Computat Syst Med, Sch Biomed Informat, Houston, TX 77030 USA
[2] Hubei Univ Econ, Sch Informat Management & Stat, Wuhan 430205, Hubei, Peoples R China
[3] Hubei Univ Econ, Hubei Ctr Data & Anal, Wuhan 430205, Hubei, Peoples R China
[4] Tongji Univ, Coll Elect & Informat Engn, Shanghai 201804, Peoples R China
[5] Sichuan Univ, West China Sch Publ Hlth, Chengdu 610040, Sichuan, Peoples R China
[6] Sichuan Univ, West China Hosp 4, Chengdu 610040, Sichuan, Peoples R China
[7] Univ Texas Hlth Sci Ctr Houston, Dept Paediat Surg, McGovern Med Sch, Houston, TX 77030 USA
[8] Childrens Hosp Philadelphia, Dept Pathol & Lab Med, Philadelphia, PA 19104 USA
[9] Univ Penn, Philadelphia, PA 19104 USA
基金
美国国家卫生研究院;
关键词
HETEROGENEITY; PREDICTION;
D O I
10.1093/nar/gkaa506
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Single-cell RNA-sequencing (scRNA-seq) enables the characterization of transcriptomic profiles at the single-cell resolution with increasingly high throughput. However, it suffers from many sources of technical noises, including insufficient mRNA molecules that lead to excess false zero values, termed dropouts. Computational approaches have been proposed to recover the biologically meaningful expression by borrowing information from similar cells in the observed dataset. However, these methods suffer from oversmoothing and removal of natural cell-to-cell stochasticity in gene expression. Here, we propose the generative adversarial networks (GANs) for scRNA-seq imputation (scIGANs), which uses generated cells rather than observed cells to avoid these limitations and balances the performance between major and rare cell populations. Evaluations based on a variety of simulated and real scRNA-seq datasets show that scIGANs is effective for dropout imputation and enhances various down-stream analysis. ScIGANs is robust to small datasets that have very few genes with low expression and/or cell-to-cell variance. ScIGANs works equally well on datasets from different scRNA-seq protocols and is scalable to datasets with over 100 000 cells. We demonstrated in many ways with compelling evidence that scIGANs is not only an application of GANs in omics data but also represents a competing imputation method for the scRNA-seq data.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] scGGAN: single-cell RNA-seq imputation by graph-based generative adversarial network
    Huang, Zimo
    Wang, Jun
    Lu, Xudong
    Zain, Azlan Mohd
    Yu, Guoxian
    [J]. BRIEFINGS IN BIOINFORMATICS, 2023, 24 (02)
  • [2] Realistic in silico generation and augmentation of single-cell RNA-seq data using generative adversarial networks
    Mohamed Marouf
    Pierre Machart
    Vikas Bansal
    Christoph Kilian
    Daniel S. Magruder
    Christian F. Krebs
    Stefan Bonn
    [J]. Nature Communications, 11
  • [3] Realistic in silico generation and augmentation of single-cell RNA-seq data using generative adversarial networks
    Marouf, Mohamed
    Machart, Pierre
    Bansal, Vikas
    Kilian, Christoph
    Magruder, Daniel S.
    Krebs, Christian F.
    Bonn, Stefan
    [J]. NATURE COMMUNICATIONS, 2020, 11 (01)
  • [4] Correlation Imputation for Single-Cell RNA-seq
    Gan, Luqin
    Vinci, Giuseppe
    Allen, Genevera I.
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2022, 29 (05) : 465 - 482
  • [5] GRouNdGAN: GRN-guided simulation of single-cell RNA-seq data using causal generative adversarial networks
    Zinati, Yazdan
    Takiddeen, Abdulrahman
    Emad, Amin
    [J]. NATURE COMMUNICATIONS, 2024, 15 (01)
  • [6] SCRABBLE: single-cell RNA-seq imputation constrained by bulk RNA-seq data
    Peng, Tao
    Zhu, Qin
    Yin, Penghang
    Tan, Kai
    [J]. GENOME BIOLOGY, 2019, 20 (1)
  • [7] SCRABBLE: single-cell RNA-seq imputation constrained by bulk RNA-seq data
    Tao Peng
    Qin Zhu
    Penghang Yin
    Kai Tan
    [J]. Genome Biology, 20
  • [8] Bubble: a fast single-cell RNA-seq imputation using an autoencoder constrained by bulk RNA-seq data
    Chen, Siqi
    Yan, Xuhua
    Zheng, Ruiqing
    Li, Min
    [J]. BRIEFINGS IN BIOINFORMATICS, 2023, 24 (01)
  • [9] Evaluating imputation methods for single-cell RNA-seq data
    Yi Cheng
    Xiuli Ma
    Lang Yuan
    Zhaoguo Sun
    Pingzhang Wang
    [J]. BMC Bioinformatics, 24
  • [10] Evaluating imputation methods for single-cell RNA-seq data
    Cheng, Yi
    Ma, Xiuli
    Yuan, Lang
    Sun, Zhaoguo
    Wang, Pingzhang
    [J]. BMC BIOINFORMATICS, 2023, 24 (01)