scIGANs: single-cell RNA-seq imputation using generative adversarial networks

Cited by: 87
Authors
Xu, Yungang [1, 8, 9]
Zhang, Zhigang [2, 3]
You, Lei [1]
Liu, Jiajia [1, 4]
Fan, Zhiwei [1, 5, 6]
Zhou, Xiaobo [1, 7]
Affiliations
[1] Univ Texas Hlth Sci Ctr Houston, Ctr Computat Syst Med, Sch Biomed Informat, Houston, TX 77030 USA
[2] Hubei Univ Econ, Sch Informat Management & Stat, Wuhan 430205, Hubei, Peoples R China
[3] Hubei Univ Econ, Hubei Ctr Data & Anal, Wuhan 430205, Hubei, Peoples R China
[4] Tongji Univ, Coll Elect & Informat Engn, Shanghai 201804, Peoples R China
[5] Sichuan Univ, West China Sch Publ Hlth, Chengdu 610040, Sichuan, Peoples R China
[6] Sichuan Univ, West China Hosp 4, Chengdu 610040, Sichuan, Peoples R China
[7] Univ Texas Hlth Sci Ctr Houston, Dept Paediat Surg, McGovern Med Sch, Houston, TX 77030 USA
[8] Childrens Hosp Philadelphia, Dept Pathol & Lab Med, Philadelphia, PA 19104 USA
[9] Univ Penn, Philadelphia, PA 19104 USA
Funding
National Institutes of Health (USA)
Keywords
HETEROGENEITY; PREDICTION
DOI
10.1093/nar/gkaa506
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Single-cell RNA-sequencing (scRNA-seq) enables the characterization of transcriptomic profiles at single-cell resolution with increasingly high throughput. However, it suffers from many sources of technical noise, including insufficient mRNA molecules that lead to excess false zero values, termed dropouts. Computational approaches have been proposed to recover the biologically meaningful expression by borrowing information from similar cells in the observed dataset. However, these methods suffer from oversmoothing and remove the natural cell-to-cell stochasticity in gene expression. Here, we propose scIGANs, a method based on generative adversarial networks (GANs) for scRNA-seq imputation, which uses generated cells rather than observed cells to avoid these limitations and balances the performance between major and rare cell populations. Evaluations based on a variety of simulated and real scRNA-seq datasets show that scIGANs is effective for dropout imputation and enhances various downstream analyses. ScIGANs is robust to small datasets that have very few genes with low expression and/or cell-to-cell variance. ScIGANs works equally well on datasets from different scRNA-seq protocols and is scalable to datasets with over 100,000 cells. Together, these results show that scIGANs is not only an application of GANs to omics data but also a competitive imputation method for scRNA-seq data.
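The abstract does not say how the generated cells are used to fill dropouts. As a minimal sketch of one plausible reading, the Python below replaces each zero entry in an observed cell with the mean of that gene across the cell's k nearest generated cells, and leaves non-zero entries untouched. The function name `impute_from_generated`, the parameter `k`, the Euclidean distance and the toy matrices are all assumptions made for illustration. The `generated` list stands in for the output of a trained GAN generator, which is not implemented here.

```python
import math


def impute_from_generated(observed, generated, k=5):
    """Fill zero entries of each observed cell from its k nearest generated cells.

    Hypothetical helper: `observed` and `generated` are lists of cells, each a
    list of per-gene expression values. `generated` is assumed to come from an
    already-trained generator (not shown).
    """
    imputed = []
    for cell in observed:
        # Rank generated cells by Euclidean distance to this observed cell.
        ranked = sorted(generated, key=lambda g: math.dist(cell, g))
        neighbors = ranked[:k]
        new_cell = []
        for j, value in enumerate(cell):
            if value == 0:
                # Treat the zero as a candidate dropout: use the neighbor mean.
                new_cell.append(sum(g[j] for g in neighbors) / len(neighbors))
            else:
                # Observed non-zero expression is kept as-is.
                new_cell.append(value)
        imputed.append(new_cell)
    return imputed


# Toy data: 2 observed cells and 4 "generated" cells over 3 genes.
observed = [[5.0, 0.0, 3.0],
            [0.0, 2.0, 0.0]]
generated = [[5.0, 1.0, 3.0],
             [5.0, 3.0, 3.0],
             [1.0, 2.0, 4.0],
             [3.0, 2.0, 6.0]]

imputed = impute_from_generated(observed, generated, k=2)
```

Because the neighbors are drawn from generated rather than observed cells, no observed cell is averaged into another. This is the property the abstract credits with avoiding oversmoothing.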
Pages: 16