Accurate Estimation of Gene Expression Levels from DGE Sequencing Data

被引:0
|
作者
Nicolae, Marius [1 ]
Mandoiu, Ion [1 ]
机构
[1] Univ Connecticut, Dept Comp Sci & Engn, Storrs, CT 06269 USA
来源
关键词
SEQ; GENOME;
D O I
暂无
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Two main transcriptome sequencing protocols have been proposed in the literature: the most commonly used shotgun sequencing of full length mRNAs (RNA-Seq) and 3'-tag digital gene expression (DOE). In this paper we present a novel expectation-maximization algorithm, called DOE-EM, for inference of gene-specific expression levels from DOE tags. Unlike previous methods, our algorithm takes into account alternative splicing isoforms and tags that map at multiple locations in the genome, and corrects for incomplete digestion and sequencing errors. The open source Java/Scala implementation of the DOE-EM algorithm is freely available at http://dna.engr.uconn.edu/software/DGE-EM/. Experimental results on real DOE data generated from reference RNA samples show that our algorithm outperforms commonly used estimation methods based on unique tag counting. Furthermore, the accuracy of DOE-EM estimates is comparable to that obtained by state-of-the-art estimation algorithms from RNA-Seq data for the same samples. Results of a comprehensive simulation study assessing the effect of various experimental parameters suggest that further improvements in estimation accuracy could be achieved by optimizing DGE protocol parameters such as the anchoring enzymes and digestion time.
引用
收藏
页码:392 / 403
页数:12
相关论文
共 50 条
  • [1] Accurate estimation of cell-type composition from gene expression data
    Tsoucas, Daphne
    Dong, Rui
    Chen, Haide
    Zhu, Qian
    Guo, Guoji
    Yuan, Guo-Cheng
    NATURE COMMUNICATIONS, 2019, 10 (1)
  • [2] Accurate estimation of cell-type composition from gene expression data
    Daphne Tsoucas
    Rui Dong
    Haide Chen
    Qian Zhu
    Guoji Guo
    Guo-Cheng Yuan
    Nature Communications, 10
  • [3] Accurate pan-cancer tumor purity estimation from gene expression data
    Revkov, Egor
    Sung, Ken W. -K.
    Skanderup, Anders J.
    CANCER RESEARCH, 2022, 82 (12)
  • [4] Low Dimensionality in Gene Expression Data Enables the Accurate Extraction of Transcriptional Programs from Shallow Sequencing
    Heimberg, Graham
    Bhatnagar, Rajat
    El-Samad, Hana
    Thomson, Matt
    CELL SYSTEMS, 2016, 2 (04) : 239 - 250
  • [5] PUREE: accurate pan-cancer tumor purity estimation from gene expression data
    Revkov, Egor
    Kulshrestha, Tanmay
    Sung, Ken Wing-Kin
    Skanderup, Anders Jacobsen
    COMMUNICATIONS BIOLOGY, 2023, 6 (01)
  • [6] PUREE: accurate pan-cancer tumor purity estimation from gene expression data
    Egor Revkov
    Tanmay Kulshrestha
    Ken Wing-Kin Sung
    Anders Jacobsen Skanderup
    Communications Biology, 6
  • [7] ModEst: Accurate estimation of genome size from next generation sequencing data
    Pfenninger, Markus
    Schoennenbeck, Philipp
    Schell, Tilman
    MOLECULAR ECOLOGY RESOURCES, 2022, 22 (04) : 1454 - 1464
  • [8] aFold – using polynomial uncertainty modelling for differential gene expression estimation from RNA sequencing data
    Wentao Yang
    Philip Rosenstiel
    Hinrich Schulenburg
    BMC Genomics, 20
  • [9] aFold - using polynomial uncertainty modelling for differential gene expression estimation from RNA sequencing data
    Yang, Wentao
    Rosenstiel, Philip
    Schulenburg, Hinrich
    BMC GENOMICS, 2019, 20 (1)
  • [10] Gene2DGE: A Perl Package for Gene Model Renewal with Digital Gene Expression Data
    Xiaoli Tang1#
    2Institute of Translational Medicine
    3Beijing Institute of Genomics
    Genomics, Proteomics & Bioinformatics, 2012, (01) : 51 - 54