A survey on identification and quantification of alternative polyadenylation sites from RNA-seq data

被引:25
|
作者
Chen, Moliang [1 ]
Ji, Guoli [1 ,2 ]
Fu, Hongjuan [1 ]
Lin, Qianmin [3 ]
Ye, Congting [4 ]
Ye, Wenbin [1 ]
Su, Yaru [5 ]
Wu, Xiaohui [1 ,6 ]
机构
[1] Xiamen Univ, Dept Automat, Xiamen 361005, Fujian, Peoples R China
[2] Xiamen Res Inst, Xiamen, Peoples R China
[3] Xiamen Univ, Xiangan Hosp, Xiamen, Peoples R China
[4] Xiamen Univ, Coll Environm & Ecol, Xiamen, Peoples R China
[5] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou, Peoples R China
[6] Xiamen Res Inst, Natl Ctr Healthcare Big Data, Xiamen, Peoples R China
基金
中国国家自然科学基金;
关键词
alternative polyadenylation; RNA-seq; 3 ' untranslated region; benchmark; predictive modeling; 3' UNTRANSLATED REGIONS; CHANGE-POINT MODEL; GENE-EXPRESSION; MESSENGER-RNAS; POLY(A) SITES; CLEAVAGE; REVEALS; WIDESPREAD; MECHANISMS; DYNAMICS;
D O I
10.1093/bib/bbz068
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Alternative polyadenylation (APA) has been implicated to play an important role in post-transcriptional regulation by regulating mRNA abundance, stability, localization and translation, which contributes considerably to transcriptome diversity and gene expression regulation. RNA-seq has become a routine approach for transcriptome profiling, generating unprecedented data that could be used to identify and quantify APA site usage. A number of computational approaches for identifying APA sites and/or dynamic APA events from RNA-seq data have emerged in the literature, which provide valuable yet preliminary results that should be refined to yield credible guidelines for the scientific community. In this review, we provided a comprehensive overview of the status of currently available computational approaches. We also conducted objective benchmarking analysis using RNA-seq data sets from different species (human, mouse and Arabidopsis) and simulated data sets to present a systematic evaluation of 11 representative methods. Our benchmarking study showed that the overall performance of all tools investigated is moderate, reflecting that there is still lot of scope to improve the prediction of APA site or dynamic APA events from RNA-seq data. Particularly, prediction results from individual tools differ considerably, and only a limited number of predicted APA sites or genes are common among different tools. Accordingly, we attempted to give some advice on how to assess the reliability of the obtained results. We also proposed practical recommendations on the appropriate method applicable to diverse scenarios and discussed implications and future directions relevant to profiling APA from RNA-seq data.
引用
收藏
页码:1261 / 1276
页数:16
相关论文
共 50 条
  • [31] Transcriptome assembly and quantification from Ion Torrent RNA-Seq data
    Mangul, Serghei
    Caciula, Adrian
    Al Seesi, Sahar
    Brinza, Dumitru
    Mondoiu, Ion
    Zelikovsky, Alex
    BMC GENOMICS, 2014, 15
  • [32] Quantification of co-transcriptional splicing from RNA-Seq data
    Herzel, Lydia
    Neugebauer, Karla M.
    METHODS, 2015, 85 : 36 - 43
  • [33] Transcriptome assembly and quantification from Ion Torrent RNA-Seq data
    Serghei Mangul
    Adrian Caciula
    Sahar Al Seesi
    Dumitru Brinza
    Ion Mӑndoiu
    Alex Zelikovsky
    BMC Genomics, 15
  • [34] Reliable Identification of Genomic Variants from RNA-Seq Data
    Piskol, Robert
    Ramaswami, Gokul
    Li, Jin Billy
    AMERICAN JOURNAL OF HUMAN GENETICS, 2013, 93 (04) : 641 - 651
  • [35] The identification and characterization of novel transcripts from RNA-seq data
    Weirick, Tyler
    Militello, Giuseppe
    Mueller, Raphael
    John, David
    Dimmeler, Stefanie
    Uchida, Shizuka
    BRIEFINGS IN BIOINFORMATICS, 2016, 17 (04) : 678 - 685
  • [36] A Robust Method for Transcript Quantification with RNA-Seq Data
    Huang, Yan
    Hu, Yin
    Jones, Corbin D.
    MacLeod, James N.
    Chiang, Derek Y.
    Liu, Yufeng
    Prins, Jan F.
    Liu, Jinze
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2013, 20 (03) : 167 - 187
  • [37] Quantitative visualization of alternative exon expression from RNA-seq data
    Katz, Yarden
    Wang, Eric T.
    Silterra, Jacob
    Schwartz, Schraga
    Wong, Bang
    Thorvaldsdottir, Helga
    Robinson, James T.
    Mesirov, Jill P.
    Airoldi, Edoardo M.
    Burge, Christopher B.
    BIOINFORMATICS, 2015, 31 (14) : 2400 - 2402
  • [38] Estimation of alternative splicing isoform frequencies from RNA-Seq data
    Marius Nicolae
    Serghei Mangul
    Ion I Măndoiu
    Alex Zelikovsky
    Algorithms for Molecular Biology, 6
  • [39] Estimation of Alternative Splicing isoform Frequencies from RNA-Seq Data
    Nicolae, Marius
    Mangul, Serghei
    Mandoiu, Ion
    Zelikovsky, Alex
    ALGORITHMS IN BIOINFORMATICS, 2010, 6293 : 202 - +
  • [40] Estimation of alternative splicing isoform frequencies from RNA-Seq data
    Nicolae, Marius
    Mangul, Serghei
    Mandoiu, Ion I.
    Zelikovsky, Alex
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2011, 6