A survey on identification and quantification of alternative polyadenylation sites from RNA-seq data

Cited by: 25
Authors
Chen, Moliang [1]
Ji, Guoli [1, 2]
Fu, Hongjuan [1]
Lin, Qianmin [3]
Ye, Congting [4]
Ye, Wenbin [1]
Su, Yaru [5]
Wu, Xiaohui [1, 6]
Affiliations
[1] Xiamen Univ, Dept Automat, Xiamen 361005, Fujian, Peoples R China
[2] Xiamen Res Inst, Xiamen, Peoples R China
[3] Xiamen Univ, Xiangan Hosp, Xiamen, Peoples R China
[4] Xiamen Univ, Coll Environm & Ecol, Xiamen, Peoples R China
[5] Fuzhou Univ, Coll Math & Comp Sci, Fuzhou, Peoples R China
[6] Xiamen Res Inst, Natl Ctr Healthcare Big Data, Xiamen, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
alternative polyadenylation; RNA-seq; 3' untranslated region; benchmark; predictive modeling; 3' UNTRANSLATED REGIONS; CHANGE-POINT MODEL; GENE-EXPRESSION; MESSENGER-RNAS; POLY(A) SITES; CLEAVAGE; REVEALS; WIDESPREAD; MECHANISMS; DYNAMICS
DOI
10.1093/bib/bbz068
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Alternative polyadenylation (APA) has been implicated in post-transcriptional regulation through its effects on mRNA abundance, stability, localization and translation, contributing considerably to transcriptome diversity and gene expression regulation. RNA-seq has become a routine approach for transcriptome profiling, generating unprecedented volumes of data that can be used to identify and quantify APA site usage. A number of computational approaches for identifying APA sites and/or dynamic APA events from RNA-seq data have emerged in the literature. They provide valuable yet preliminary results that should be refined to yield credible guidelines for the scientific community. In this review, we provide a comprehensive overview of the status of currently available computational approaches. We also conduct an objective benchmarking analysis, using RNA-seq data sets from different species (human, mouse and Arabidopsis) and simulated data sets, to present a systematic evaluation of 11 representative methods. Our benchmarking study shows that the overall performance of all tools investigated is moderate, indicating that there is still considerable scope to improve the prediction of APA sites and dynamic APA events from RNA-seq data. In particular, prediction results from individual tools differ considerably, and only a limited number of predicted APA sites or genes are common among different tools. Accordingly, we give some advice on how to assess the reliability of the obtained results. We also propose practical recommendations on the appropriate method for diverse scenarios and discuss implications and future directions relevant to profiling APA from RNA-seq data.
Pages: 1261-1276
Page count: 16
Related papers
50 in total
  • [1] APAtrap: identification and quantification of alternative polyadenylation sites from RNA-seq data
    Ye, Congting
    Long, Yuqi
    Ji, Guoli
    Li, Qingshun Quinn
    Wu, Xiaohui
    BIOINFORMATICS, 2018, 34 (11) : 1841 - 1849
  • [2] scAPAtrap: identification and quantification of alternative polyadenylation sites from single-cell RNA-seq data
    Wu, Xiaohui
    Liu, Tao
    Ye, Congting
    Ye, Wenbin
    Ji, Guoli
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (04)
  • [3] Identification of Alternative Splicing and Polyadenylation in RNA-seq Data
    Dixit, Gunjan
    Zheng, Ying
    Parker, Brian
    Wen, Jiayu
    JOVE-JOURNAL OF VISUALIZED EXPERIMENTS, 2021, (172):
  • [4] Computational analysis of alternative polyadenylation from standard RNA-seq and single-cell RNA-seq data
    Gao, Yipeng
    Li, Wei
    MRNA 3' END PROCESSING AND METABOLISM, 2021, 655 : 225 - 243
  • [5] APAtizer: a tool for alternative polyadenylation analysis of RNA-Seq data
    Sousa, Bruno
    Bessa, Maria
    de Mendonca, Filipa L.
    Ferreira, Pedro G.
    Moreira, Alexandra
    Pereira-Castro, Isabel
    BIOINFORMATICS, 2024, 40 (11)
  • [6] mountainClimber Identifies Alternative Transcription Start and Polyadenylation Sites in RNA-Seq
    Cass, Ashley A.
    Xiao, Xinshu
    CELL SYSTEMS, 2019, 9 (04) : 393 - +
  • [7] SplAdder: identification, quantification and testing of alternative splicing events from RNA-Seq data
    Kahles, Andre
    Ong, Cheng Soon
    Zhong, Yi
    Ratsch, Gunnar
    BIOINFORMATICS, 2016, 32 (12) : 1840 - 1847
  • [8] A Survey on Methods for Predicting Polyadenylation Sites from DNA Sequences, Bulk RNA-seq, and Single-cell RNA-seq
    Ye, Wenbin
    Lian, Qiwei
    Ye, Congting
    Wu, Xiaohui
    GENOMICS PROTEOMICS & BIOINFORMATICS, 2023, 21 (01) : 67 - 83
  • [9] nagnag: Identification and quantification of NAGNAG alternative splicing using RNA-Seq data
    Yan, Xiaoyan
    Sablok, Gaurav
    Feng, Gang
    Ma, Jiaxin
    Zhao, Hongwei
    Sun, Xiaoyong
    FEBS LETTERS, 2015, 589 (15) : 1766 - 1770
  • [10] Extensible benchmarking of methods that identify and quantify polyadenylation sites from RNA-seq data
    Bryce-Smith, Sam
    Burri, Dominik
    Gazzara, Matthew R.
    Herrmann, Christina J.
    Danecka, Weronika
    Fitzsimmons, Christina M.
    Wan, Yuk Kei
    Zhuang, Farica
    Fansler, Mervin M.
    Fernandez, Jose M.
    Ferret, Meritxell
    Gonzalez-Uriarte, Asier
    Haynes, Samuel
    Herdman, Chelsea
    Kanitz, Alexander
    Katsantoni, Maria
    Marini, Federico
    McDonnel, Euan
    Nicolet, Ben
    Poon, Chi-Lam
    Rot, Gregor
    Scharfen, Leonard
    Wu, Pin-Jou
    Yoon, Yoseop
    Barash, Yoseph
    Zavolan, Mihaela
    RNA, 2023, 29 (12) : 1839 - 1855