ISOGO: Functional annotation of protein-coding splice variants

被引:0
|
作者
Juan A Ferrer-Bonsoms
Ignacio Cassol
Pablo Fernández-Acín
Carlos Castilla
Fernando Carazo
Angel Rubio
机构
[1] Department of Biomedical Engineering and Sciences,
[2] Tecnun-Universidad de Navarra,undefined
[3] Manuel de Lardizábal 15,undefined
[4] Department of Bioengineering,undefined
[5] Facultad de Ingeniería,undefined
[6] Universidad Austral,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
The advent of RNA-seq technologies has switched the paradigm of genetic analysis from a genome to a transcriptome-based perspective. Alternative splicing generates functional diversity in genes, but the precise functions of many individual isoforms are yet to be elucidated. Gene Ontology was developed to annotate gene products according to their biological processes, molecular functions and cellular components. Despite a single gene may have several gene products, most annotations are not isoform-specific and do not distinguish the functions of the different proteins originated from a single gene. Several approaches have tried to automatically annotate ontologies at the isoform level, but this has shown to be a daunting task. We have developed ISOGO (ISOform + GO function imputation), a novel algorithm to predict the function of coding isoforms based on their protein domains and their correlation of expression along 11,373 cancer patients. Combining these two sources of information outperforms previous approaches: it provides an area under precision-recall curve (AUPRC) five times larger than previous attempts and the median AUROC of assigned functions to genes is 0.82. We tested ISOGO predictions on some genes with isoform-specific functions (BRCA1, MADD,VAMP7 and ITSN1) and they were coherent with the literature. Besides, we examined whether the main isoform of each gene -as predicted by APPRIS- was the most likely to have the annotated gene functions and it occurs in 99.4% of the genes. We also evaluated the predictions for isoform-specific functions provided by the CAFA3 challenge and results were also convincing. To make these results available to the scientific community, we have deployed a web application to consult ISOGO predictions (https://biotecnun.unav.es/app/isogo). Initial data, website link, isoform-specific GO function predictions and R code is available at https://gitlab.com/icassol/isogo.
引用
收藏
相关论文
共 50 条
  • [1] ISOGO: Functional annotation of protein-coding splice variants
    Ferrer-Bonsoms, Juan A.
    Cassol, Ignacio
    Fernandez-Acin, Pablo
    Castilla, Carlos
    Carazo, Fernando
    Rubio, Angel
    [J]. SCIENTIFIC REPORTS, 2020, 10 (01)
  • [2] Genome-Wide Functional Annotation of Human Protein-Coding Splice Variants Using Multiple Instance Learning
    Panwar, Bharat
    Menon, Rajasree
    Eksi, Ridvan
    Li, Hong-Dong
    Omenn, Gilbert S.
    Guan, Yuanfang
    [J]. JOURNAL OF PROTEOME RESEARCH, 2016, 15 (06) : 1747 - 1753
  • [3] EXPANSION: a webserver to explore the functional consequences of protein-coding alternative splice variants in cancer genomics
    Arora, Chakit
    Rosa, Natalia De Oliveira
    Matic, Marin
    Cascone, Mariastella
    Miglionico, Pasquale
    Raimondi, Francesco
    [J]. BIOINFORMATICS ADVANCES, 2023, 3 (01):
  • [4] An Updated Functional Annotation of Protein-Coding Genes in the Cucumber Genome
    Song, Hongtao
    Lin, Kui
    Hu, Jinglu
    Pang, Erli
    [J]. FRONTIERS IN PLANT SCIENCE, 2018, 9
  • [5] ANNOTATION OF PROTEIN-CODING GENES IN FUNGAL GENOMES
    Martinez, Diego
    Grigoriev, Igor
    Salamov, Asaf
    [J]. APPLIED AND COMPUTATIONAL MATHEMATICS, 2010, 9 : 56 - 65
  • [6] Accurate annotation of protein-coding genes in mitochondrial genomes
    Al Arab, Marwa
    zu Siederdissen, Christian Hoener
    Tout, Kifah
    Sahyoun, Abdullah H.
    Stadler, Peter F.
    Bernt, Matthias
    [J]. MOLECULAR PHYLOGENETICS AND EVOLUTION, 2017, 106 : 209 - 216
  • [7] Refining reference human protein-coding gene annotation
    Frankish, A.
    [J]. EUROPEAN JOURNAL OF HUMAN GENETICS, 2020, 28 (SUPPL 1) : 652 - 652
  • [8] Current methods for automated annotation of protein-coding genes
    Hoff, K. J.
    Stanke, M.
    [J]. CURRENT OPINION IN INSECT SCIENCE, 2015, 7 : 8 - 14
  • [9] Genome-wide annotation of protein-coding genes in pig
    Karlsson, Max
    Sjostedt, Evelina
    Oksvold, Per
    Sivertsson, Asa
    Huang, Jinrong
    Alvez, Maria Bueno
    Arif, Muhammad
    Li, Xiangyu
    Lin, Lin
    Yu, Jiaying
    Ma, Tao
    Xu, Fengping
    Han, Peng
    Jiang, Hui
    Mardinoglu, Adil
    Zhang, Cheng
    von Feilitzen, Kalle
    Xu, Xun
    Wang, Jian
    Yang, Huanming
    Bolund, Lars
    Zhong, Wen
    Fagerberg, Linn
    Lindskog, Cecilia
    Ponten, Fredrik
    Mulder, Jan
    Luo, Yonglun
    Uhlen, Mathias
    [J]. BMC BIOLOGY, 2022, 20 (01)
  • [10] Annotation of Protein-Coding in Drosophila biarmipes Contig6
    Yonke, J. M.
    Sadikot, T.
    [J]. MOLECULAR BIOLOGY OF THE CELL, 2016, 27