Genscan for Arabidopsis is a valuable tool for predicting sponge coding sequences

被引:0
|
作者
Mauro Stifanic
Renato Batel
机构
[1] Center for Marine Research,Rudjer Boskovic Institute
来源
Biologia | 2007年 / 62卷
关键词
Porifera; exon; intron; prediction;
D O I
暂无
中图分类号
学科分类号
摘要
Although the first sponge genome project has already started releasing completed sequences, only a very small number of annotated sponge genomic sequences has so far been published. In addition, no gene-prediction software optimised for sponges is available yet. In the present paper, we present the performance of Arabidopsis-optimised Genscan as tested on sponge genes. All genes whose genomic and complete CDS sequences are deposited in the NCBI nucleotide database were retrieved and used as the test set. The 18 test genes are composed of 114 coding exons. The sensitivity and specificity, respectively, of all exons were predicted with 83.3% and 79.2%, internal exons with 88.5% and 80.2%, donor with 93.8% and 85.7%, acceptor with 89.6% and 78.9%, initiation with 94.4% and 85%, and termination sites with 72.2% and 81.3%. The results are compared with prediction results obtained with Genscan for vertebrates and GeneMark.hmm ES-3.0 for Arabidopsis. The surprising finding is that although the animals are the source of sequences, the best results (more than 80% accuracy in predicting complete exons) were obtained by Genscan optimised for a plant A. thaliana. Although the sample is small, the results lead to the conclusion that Genscan for Arabidopsis is a valuable tool for predicting coding sequences in sponges and could be of great help in annotating sponge genes.
引用
收藏
页码:124 / 127
页数:3
相关论文
共 50 条
  • [1] Genscan for Arabidopsis is a valuable tool for predicting sponge coding sequences
    Stifanic, Mauro
    Batel, Renato
    [J]. BIOLOGIA, 2007, 62 (02) : 124 - 127
  • [2] The multivariate model: a valuable tool in predicting poor response in IVF
    Klinkert, E. R.
    Bancsi, L. F. J. M. M.
    Looman, C. W.
    Eijkemans, M. J. C.
    Habbema, J. D. F.
    Broekmans, F. J.
    Te Velde, E. R.
    [J]. HUMAN REPRODUCTION, 2001, 16 : 73 - 74
  • [3] Correlations of length distributions between non-coding and coding sequences of Arabidopsis thaliana
    Caldwell, Rachel
    Lin, Yan-Xia
    Zhang, Ren
    [J]. 2008 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, PROCEEDINGS, 2008, : 72 - 77
  • [4] Purifying selection acts on coding and non-coding sequences of paralogous genes in Arabidopsis thaliana
    Robert D. Hoffmann
    Michael Palmgren
    [J]. BMC Genomics, 17
  • [5] Purifying selection acts on coding and non-coding sequences of paralogous genes in Arabidopsis thaliana
    Hoffmann, Robert D.
    Palmgren, Michael
    [J]. BMC GENOMICS, 2016, 17
  • [6] Distribution of microsatellites in relation to coding sequences within the Arabidopsis thaliana genome
    Casacuberta, E
    Puigdomènech, P
    Monfort, A
    [J]. PLANT SCIENCE, 2000, 157 (01) : 97 - 104
  • [7] Zebrafish (Danio rerio): A valuable tool for predicting the metabolism of xenobiotics in humans?
    Anselmo, Carina de Souza
    Sardela, Vinicius Figueiredo
    de Sousa, Valeria Pereira
    Gualberto Pereira, Henrique Marcelo
    [J]. COMPARATIVE BIOCHEMISTRY AND PHYSIOLOGY C-TOXICOLOGY & PHARMACOLOGY, 2018, 212 : 34 - 46
  • [8] Seforta, an integrated tool for detecting the signature of selection in coding sequences
    Camiolo S.
    Melito S.
    Milia G.
    Porceddu A.
    [J]. BMC Research Notes, 7 (1)
  • [9] OLIGOPEPTIDE BIASES IN PROTEIN SEQUENCES AND THEIR USE IN PREDICTING PROTEIN CODING REGIONS IN NUCLEOTIDE-SEQUENCES
    MCCALDON, P
    ARGOS, P
    [J]. PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1988, 4 (02): : 99 - 122
  • [10] Conserved non-coding sequences are associated with rates of mRNa decay in Arabidopsis
    Spangler, Jacob B.
    Feltus, Frank Alex
    [J]. FRONTIERS IN PLANT SCIENCE, 2013, 4