A symbolic-numeric approach to find patterns in genomes: Application to the translation initiation sites of E. coli

Cited by: 2
Authors
Delamarche, C
Guerdoux-Jamet, P
Gras, R
Nicolas, J
Affiliations
[1] CNRS, UPRES A 6026, Equipe Canaux & Recepteurs Membranaires, F-35042 Rennes, France
[2] Hop Pontchaillou, INSERM, U522, F-35033 Rennes, France
[3] INRIA, IRISA, F-35042 Rennes, France
Keywords
Shine-Dalgarno; translation initiation; genome of E-coli; computational analysis;
DOI
10.1016/S0300-9084(99)00328-4
Chinese Library Classification (CLC)
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
DNA sequence data provided by genome sequencing programs open new research prospects. In this respect, computational investigations are of major importance to discover new 'functional/structural patterns' and to improve knowledge of biological processes. For example, even though the principal steps of translation initiation in prokaryotes are known, it is difficult to pinpoint the exact pattern of the mRNA that is recognized by the ribosome. In this study, we have carried out a systematic context analysis of the complete genome of E. coli, around codons in competition for translation initiation. Using a combinatorial approach, we first show that it is possible to accurately define the initiation site by looking for the localization of patterns representing various combinations of trinucleotides. We have combined this approach with a statistical analysis based on the frequencies of these patterns. This leads to a decision tree able to discriminate true and false starts with a recognition level near 90%. Our method may help to precisely localize the beginning of open reading frames, and point to likely mistakes for some genes in the database. The method may be included as a component of a gene recognition system, is not restricted to a particular genome or to two-class discrimination, and may be applied to a broader class of biological patterns. © Société française de biochimie et biologie moléculaire/Éditions scientifiques et médicales Elsevier SAS.
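The context analysis described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the candidate codon set (ATG, GTG, TTG), the window size, and the function names `candidate_starts`, `upstream_trinucleotides`, and `pattern_counts` are all assumptions made for illustration. It only shows how trinucleotides upstream of competing start codons could be tallied by position, which is the kind of frequency table a later statistical step or decision tree might consume.

```python
from collections import Counter

# Assumption: codons treated as competing candidates for translation initiation.
START_CODONS = ("ATG", "GTG", "TTG")


def candidate_starts(seq):
    """Yield the index of every candidate start codon in seq (all frames)."""
    for i in range(len(seq) - 2):
        if seq[i:i + 3] in START_CODONS:
            yield i


def upstream_trinucleotides(seq, start, window=20):
    """Return (offset, trinucleotide) pairs lying fully upstream of start.

    offset is the position of the trinucleotide's first base relative to
    the start codon, so it is always negative.
    """
    lo = max(0, start - window)
    # i + 3 <= start keeps the trinucleotide from overlapping the start codon.
    return [(i - start, seq[i:i + 3]) for i in range(lo, start - 2)]


def pattern_counts(seq, starts, window=20):
    """Count how often each (offset, trinucleotide) pair occurs over starts."""
    counts = Counter()
    for s in starts:
        counts.update(upstream_trinucleotides(seq, s, window))
    return counts
```

Comparing such counts between annotated (true) starts and other in-frame candidates (false starts) would give the per-pattern frequencies on which a discriminating rule could be built; how the paper actually selects patterns and builds its tree is not specified in this record.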
Pages: 1065-1072
Number of pages: 8