Prediction of protein coding regions in DNA sequences using signal processing methods

被引:0
|
作者
Saberkari, Hamidreza [1 ]
Shamsi, Mousa [1 ]
Sedaaghi, MohammadHossein [1 ]
Golabi, Faegheh [1 ]
机构
[1] Sahand Univ Technol, Dept Elect Engn, Tabriz, Iran
关键词
DNA sequence; protein coding region; signal processing; exon; DFT; notch filter; IDENTIFICATION;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Identification of protein-coding regions in Deoxyribonucleic Acid (DNA) sequences because of their 3-base periodicity has been a challenging issue in bioinformatics. Many DSP (Digital Signal Processing) techniques have been applied for identification task and concentrated on assigning numerical values to the symbolic DNA sequence and then applying spectral analysis tools such as the short-time discrete Fourier transform (ST-DFT) to locate periodicity components. In this paper, we investigate the location of exons in DNA strand using directly the DFT approach. By using this method, we see that background noise in the period-3 DNA spectrum has been present. In order to eliminate this noise and for improve the quality of detection, we use an efficient algorithm based on notch filter. Simulation results represent that by using this simple algorithm, the exon location in DNA sequence can be detected as well as possible and the background noise is removes. In this paper, we have also developed a useful user friendly package to analyze DNA sequences.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Genomic Signal Processing Methods for Computation of Alignment-Free Distances from DNA Sequences
    Borrayo, Ernesto
    Gerardo Mendizabal-Ruiz, E.
    Velez-Perez, Hugo
    Romo-Vazquez, Rebeca
    Mendizabal, Adriana P.
    Alejandro Morales, J.
    PLOS ONE, 2014, 9 (11):
  • [42] Computational analysis of DNA photolyases using digital signal processing methods
    Pirogova, E.
    Vojisavljevic, V.
    Fang, Q.
    Cosic, I.
    MOLECULAR SIMULATION, 2006, 32 (14) : 1195 - 1203
  • [43] Prediction of Protein Coding Regions by Support Vector Machine
    Guo Shuo
    Zhu Yi-sheng
    2009 INTERNATIONAL SYMPOSIUM ON INTELLIGENT UBIQUITOUS COMPUTING AND EDUCATION, 2009, : 185 - 188
  • [44] DNA SEQUENCES CODING FOR MORE THAN ONE PROTEIN
    LEWIN, B
    NATURE, 1976, 264 (5581) : 11 - 12
  • [45] Correlations in DNA sequences: The role of protein coding segments
    Herzel, H
    Grosse, I
    PHYSICAL REVIEW E, 1997, 55 (01) : 800 - 810
  • [46] SYSTEMATIC ANALYSIS OF CODING AND NONCODING DNA-SEQUENCES USING METHODS OF STATISTICAL LINGUISTICS
    MANTEGNA, RN
    BULDYREV, SV
    GOLDBERGER, AL
    HAVLIN, S
    PENG, CK
    SIMONS, M
    STANLEY, HE
    PHYSICAL REVIEW E, 1995, 52 (03) : 2939 - 2950
  • [47] Comparison of the accuracies of several phylogenetic methods using protein and DNA sequences
    Hall, BG
    MOLECULAR BIOLOGY AND EVOLUTION, 2005, 22 (03) : 792 - 802
  • [48] IDENTIFICATION OF PROTEIN-CODING REGIONS IN GENOMIC DNA
    SNYDER, EE
    STORMO, GD
    JOURNAL OF MOLECULAR BIOLOGY, 1995, 248 (01) : 1 - 18
  • [49] Prediction of Protein-Protein Interaction Based Only on Coding Sequences
    Wang, Yongcui
    Wang, Ji-Guang
    Yang, Zhi-Xia
    Deng, Naiyang
    OPTIMIZATION AND SYSTEMS BIOLOGY, 2009, 11 : 151 - +
  • [50] Locating regions of differential variability in DNA and protein sequences
    Tang, H
    Lewontin, RC
    GENETICS, 1999, 153 (01) : 485 - 495