Discovering Motifs in DNA Sequences: A Suffix Tree Based Approach

Cited by: 0
Authors
Prakash, Sanchi [1 ]
Agarwal, Harshit [1 ]
Agarwal, Urvi [1 ]
Biswas, Prantik [1 ]
Dawn, Suma [1 ]
Affiliations
[1] Jaypee Inst Informat Technol, Noida, India
Source
PROCEEDINGS OF THE 2018 IEEE 8TH INTERNATIONAL ADVANCE COMPUTING CONFERENCE (IACC 2018) | 2018
Keywords
Motif finding problem; Suffix tree; DNA sequences; Trie; Transcription factors;
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, methods];
Discipline classification code
081202 ;
Abstract
Motif discovery, also known as motif finding, is a challenging problem in bioinformatics. It applies computational and statistical techniques to identify short patterns, often referred to as motifs, that correspond to transcription factor binding sites in DNA sequences. Owing to the recent growth of bioinformatics, a good number of algorithms have been proposed. This paper proposes an algorithm that extracts transcription factor binding sites from a set of DNA sequences by iterating successively over the sequences provided. The motifs we consider are of unknown length, ungapped, and non-mutated. The algorithm uses a suffix trie to find such sites: the first sequence serves as the base for constructing the suffix trie, and the remaining sequences are mapped against it, which yields the motif. The algorithm can also be applied to related problems in fields such as data mining and pattern detection.
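The abstract does not give implementation details, so the following is only a minimal Python sketch of the general idea it describes: build a suffix trie from the first sequence, then map each remaining sequence against it and keep the exact (ungapped, non-mutated) substrings shared by all sequences. The function names, the plain nested-dict trie, and the choice to report the longest shared substring(s) are assumptions made for illustration, not the authors' method.

```python
def build_suffix_trie(text):
    """Build an uncompressed suffix trie of `text` as nested dicts.

    Assumption: a plain dict-of-dicts trie; it needs O(n^2) space, so it is
    only suitable for short illustrative sequences.
    """
    root = {}
    for start in range(len(text)):
        node = root
        for ch in text[start:]:
            node = node.setdefault(ch, {})
    return root


def substrings_shared_with(trie, seq):
    """Return every substring of `seq` that can be spelled from the trie root,
    i.e. every substring `seq` has in common with the base sequence."""
    found = set()
    for i in range(len(seq)):
        node = trie
        for j in range(i, len(seq)):
            node = node.get(seq[j])
            if node is None:
                break  # no longer a substring of the base sequence
            found.add(seq[i:j + 1])
    return found


def longest_common_motifs(sequences):
    """Hypothetical helper: longest exact substrings present in all sequences."""
    if len(sequences) < 2:
        return []
    base, *others = sequences
    trie = build_suffix_trie(base)
    candidates = None
    for seq in others:
        shared = substrings_shared_with(trie, seq)
        candidates = shared if candidates is None else candidates & shared
    if not candidates:
        return []
    best = max(len(m) for m in candidates)
    return sorted(m for m in candidates if len(m) == best)


if __name__ == "__main__":
    print(longest_common_motifs(["ACGTTGCA", "TTACGTAA", "GGACGTCC"]))
```

Because the motif length is unknown in advance, the sketch collects shared substrings of every length and only selects by length at the end; a compressed suffix tree would reduce memory use for realistic sequence lengths.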
Pages: 327-332
Page count: 6
Related papers
50 in total
  • [21] Parallelizing and optimizing a hybrid differential evolution with Pareto tournaments for discovering motifs in DNA sequences
    Gonzalez-Alvarez, David L.
    Vega-Rodriguez, Miguel A.
    Rubio-Largo, Alvaro
    JOURNAL OF SUPERCOMPUTING, 2014, 70 (02): : 880 - 905
  • [22] Searching Maximal Degenerate Motifs Guided by a Compact Suffix Tree
    Jiang, Hongshan
    Zhao, Ying
    Chen, Wenguang
    Zheng, Weimin
    ADVANCES IN COMPUTATIONAL BIOLOGY, 2010, 680 : 19 - 26
  • [23] Discovering DNA motifs with nucleotide dependency
    Leung, Henry C. M.
    Chin, Francis Y. L.
    BIBE 2006: SIXTH IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, PROCEEDINGS, 2006, : 70 - +
  • [24] Spelling approximate repeated or common motifs using a suffix tree
    Sagot, MF
    LATIN '98: THEORETICAL INFORMATICS, 1998, 1380 : 374 - 390
  • [25] DRIMust: a web server for discovering rank imbalanced motifs using suffix trees
    Leibovich, Limor
    Paz, Inbal
    Yakhini, Zohar
    Mandel-Gutfreund, Yael
    NUCLEIC ACIDS RESEARCH, 2013, 41 (W1) : W174 - W179
  • [26] Finding Motifs in A Set of DNA Sequences: A Dynamic Programming Approach
    Li, Zhen-Hao
    Zheng, Xiao-Juan
    Guan, Ji-Wen
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 198 - +
  • [27] Constructing suffix tree for gigabyte sequences with megabyte memory
    Cheung, CF
    Yu, JX
    Lu, HJ
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (01) : 90 - 105
  • [28] Constraint based method for finding motifs in DNA sequences
    Dong, X
    Sung, SY
    Sung, WK
    Tan, CL
    BIBE 2004: FOURTH IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, PROCEEDINGS, 2004, : 483 - 490
  • [29] An adaptive suffix tree based algorithm for repeats identification in a DNA sequence
    Huo H.-W.
    Wang X.-W.
    Jisuanji Xuebao/Chinese Journal of Computers, 2010, 33 (04): : 747 - 754
  • [30] An adaptive suffix tree based algorithm for repeats recognition in a DNA sequence
    Huo, Hongwei
    Wang, Xiaowu
    Stojkovic, Vojislav
    2009 INTERNATIONAL JOINT CONFERENCE ON BIOINFORMATICS, SYSTEMS BIOLOGY AND INTELLIGENT COMPUTING, PROCEEDINGS, 2009, : 181 - +