Finding Motifs in DNA Sequences Using Low-Dispersion Sequences

被引:49
|
作者
Wang, Xun [1 ]
Miao, Ying [1 ]
Cheng, Minquan [2 ]
机构
[1] Univ Tsukuba, Grad Sch Syst & Informat Engn, Tsukuba, Ibaraki 3058577, Japan
[2] Guangxi Normal Univ, Dept Math, Guilin, Peoples R China
基金
美国国家科学基金会;
关键词
random projection; developed almost difference family; uniform projection; motif finding; low-dispersion sequence; EXPECTATION MAXIMIZATION; PROJECTION; DISCOVERY; BINDING;
D O I
10.1089/cmb.2013.0054
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motif finding problems, abstracted as the planted (l, d)-motif finding problem, are a major task in molecular biology-finding functioning units and genes. In 2002, the random projection algorithm was introduced to solve the challenging (15, 4)-motif finding problem by using randomly chosen templates. Two years later, a so-called uniform projection algorithm was developed to improve the random projection algorithm by means of low-dispersion sequences generated by coverings. In this article, we introduce an improved projection algorithm called the low-dispersion projection algorithm, which uses low-dispersion sequences generated by developed almost difference families. Compared with the random projection algorithm, the low-dispersion projection algorithm can solve the (l, d)-motif finding problem with fewer templates without decreasing the success rate.
引用
收藏
页码:320 / 329
页数:10
相关论文
共 50 条
  • [1] LOW-DISCREPANCY AND LOW-DISPERSION SEQUENCES
    NIEDERREITER, H
    JOURNAL OF NUMBER THEORY, 1988, 30 (01) : 51 - 70
  • [2] MotifMark: Finding Regulatory Motifs in DNA Sequences
    Hassanzadeh, Hamid Reza
    Kolhe, Pushkar
    Isbell, Charles L.
    Wang, May D.
    2017 39TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2017, : 3890 - 3893
  • [3] Constraint based method for finding motifs in DNA sequences
    Dong, X
    Sung, SY
    Sung, WK
    Tan, CL
    BIBE 2004: FOURTH IEEE SYMPOSIUM ON BIOINFORMATICS AND BIOENGINEERING, PROCEEDINGS, 2004, : 483 - 490
  • [4] New scoring schema for finding motifs in DNA Sequences
    Zare-Mirakabad, Fatemeh
    Ahrabian, Hayedeh
    Sadeghi, Mehdei
    Nowzari-Dalini, Abbas
    Goliaei, Bahram
    BMC BIOINFORMATICS, 2009, 10
  • [5] New scoring schema for finding motifs in DNA Sequences
    Fatemeh Zare-Mirakabad
    Hayedeh Ahrabian
    Mehdei Sadeghi
    Abbas Nowzari-Dalini
    Bahram Goliaei
    BMC Bioinformatics, 10
  • [6] Finding motifs in protein sequences
    Todd Richmond
    Genome Biology, 1 (3)
  • [7] Finding motifs in miRNA sequences
    Subramaniam, Chinnu
    An, Jiyuan
    Gubbi, Jayavardhana
    Chen, Yi-Ping Phoebe
    6TH IEEE/ACIS INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCE, PROCEEDINGS, 2007, : 507 - +
  • [8] Finding Motifs in A Set of DNA Sequences: A Dynamic Programming Approach
    Li, Zhen-Hao
    Zheng, Xiao-Juan
    Guan, Ji-Wen
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 198 - +
  • [9] Finding subtle motifs with variable gaps in unaligned DNA sequences
    Hu, YJ
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2003, 70 (01) : 11 - 20
  • [10] The Improvement and Implementation in the Algorithm of Finding Maximal Motifs in DNA Sequences
    Fu Yifan
    Zhou Dongdai
    Zhong Shaochun
    Zhao Ruiqing
    2008 3RD INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEM AND KNOWLEDGE ENGINEERING, VOLS 1 AND 2, 2008, : 176 - +