Deterministic local alignment methods improved by a simple genetic algorithm

被引:10
|
作者
Bi, Chengpeng [1 ]
机构
[1] Univ Missouri, Bioinformat & Intelligent Comp Lab, Div Clin Pharmacol, Childrens Mercy Hosp,Sch Med Comp & Engn, Kansas City, MO 64108 USA
关键词
Expectation maximization (EM); Genetic algorithms (GA); Motif discovery; Multiple sequence local alignment; Memetic algorithms; MULTIPLE SEQUENCE ALIGNMENT; DATA AUGMENTATION ALGORITHMS; DNA-BINDING SITES; EM ALGORITHM; EXPECTATION MAXIMIZATION; REGULATORY PROTEINS; MIXTURE-MODELS; MOTIFS; IDENTIFICATION; STRATEGIES;
D O I
10.1016/j.neucom.2010.01.023
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multiple sequence local alignment, often deployed for de nova discovery of biological motifs hidden in a set of DNA or protein sequences, remains a challenge in bioinformatics and computational biology. Many algorithms and software packages have been developed to address the problem. Expectation maximization (EM), one of the popular local alignment methods, is often used to solve the motif-finding problem. However, EM largely depends on its initialization and can be easily trapped in local optima. This paper presents the Genetic-enabled EM Motif-Finding Algorithm (GEMFA) in an effort to mitigate the difficulties confronted the EM-based motif discovery algorithms. The new algorithm integrates a simple genetic algorithm (GA) with a local searcher to explore the local alignment space, that is, it combines deterministic local alignment methods with a simple GA to effectively perform de novo motif discovery. It first initializes a population of multiple local alignments each of which is encoded on a chromosome that represents a potential solution. GEMFA then performs heuristic search in the whole alignment space using minimum distance length (MDL) as the fitness function, which is generalized from maximum log-likelihood. The genetic algorithm gradually moves this population towards the best alignment from which the motif model is derived. Simulated and real biological sequence analysis showed that GEMFA significantly improved deterministic local alignment methods especially in the subtle motif sequence alignment, and it also outperformed other algorithms tested. (C) 2010 Elsevier B.V. All rights reserved,
引用
收藏
页码:2394 / 2406
页数:13
相关论文
共 50 条
  • [1] A simple genetic algorithm for multiple sequence alignment
    Gondro, C.
    Kinghorn, B. P.
    GENETICS AND MOLECULAR RESEARCH, 2007, 6 (04) : 964 - 982
  • [2] An Improved Genetic Algorithm for Multiple Sequence Alignment
    Fan, Hui
    Wu, Ronghui
    Liao, Bo
    Lu, Xinguo
    JOURNAL OF COMPUTATIONAL AND THEORETICAL NANOSCIENCE, 2012, 9 (10) : 1558 - 1564
  • [3] An improved deterministic local search algorithm for 3-SAT
    Brueggemann, T
    Kern, W
    THEORETICAL COMPUTER SCIENCE, 2004, 329 (1-3) : 303 - 313
  • [4] An Improved Genetic Algorithm for Developing Deterministic OTP Key Generator
    Jain, Ashish
    Chaudhari, Narendra S.
    COMPLEXITY, 2017,
  • [5] Improved Runtime Analysis of the Simple Genetic Algorithm
    Oliveto, Pietro S.
    Witt, Carsten
    GECCO'13: PROCEEDINGS OF THE 2013 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, 2013, : 1621 - 1628
  • [6] Genetic algorithm with a simple heuristic local tuning
    Yang, Rongfu
    Ding, Jing
    Jin, Juliang
    Shuikexue Jinzhan/Advances in Water Science, 1999, 10 (02): : 150 - 154
  • [7] Deterministic genetic algorithm
    Xiong, Z.Y.
    Ding, Y.L.
    Nanjing Hangkong Hangtian Daxue Xuebao/Journal of Nanjing University of Aeronautics and Astronautics, 2001, 33 (01):
  • [8] Genetic Algorithm with Improved Mutation Operator for Multiple Sequence Alignment
    Yadav, Rohit Kumar
    Banka, Haider
    INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, VOL 2, 2015, 340 : 515 - 523
  • [9] Improved time complexity analysis of the Simple Genetic Algorithm
    Oliveto, Pietro S.
    Witt, Carsten
    THEORETICAL COMPUTER SCIENCE, 2015, 605 : 21 - 41
  • [10] Improved Genetic Algorithm to Enhance The Ability of Local Search
    Yuan, Chen
    2014 INTERNATIONAL CONFERENCE ON MECHATRONICS AND CONTROL (ICMC), 2014, : 2100 - 2103