Predicting the accuracy of multiple sequence alignment algorithms by using computational intelligent techniques

被引:10
|
作者
Ortuno, Francisco M. [1 ]
Valenzuela, Olga [2 ]
Pomares, Hector [1 ]
Rojas, Fernando [1 ]
Florido, Javier P. [3 ]
Urquiza, Jose M. [4 ]
Rojas, Ignacio [1 ]
机构
[1] Univ Granada UGR, Dept Comp Architecture & Comp Technol, Granada 18071, Spain
[2] Univ Granada UGR, Dept Appl Math, Granada 18071, Spain
[3] Andalusian Human Genome Sequencing Ctr CASEGH, Med Genome Project, Seville 41092, Spain
[4] Bellvitge Biomed Res Inst IDIBELL, Chromatin & Dis Grp, Barcelona 08907, Spain
关键词
MUTUAL INFORMATION; GENE ONTOLOGY; SELECTION;
D O I
10.1093/nar/gks919
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Multiple sequence alignments (MSAs) have become one of the most studied approaches in bioinformatics to perform other outstanding tasks such as structure prediction, biological function analysis or next-generation sequencing. However, current MSA algorithms do not always provide consistent solutions, since alignments become increasingly difficult when dealing with low similarity sequences. As widely known, these algorithms directly depend on specific features of the sequences, causing relevant influence on the alignment accuracy. Many MSA tools have been recently designed but it is not possible to know in advance which one is the most suitable for a particular set of sequences. In this work, we analyze some of the most used algorithms presented in the bibliography and their dependences on several features. A novel intelligent algorithm based on least square support vector machine is then developed to predict how accurate each alignment could be, depending on its analyzed features. This algorithm is performed with a dataset of 2180 MSAs. The proposed system first estimates the accuracy of possible alignments. The most promising methodologies are then selected in order to align each set of sequences. Since only one selected algorithm is run, the computational time is not excessively increased.
引用
收藏
页数:10
相关论文
共 50 条
  • [31] DCA based algorithms for multiple sequence alignment (MSA)
    Hoai An Le Thi
    Tao Pham Dinh
    Belghiti, Moulay
    CENTRAL EUROPEAN JOURNAL OF OPERATIONS RESEARCH, 2014, 22 (03) : 501 - 524
  • [32] Improving the efficiency of multiple sequence alignment by genetic algorithms
    Seijas, J
    Morató, C
    Andina, D
    Vega-Corona, A
    ARTIFICIAL NEURAL NETS PROBLEM SOLVING METHODS, PT II, 2003, 2687 : 361 - 368
  • [33] Multiple sequence alignment accuracy and evolutionary distance estimation
    Michael S Rosenberg
    BMC Bioinformatics, 6
  • [34] Multiple sequence alignment accuracy and evolutionary distance estimation
    Rosenberg, MS
    BMC BIOINFORMATICS, 2005, 6 (1)
  • [35] DCA based algorithms for multiple sequence alignment (MSA)
    Hoai An Le Thi
    Tao Pham Dinh
    Moulay Belghiti
    Central European Journal of Operations Research, 2014, 22 : 501 - 524
  • [36] Predicting the oxidation state of cysteines by multiple sequence alignment
    Fiser, A
    Simon, I
    BIOINFORMATICS, 2000, 16 (03) : 251 - 256
  • [37] Multiple sequence alignment using anytime A*
    Zhou, R
    Hansen, EA
    EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, 2002, : 975 - 976
  • [38] Computational intelligent techniques for predicting optical behavior of different materials
    Mohamed, R.A.
    El-Nahass, M.M.
    El-Bakry, M.Y.
    El-Dahshan, El-Sayed A.
    Aamer, E.H.
    Habashy, D.M.
    Optik, 2024, 313
  • [39] Improvement in accuracy of multiple sequence alignment using novel group-to-group sequence alignment algorithm with piecewise linear gap cost
    Yamada, Shinsuke
    Gotoh, Osamu
    Yamana, Hayato
    BMC BIOINFORMATICS, 2006, 7 (1)
  • [40] Improvement in accuracy of multiple sequence alignment using novel group-to-group sequence alignment algorithm with piecewise linear gap cost
    Shinsuke Yamada
    Osamu Gotoh
    Hayato Yamana
    BMC Bioinformatics, 7