Predicting the accuracy of multiple sequence alignment algorithms by using computational intelligent techniques

被引:10
|
作者
Ortuno, Francisco M. [1 ]
Valenzuela, Olga [2 ]
Pomares, Hector [1 ]
Rojas, Fernando [1 ]
Florido, Javier P. [3 ]
Urquiza, Jose M. [4 ]
Rojas, Ignacio [1 ]
机构
[1] Univ Granada UGR, Dept Comp Architecture & Comp Technol, Granada 18071, Spain
[2] Univ Granada UGR, Dept Appl Math, Granada 18071, Spain
[3] Andalusian Human Genome Sequencing Ctr CASEGH, Med Genome Project, Seville 41092, Spain
[4] Bellvitge Biomed Res Inst IDIBELL, Chromatin & Dis Grp, Barcelona 08907, Spain
关键词
MUTUAL INFORMATION; GENE ONTOLOGY; SELECTION;
D O I
10.1093/nar/gks919
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Multiple sequence alignments (MSAs) have become one of the most studied approaches in bioinformatics to perform other outstanding tasks such as structure prediction, biological function analysis or next-generation sequencing. However, current MSA algorithms do not always provide consistent solutions, since alignments become increasingly difficult when dealing with low similarity sequences. As widely known, these algorithms directly depend on specific features of the sequences, causing relevant influence on the alignment accuracy. Many MSA tools have been recently designed but it is not possible to know in advance which one is the most suitable for a particular set of sequences. In this work, we analyze some of the most used algorithms presented in the bibliography and their dependences on several features. A novel intelligent algorithm based on least square support vector machine is then developed to predict how accurate each alignment could be, depending on its analyzed features. This algorithm is performed with a dataset of 2180 MSAs. The proposed system first estimates the accuracy of possible alignments. The most promising methodologies are then selected in order to align each set of sequences. Since only one selected algorithm is run, the computational time is not excessively increased.
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Improving Standard Progressive Multiple Sequence Alignment by Using Multithreading Techniques
    Mohamed, Eman M.
    Mousa, Hamdy M.
    Keshk, Arabi E.
    2018 14TH INTERNATIONAL COMPUTER ENGINEERING CONFERENCE (ICENCO), 2018, : 156 - 161
  • [22] UPS: A New Approach for Multiple Sequence Alignment Using Morphing Techniques
    Quoc-Nam Tran
    Wallinga, Mike
    2017 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2017, : 425 - 430
  • [23] Evolutionary computation techniques for multiple sequence alignment
    Cai, LM
    Juedes, D
    Liakhovitch, E
    PROCEEDINGS OF THE 2000 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1 AND 2, 2000, : 829 - 835
  • [24] Optimizing Multiple Sequence Alignment using Multi-Objective Genetic Algorithms
    Yadav, Sohan Kumar
    Jha, Sudhanshu Kumar
    Singh, Sudhakar
    Dixit, Pratibha
    Prakash, Shiv
    Singh, Astha
    2022 INTERNATIONAL CONFERENCE ON DECISION AID SCIENCES AND APPLICATIONS (DASA), 2022, : 113 - 117
  • [25] The accuracy of several multiple sequence alignment programs for proteins
    Paulo AS Nuin
    Zhouzhi Wang
    Elisabeth RM Tillier
    BMC Bioinformatics, 7
  • [26] The accuracy of several multiple sequence alignment programs for proteins
    Nuin, Paulo A. S.
    Wang, Zhouzhi
    Tillier, Elisabeth R. M.
    BMC BIOINFORMATICS, 2006, 7 (1)
  • [27] MUSCLE: Multiple sequence alignment with improved accuracy and speed
    Edgar, RC
    2004 IEEE COMPUTATIONAL SYSTEMS BIOINFORMATICS CONFERENCE, PROCEEDINGS, 2004, : 728 - 729
  • [28] Evaluating the Accuracy and Efficiency of Multiple Sequence Alignment Methods
    Pervez, Muhammad Tariq
    Babar, Masroor Ellahi
    Nadeem, Asif
    Aslam, Muhammad
    Awan, Ali Raza
    Aslam, Naeem
    Hussain, Tanveer
    Naveed, Nasir
    Qadri, Salman
    Waheed, Usman
    Shoaib, Muhammad
    EVOLUTIONARY BIOINFORMATICS, 2014, 10
  • [29] COMPREHENSIVE STUDY ON ITERATIVE ALGORITHMS OF MULTIPLE SEQUENCE ALIGNMENT
    HIROSAWA, M
    TOTOKI, Y
    HOSHIDA, M
    ISHIKAWA, M
    COMPUTER APPLICATIONS IN THE BIOSCIENCES, 1995, 11 (01): : 13 - 18
  • [30] Multiple protein sequence alignment: Algorithms and gap insertion
    Taylor, WR
    COMPUTER METHODS FOR MACROMOLECULAR SEQUENCE ANALYSIS, 1996, 266 : 343 - 367