Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega

Cited by: 11059
Authors
Sievers, Fabian [1 ]
Wilm, Andreas [2 ]
Dineen, David [1 ]
Gibson, Toby J. [3 ]
Karplus, Kevin [4 ]
Li, Weizhong [5 ]
Lopez, Rodrigo [5 ]
McWilliam, Hamish [5 ]
Remmert, Michael [6 ]
Soeding, Johannes [6 ]
Thompson, Julie D. [7 ]
Higgins, Desmond G. [1 ]
Affiliations
[1] Univ Coll Dublin, UCD Conway Inst Biomol & Biomed Res, Sch Med & Med Sci, Dublin 4, Ireland
[2] Genome Inst Singapore, Singapore, Singapore
[3] European Mol Biol Lab, Struct & Computat Biol Unit, Heidelberg, Germany
[4] Univ Calif Santa Cruz, Dept Biomol Engn, Santa Cruz, CA 95064 USA
[5] European Bioinformat Inst, EMBL Outstn, Cambridge, England
[6] Univ Munich LMU, Gene Ctr Munich, Munich, Germany
[7] Univ Strasbourg, Dept Biol Struct & Genom, IGBMC, CNRS,INSERM, Illkirch Graffenstaden, France
Funding
Science Foundation Ireland;
Keywords
bioinformatics; hidden Markov models; multiple sequence alignment; CONSTRUCTION; ALGORITHM; ACCURATE; COFFEE; TREES;
DOI
10.1038/msb.2011.75
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Subject classification code
071010 ; 081704 ;
Abstract
Multiple sequence alignments are fundamental to many sequence analysis methods. Most alignments are computed using the progressive alignment heuristic. These methods are starting to become a bottleneck in some analysis pipelines when faced with data sets of many thousands of sequences. Some methods allow computation of larger data sets while sacrificing quality, and others produce high-quality alignments but scale badly with the number of sequences. In this paper, we describe a new program called Clustal Omega, which can align virtually any number of protein sequences quickly and deliver accurate alignments. The accuracy of the package on smaller test cases is similar to that of the high-quality aligners. On larger data sets, Clustal Omega outperforms other packages in terms of execution time and quality. Clustal Omega also has powerful features for adding sequences to and exploiting information in existing alignments, making use of the vast amount of precomputed information in public databases like Pfam. Molecular Systems Biology 7: 539; published online 11 October 2011; doi:10.1038/msb.2011.75
Pages: 6