BranchClust: a phylogenetic algorithm for selecting gene families

Cited by: 35
Authors
Poptsova, Maria S. [1 ]
Gogarten, J. Peter [1 ]
Affiliations
[1] Univ Connecticut, Dept Mol & Cell Biol, Storrs, CT 06269 USA
DOI
10.1186/1471-2105-8-120
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: Automated methods for assembling families of orthologous genes include those based on sequence similarity scores and those based on phylogenetic approaches. The former are easy to automate, but they usually do not distinguish between paralogs and orthologs, or they restrict the number of taxa. Phylogenetic methods are often based on reconciling a gene tree with a known rooted species tree. A limitation of this approach, especially in the case of prokaryotes, is that the species tree is often unknown and that the branching order between related organisms is frequently unresolved in analyses of single gene families.

Results: Here we describe an algorithm for the automated selection of orthologous genes that recognizes orthologous genes from different species in a phylogenetic tree for any number of taxa. The algorithm is capable of distinguishing complete (containing all taxa) and incomplete (not containing all taxa) families, and it recognizes in- and outparalogs. The BranchClust algorithm is implemented in Perl, uses the BioPerl module for parsing trees, and is freely available at http://bioinformatics.org/branchclust.

Conclusion: BranchClust outperforms the reciprocal best BLAST hit method in that it selects more sets of putatively orthologous genes. In the test cases examined, the correctness of the selected families and of the identified in- and outparalogs was confirmed by inspection of the pertinent phylogenetic trees.
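Two terms from the abstract can be illustrated with a small sketch: the reciprocal best hit criterion used by the comparison method, and the distinction between complete and incomplete families. This is not the BranchClust implementation (which is written in Perl and works on phylogenetic trees). The function names, the `best_hit` table, and the toy genes and taxa below are all hypothetical and chosen only for illustration.

```python
from typing import Dict, List, Set, Tuple

def reciprocal_best_hits(
    best_hit: Dict[Tuple[str, str], str],
    taxon_of: Dict[str, str],
) -> Set[Tuple[str, str]]:
    """Return gene pairs that are each other's best hit.

    best_hit[(gene, target_taxon)] is assumed to hold the best-scoring
    gene found in target_taxon when searching with `gene`.
    """
    pairs = set()
    for (gene, _target_taxon), hit in best_hit.items():
        # Look up the best hit of `hit` back in the taxon of `gene`.
        back = best_hit.get((hit, taxon_of[gene]))
        if back == gene:
            pairs.add(tuple(sorted((gene, hit))))
    return pairs

def classify_family(
    family: List[str],
    taxon_of: Dict[str, str],
    all_taxa: Set[str],
) -> str:
    """Label a family 'complete' if every taxon is represented, else 'incomplete'."""
    present = {taxon_of[gene] for gene in family}
    return "complete" if all_taxa <= present else "incomplete"

# Hypothetical toy data: taxon B carries two related genes, b1 and b2.
taxon_of = {"a1": "A", "b1": "B", "b2": "B", "c1": "C"}
all_taxa = {"A", "B", "C"}
best_hit = {
    ("a1", "B"): "b1", ("b1", "A"): "a1", ("b2", "A"): "a1",
    ("a1", "C"): "c1", ("c1", "A"): "a1",
    ("b1", "C"): "c1", ("c1", "B"): "b2", ("b2", "C"): "c1",
}

pairs = reciprocal_best_hits(best_hit, taxon_of)
```

In this toy case the pairwise hits are not mutually consistent across the three taxa (a1 pairs with b1, but c1 pairs with b2), which is the kind of situation in which a pairwise similarity criterion alone does not yield a single family containing all taxa.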
Pages: 16
Related papers
50 in total
  • [21] PlasmoGF: an integrated system for comparative genomics and phylogenetic analysis of Plasmodium gene families
    Xu, Xiang
    Wu, Jinyu
    Xiao, Jian
    Tan, Yi
    Bao, Qiyu
    Zhao, Fangqing
    Li, Xiaokun
    BIOINFORMATICS, 2008, 24 (09) : 1217 - 1220
  • [22] Transcriptomic and phylogenetic analysis of Culex pipiens quinquefasciatus for three detoxification gene families
    Yan, Liangzhen
    Yang, Pengcheng
    Jiang, Feng
    Cui, Na
    Ma, Enbo
    Qiao, Chuanling
    Cui, Feng
    BMC GENOMICS, 2012, 13
  • [23] Selecting β-Thalassemia Patients for Gene Therapy: A Decision-Making Algorithm
    Baronciani, Donatella
    Casale, Maddalena
    De Franceschi, Lucia
    Graziadei, Giovanna
    Longo, Filomena
    Origa, Raffaella
    Pinto, Valeria Maria
    Rigano, Paolo
    Marchetti, Monia
    Gigante, Antonia
    Angelucci, Emanuele
    Cappellini, Maria Domenica
    Iolascon, Achille
    Piga, Antonio
    Forni, Gian Luca
    BLOOD, 2019, 134
  • [24] Selecting β-thalassemia Patients for Gene Therapy: A Decision-making Algorithm
    Baronciani, Donatella
    Casale, Maddalena
    De Franceschi, Lucia
    Graziadei, Giovanna
    Longo, Filomena
    Origa, Raffaella
    Rigano, Paolo
    Pinto, Valeria
    Marchetti, Monia
    Gigante, Antonia
    Forni, Gian Luca
    HEMASPHERE, 2021, 5 (05):
  • [25] Genetic Algorithm Selection of Interacting Features (GASIF) for Selecting Biological Gene-Gene Interactions
    Kumar, Rachit
    Zhang, David
    Ritchie, Marylyn D.
    PROCEEDINGS OF THE 2024 GENETIC AND EVOLUTIONARY COMPUTATION CONFERENCE, GECCO 2024, 2024, : 1282 - 1290
  • [26] Collectively Coincidence Results and Selecting Families
    Donal O’Regan
    Bulletin of the Iranian Mathematical Society, 2023, 49
  • [27] Collectively Coincidence Results and Selecting Families
    O'Regan, Donal
    BULLETIN OF THE IRANIAN MATHEMATICAL SOCIETY, 2023, 49 (06)
  • [28] In silico identification and Bayesian phylogenetic analysis of multiple new mammalian kallikrein gene families
    Elliott, Marc B.
    Irwin, David M.
    Diamandis, Eleftherios P.
    GENOMICS, 2006, 88 (05) : 591 - 599
  • [29] Evolution of four gene families with patchy phylogenetic distributions: Influx of genes into protist genomes
    Andersson J.O.
    Hirt R.P.
    Foster P.G.
    Roger A.J.
    BMC Evolutionary Biology, 6 (1)
  • [30] Genome-wide comparative phylogenetic analysis of the rice and Arabidopsis Dof gene families
    Lijavetzky, D
    Carbonero, P
    Vicente-Carbajosa, J
    BMC EVOLUTIONARY BIOLOGY, 2003, 3 (1)