PhyInformR: phylogenetic experimental design and phylogenomic data exploration in R

被引:28
|
作者
Dornburg, Alex [1 ]
Fisk, J. Nick [2 ]
Tamagnan, Jules [3 ]
Townsend, Jeffrey P. [2 ,4 ,5 ]
机构
[1] North Carolina Museum Nat Sci, Raleigh, NC 27601 USA
[2] Yale Univ, Dept Biostat, New Haven, CT 06510 USA
[3] Yale Univ, Yale Sch Publ Hlth, Ctr Infect Dis Modeling & Anal, New Haven, CT 06510 USA
[4] Yale Univ, Dept Ecol & Evolutionary Biol, New Haven, CT 06525 USA
[5] Yale Univ, Program Computat Biol & Bioinformat, New Haven, CT 06511 USA
来源
BMC EVOLUTIONARY BIOLOGY | 2016年 / 16卷
关键词
INFORMATIVENESS; DIVERSIFICATION; ACTINOPTERYGII; SELECTION; ANCIENT; EXAMPLE; GENES; TREE;
D O I
10.1186/s12862-016-0837-3
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Analyses of phylogenetic informativeness represent an important step in screening potential or existing datasets for their proclivity toward convergent or parallel evolution of molecular sites. However, while new theory has been developed from which to predict the utility of sequence data, adoption of these advances have been stymied by a lack of software enabling application of advances in theory, especially for large next-generation sequence data sets. Moreover, there are no theoretical barriers to application of the phylogenetic informativeness or the calculation of quartet internode resolution probabilities in a Bayesian setting that more robustly accounts for uncertainty, yet there is no software with which a computationally intensive Bayesian approach to experimental design could be implemented. Results: We introduce PhyInformR, an open source software package that performs rapid calculation of phylogenetic information content using the latest advances in phylogenetic informativeness based theory. These advances include modifications that incorporate uneven branch lengths and any model of nucleotide substitution to provide assessments of the phylogenetic utility of any given dataset or dataset partition. PhyInformR provides new tools for data visualization and routines optimized for rapid statistical calculations, including approaches making use of Bayesian posterior distributions and parallel processing. By implementing the computation on user hardware, PhyInformR increases the potential power users can apply toward screening datasets for phylogenetic/genomic information content by orders of magnitude. Conclusions: PhyInformR provides a means to implement diverse substitution models and specify uneven branch lengths for phylogenetic informativeness or calculations providing quartet based probabilities of resolution, produce novel visualizations, and facilitate analyses of next-generation sequence datasets while incorporating phylogenetic uncertainty through the use parallel processing. As an open source program, PhyInformR is fully customizable and expandable, thereby allowing for advanced methodologies to be readily integrated into local bioinformatics pipelines. Software is available through CRAN and a package containing the software, a detailed manual, and additional sample data is also provided freely through github: https://github.com/carolinafishes/PhyInformR.
引用
收藏
页码:1 / 7
页数:7
相关论文
共 50 条
  • [21] SLIDER: Software for LongItudinal Data Exploration with R
    Commenges, Hadrien
    Pistre, Pierre
    Cura, Robin
    [J]. CYBERGEO-EUROPEAN JOURNAL OF GEOGRAPHY, 2014,
  • [22] A New Orthology Assessment Method for Phylogenomic Data: Unrooted Phylogenetic Orthology (vol 33, pg 2117, 2016)
    Ballesteros, Jesus A.
    Hormiga, Gustavo
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2016, 33 (09) : 2481 - 2481
  • [23] Exploration of Experimental Teaching Mode for Graphic Design
    Zhang, Lei
    [J]. PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON ARTS, DESIGN AND CONTEMPORARY EDUCATION (ICADCE 2015), 2015, 23 : 557 - 560
  • [24] Phylogenetic relationships of the Chinese torrent frogs (Ranidae: Amolops) revealed by phylogenomic analyses of AFLP-Capture data
    Zeng, Zhaochi
    Liang, Dan
    Li, Jiaxuan
    Lyu, Zhitong
    Wang, Yingyong
    Zhang, Peng
    [J]. MOLECULAR PHYLOGENETICS AND EVOLUTION, 2020, 146
  • [25] Phylogenetic position of Sipuncula derived from multi-gene and phylogenomic data and its implication for the evolution of segmentation
    Dordel, Janina
    Fisse, Frauke
    Purschke, Guenter
    Struck, Torsten H.
    [J]. JOURNAL OF ZOOLOGICAL SYSTEMATICS AND EVOLUTIONARY RESEARCH, 2010, 48 (03) : 197 - 207
  • [26] Design space exploration for data path synthesis
    Mandal, CA
    Chakrabarti, PP
    Ghose, S
    [J]. TENTH INTERNATIONAL CONFERENCE ON VLSI DESIGN, PROCEEDINGS, 1997, : 166 - 171
  • [27] Developing a Design Inquiry Method for Data Exploration
    Kun, Peter
    Mulder, Ingrid
    Kortuem, Gerd
    [J]. INTERACTION DESIGN AND ARCHITECTURES, 2020, (45) : 180 - 206
  • [28] Exploration of big traffic data for urban areas with R
    Wang, Xiao-Xia
    Liu, Xi-Yuan
    Li, Zhan-Qiang
    [J]. JOURNAL OF DISCRETE MATHEMATICAL SCIENCES & CRYPTOGRAPHY, 2018, 21 (02): : 507 - 511
  • [29] Phylogenomic analysis of brachyuran crabs using transcriptome data reveals possible sources of conflicting phylogenetic relationships within the group
    Pan, Da
    Sun, Yunlong
    Shi, Boyang
    Wang, Ruxiao
    Ng, Peter K. L.
    Guinot, Daniele
    Cumberlidge, Neil
    Sun, Hongying
    [J]. MOLECULAR PHYLOGENETICS AND EVOLUTION, 2024, 201
  • [30] Exploration and Practice of Independent Design Experiments in Experimental Teaching
    Lian, Yu-Xin
    Hou, Yun-Peng
    Hou, Bo-Ya
    Li, Yan
    Wu, Jian-Qiang
    [J]. 2016 INTERNATIONAL CONFERENCE ON EDUCATION SCIENCE AND EDUCATION MANAGEMENT (ESEM 2016), 2016, : 107 - 111