Phylofactorization: a graph partitioning algorithm to identify phylogenetic scales of ecological data

Cited by: 42
Authors
Washburne, Alex D. [1]
Silverman, Justin D. [2, 3]
Morton, James T. [4, 5]
Becker, Daniel J. [1]
Crowley, Daniel [1]
Mukherjee, Sayan [3, 6]
David, Lawrence A. [3]
Plowright, Raina K. [1]
Affiliations
[1] Montana State Univ, Dept Microbiol & Immunol, Bozeman, MT 59717 USA
[2] Duke Univ, Program Computat Biol & Bioinformat, Durham, NC 27708 USA
[3] Duke Univ, Ctr Genom & Computat Biol, Durham, NC 27708 USA
[4] Univ Calif San Diego, Dept Comp Sci, La Jolla, CA 92037 USA
[5] Univ Calif San Diego, Dept Pediat, La Jolla, CA 92037 USA
[6] Duke Univ, Dept Stat Sci Math & Comp Sci, Durham, NC 27708 USA
Keywords
community ecology; dimensionality reduction; graph partitioning; microbiome; phylofactorization; phylogeny; R package; body size; evolution; dynamics
DOI
10.1002/ecm.1353
Chinese Library Classification number
Q14 [Ecology (bioecology)]
Discipline classification code
071012; 0713
Abstract
The problem of pattern and scale is a central challenge in ecology. In community ecology, an important scale is that at which we aggregate species to define our units of study, such as aggregation of "nitrogen fixing trees" to understand patterns in carbon sequestration. With the emergence of massive community ecological data sets, there is a need to objectively identify the scales for aggregating species to capture well-defined patterns in community ecological data. The phylogeny is a scaffold for identifying scales of species-aggregation associated with macroscopic patterns. Phylofactorization was developed to identify phylogenetic scales underlying patterns in relative abundance data, but many ecological data, such as presence-absences and counts, are not relative abundances yet may still have phylogenetic scales capturing patterns of interest. Here, we broaden phylofactorization to a graph-partitioning algorithm identifying phylogenetic scales in community ecological data. As a graph-partitioning algorithm, phylofactorization connects many tools from data analysis to phylogenetically informed analyses of community ecological data. Two-sample tests identify five phylogenetic factors of mammalian body mass which arose during the K-Pg extinction event, consistent with other analyses of mammalian body mass evolution. Projection of data onto coordinates connecting the phylogeny and graph-partitioning algorithm yields a phylogenetic principal components analysis which refines our understanding of the major sources of variation in the human gut microbiome. These same coordinates allow generalized additive modeling of microbes in Central Park soils, confirming that a large clade of Acidobacteria thrives in neutral soils. The graph-partitioning algorithm extends to generalized linear and additive modeling of exponential family random variables by phylogenetically constrained reduced-rank regression or stepwise factor contrasts.
All of these tools can be implemented with the R package phylofactor.
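The graph-partitioning idea in the abstract can be illustrated with a small sketch. It is not the phylofactor package's API or the authors' implementation. It is a toy greedy procedure under stated assumptions: the tree is a hypothetical parent-to-children dictionary, each species has one trait value, and an absolute Welch t statistic stands in for the two-sample test. At each step, every edge splits some current group of species in two, and the edge with the largest statistic is taken as the next factor.

```python
from math import sqrt
from statistics import mean, variance


def tips_below(children, node):
    """Return the set of tip (leaf) names descending from `node`."""
    kids = children.get(node, [])
    if not kids:
        return {node}
    tips = set()
    for kid in kids:
        tips |= tips_below(children, kid)
    return tips


def welch_t(a, b):
    """Absolute Welch t statistic; 0.0 if either group has fewer than 2 values."""
    if len(a) < 2 or len(b) < 2:
        return 0.0
    se = sqrt(variance(a) / len(a) + variance(b) / len(b))
    return abs(mean(a) - mean(b)) / se if se > 0 else float("inf")


def phylofactor_sketch(children, trait, n_factors):
    """Greedy edge partitioning (illustrative only, not the package's method).

    children: hypothetical dict mapping each internal node to its child nodes
    trait:    dict mapping each tip name to a numeric trait value
    Returns the chosen edges with their scores, and the final groups of tips.
    """
    edges = [(parent, kid) for parent, kids in children.items() for kid in kids]
    clade = {edge: frozenset(tips_below(children, edge[1])) for edge in edges}
    bins = [frozenset(trait)]  # current partition of the tips
    factors = []
    for _ in range(n_factors):
        best = None
        for group in bins:
            for edge, tips in clade.items():
                inside = tips & group
                outside = group - inside
                if not inside or not outside:
                    continue  # this edge does not split this group
                score = welch_t([trait[t] for t in inside],
                                [trait[t] for t in outside])
                if best is None or score > best[0]:
                    best = (score, edge, group, inside, outside)
        if best is None:
            break  # no edge splits any remaining group
        score, edge, group, inside, outside = best
        bins.remove(group)
        bins += [inside, outside]
        factors.append((edge, score, sorted(inside)))
    return factors, bins


# Made-up example: two clades with clearly different trait values.
children = {"r": ["A", "B"], "A": ["a1", "a2", "a3"], "B": ["b1", "b2", "b3"]}
trait = {"a1": 1.0, "a2": 1.2, "a3": 0.9, "b1": 10.0, "b2": 10.5, "b3": 9.8}
factors, bins = phylofactor_sketch(children, trait, n_factors=1)
print(factors[0][0], sorted(sorted(b) for b in bins))
```

In this toy data the first factor separates the two clades, since single-tip splits score zero under the stand-in statistic. Swapping `welch_t` for another objective is how the sketch would mirror the abstract's point that different data-analysis tools can plug into the same partitioning scheme.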
Pages: 27