Model-based simultaneous clustering and ordination of multivariate abundance data in ecology

被引:9
|
作者
Hui, Francis K. C. [1 ]
机构
[1] Australian Natl Univ, Inst Math Sci, Canberra, ACT 0200, Australia
关键词
Dimension reduction; Finite mixture models; Hierarchical Bayesian model; Mixtures of factor analyzers; Latent variable model; INFORMATION CRITERIA; MIXTURE-MODELS; BAYESIAN MODELS;
D O I
10.1016/j.csda.2016.07.008
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
When studying multivariate abundance data, one of the main patterns ecologists are often interested in is whether the sites exhibit clustering on the low-dimensional, ordination space representing species composition. A new model-based approach called CORAL (Clustering and Ordination Regression AnaLysis) is developed for tackling this question, based on performing simultaneous clustering and ordination using latent variable regression. By drawing the latent variables from a finite mixture density, CORAL probabilistically classifies sites based on their positions on an underlying signal space. This is similar to mixtures of factor analyzers, except CORAL is designed for non-normal responses and uses species-specific rather than cluster-specific factor loadings (regression coefficients). Estimation is performed via Bayesian MCMC sampling, with code provided in the Supplementary Material. Simulations demonstrate that, by utilizing the joint information available in the data for both classification and dimension reduction, CORAL outperforms several popular, algorithm-based methods for clustering and ordination in ecology. CORAL is applied to a dataset of presence-absence records collected at sites along the Doubs River near the France-Switzerland border, with results revealing two clusters or ecological regions partly resembling the spatial separation of upstream and downstream sites. (C) 2016 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 10
页数:10
相关论文
共 50 条
  • [1] Model-based clustering for multivariate functional data
    Jacques, Julien
    Preda, Cristian
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 71 : 92 - 106
  • [2] Probabilistic model-based clustering of multivariate and sequential data
    Smyth, P
    [J]. ARTIFICIAL INTELLIGENCE AND STATISTICS 99, PROCEEDINGS, 1999, : 299 - 304
  • [3] Model-based clustering for multivariate partial ranking data
    Jacques, Julien
    Biernacki, Christophe
    [J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2014, 149 : 201 - 217
  • [4] mvabund- an R package for model-based analysis of multivariate abundance data
    Wang, Yi
    Naumann, Ulrike
    Wright, Stephen T.
    Warton, David I.
    [J]. METHODS IN ECOLOGY AND EVOLUTION, 2012, 3 (03): : 471 - 474
  • [5] A Model-Based Approach to Simultaneous Clustering and Dimensional Reduction of Ordinal Data
    Monia Ranalli
    Roberto Rocci
    [J]. Psychometrika, 2017, 82 : 1007 - 1034
  • [6] A Model-Based Approach to Simultaneous Clustering and Dimensional Reduction of Ordinal Data
    Ranalli, Monia
    Rocci, Roberto
    [J]. PSYCHOMETRIKA, 2017, 82 (04) : 1007 - 1034
  • [7] Model-based clustering of multivariate skew data with circular components and missing values
    Lagona, Francesco
    Picone, Marco
    [J]. JOURNAL OF APPLIED STATISTICS, 2012, 39 (05) : 927 - 945
  • [8] A Model-Based Multivariate Time Series Clustering Algorithm
    Zhou, Pei-Yuan
    Chan, Keith C. C.
    [J]. TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING, 2014, 8643 : 805 - 817
  • [9] Model-based clustering of longitudinal data
    McNicholas, Paul D.
    Murphy, T. Brendan
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2010, 38 (01): : 153 - 168
  • [10] Boosting for model-based data clustering
    Saffari, Amir
    Bischof, Horst
    [J]. PATTERN RECOGNITION, 2008, 5096 : 51 - 60