Model-based simultaneous clustering and ordination of multivariate abundance data in ecology

被引：9

作者：

Hui, Francis K. C. ^{[1
]}

机构：

[1] Australian Natl Univ, Inst Math Sci, Canberra, ACT 0200, Australia

来源：

COMPUTATIONAL STATISTICS & DATA ANALYSIS | 2017年 / 105卷

关键词：

Dimension reduction; Finite mixture models; Hierarchical Bayesian model; Mixtures of factor analyzers; Latent variable model; INFORMATION CRITERIA; MIXTURE-MODELS; BAYESIAN MODELS;

D O I：

10.1016/j.csda.2016.07.008

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

When studying multivariate abundance data, one of the main patterns ecologists are often interested in is whether the sites exhibit clustering on the low-dimensional, ordination space representing species composition. A new model-based approach called CORAL (Clustering and Ordination Regression AnaLysis) is developed for tackling this question, based on performing simultaneous clustering and ordination using latent variable regression. By drawing the latent variables from a finite mixture density, CORAL probabilistically classifies sites based on their positions on an underlying signal space. This is similar to mixtures of factor analyzers, except CORAL is designed for non-normal responses and uses species-specific rather than cluster-specific factor loadings (regression coefficients). Estimation is performed via Bayesian MCMC sampling, with code provided in the Supplementary Material. Simulations demonstrate that, by utilizing the joint information available in the data for both classification and dimension reduction, CORAL outperforms several popular, algorithm-based methods for clustering and ordination in ecology. CORAL is applied to a dataset of presence-absence records collected at sites along the Doubs River near the France-Switzerland border, with results revealing two clusters or ecological regions partly resembling the spatial separation of upstream and downstream sites. (C) 2016 Elsevier B.V. All rights reserved.

引用

页码：1 / 10

页数：10

共 50 条

[1] Model-based clustering for multivariate functional data
Jacques, Julien
Preda, Cristian
[J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 71 : 92 - 106
[2] Probabilistic model-based clustering of multivariate and sequential data
Smyth, P
[J]. ARTIFICIAL INTELLIGENCE AND STATISTICS 99, PROCEEDINGS, 1999, : 299 - 304
[3] Model-based clustering for multivariate partial ranking data
Jacques, Julien
Biernacki, Christophe
[J]. JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2014, 149 : 201 - 217
[4] mvabund- an R package for model-based analysis of multivariate abundance data
Wang, Yi
Naumann, Ulrike
Wright, Stephen T.
Warton, David I.
[J]. METHODS IN ECOLOGY AND EVOLUTION, 2012, 3 (03): : 471 - 474
[5] A Model-Based Approach to Simultaneous Clustering and Dimensional Reduction of Ordinal Data
Monia Ranalli
Roberto Rocci
[J]. Psychometrika, 2017, 82 : 1007 - 1034
[6] A Model-Based Approach to Simultaneous Clustering and Dimensional Reduction of Ordinal Data
Ranalli, Monia
Rocci, Roberto
[J]. PSYCHOMETRIKA, 2017, 82 (04) : 1007 - 1034
[7] Model-based clustering of multivariate skew data with circular components and missing values
Lagona, Francesco
Picone, Marco
[J]. JOURNAL OF APPLIED STATISTICS, 2012, 39 (05) : 927 - 945
[8] A Model-Based Multivariate Time Series Clustering Algorithm
Zhou, Pei-Yuan
Chan, Keith C. C.
[J]. TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING, 2014, 8643 : 805 - 817
[9] Model-based clustering of longitudinal data
McNicholas, Paul D.
Murphy, T. Brendan
[J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2010, 38 (01): : 153 - 168
[10] Boosting for model-based data clustering
Saffari, Amir
Bischof, Horst
[J]. PATTERN RECOGNITION, 2008, 5096 : 51 - 60

← 1 2 3 4 5 →