AIC MODEL SELECTION IN OVERDISPERSED CAPTURE-RECAPTURE DATA

被引:518
|
作者
ANDERSON, DR [1 ]
BURNHAM, KP [1 ]
WHITE, GC [1 ]
机构
[1] COLORADO STATE UNIV,DEPT FISHERY & WILDLIFE BIOL,FT COLLINS,CO 80523
关键词
AIC; AKAIKE; CAPTURE RECAPTURE; CORMACK JOLLY SEBER MODEL; EXTRA-BINOMIAL VARIATION; KULLBACK LEIBLER DISCREPANCY; MODEL SELECTION; OVERDISPERSION;
D O I
10.2307/1939637
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Selection of a proper model as a basis for statistical inference from capture-recapture data is critical. This is especially so when using open models in the analysis of multiple, interrelated data sets (e.g., males and females, with 2-3 age classes, over 3-5 areas and 10-15 yr). The most general model considered for such data sets might contain 1000 survival and recapture parameters. This paper presents numerical results on three information-theoretic methods for model selection when the data are overdispersed (i.e., a lack of independence so that extra-binomial variation occurs). Akaike's information criterion (AIC), a second-order adjustment to AIC for bias (AIC(c)), and a dimension-consistent criterion (CAIC) were modified using an empirical estimate of the average overdispersion, based on quasi-likelihood theory. Quality of model selection was evaluated based on the Euclidian distance between standardized <(theta)over cap> and theta (parameter theta is vector valued); this quantity (a type of residual sum of squares, hence denoted as RSS) is a combination of squared bias and variance. Five results seem to be of general interest for these product-multinomial models. First, when there was overdispersion the most direct estimator of the variance inflation factor was positively biased and the relative bias increased with the amount of overdispersion. Second, AIC and AIC(c), unadjusted for overdispersion using quasi-likelihood theory, performed poorly in selecting a model with a small RSS value when the data were overdispersed (i.e., overfitted models were selected when compared to the model with the minimum RRS value). Third, the information-theoretic criteria, adjusted for overdispersion, performed well, selected parismonious models, and had a good balance between under- and overfitting the data. Fourth, generally, the dimension-consistent criterion selected models with fewer parameters than the other criteria, had smaller RSS values, but clearly was in error by underfitting when compared with the model with the minimum RSS value. Fifth, even if the true model structure (but not the actual parameter values in the model) is known, that true model, when fitted to the data (by parameter estimation) is a relatively poor basis for statistical inference when that true model includes several, let alone many, estimated parameters that are not significantly different from O.
引用
收藏
页码:1780 / 1793
页数:14
相关论文
共 50 条
  • [1] MODEL SELECTION STRATEGY IN THE ANALYSIS OF CAPTURE-RECAPTURE DATA
    BURNHAM, KP
    WHITE, GC
    ANDERSON, DR
    [J]. BIOMETRICS, 1995, 51 (03) : 888 - 898
  • [2] A hierarchical model for spatial capture-recapture data
    Royle, J. Andrew
    Young, Kevin V.
    [J]. ECOLOGY, 2008, 89 (08) : 2281 - 2289
  • [3] Bayesian model selection for spatial capture-recapture models
    Dey, Soumen
    Delampady, Mohan
    Gopalaswamy, Arjun M.
    [J]. ECOLOGY AND EVOLUTION, 2019, 9 (20): : 11569 - 11583
  • [4] ESTIMATING SELECTION ON QUANTITATIVE TRAITS USING CAPTURE-RECAPTURE DATA
    KINGSOLVER, JG
    SMITH, SG
    [J]. EVOLUTION, 1995, 49 (02) : 384 - 388
  • [5] A hierarchical model for spatial capture-recapture data: comment
    Marques, Tiago A.
    Thomas, Len
    Royle, J. Andrew
    [J]. ECOLOGY, 2011, 92 (02) : 526 - 528
  • [6] A latent variable regression model for capture-recapture data
    Thandrayen, Joanne
    Wang, Yan
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2009, 53 (07) : 2740 - 2746
  • [7] APPLICATIONS OF A MULTINOMIAL CAPTURE-RECAPTURE MODEL TO EPIDEMIOLOGICAL DATA
    WITTES, JT
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1974, 69 (345) : 93 - 97
  • [8] Model selection and population size using capture-recapture methods
    McGilchrist, CA
    [J]. JOURNAL OF CLINICAL EPIDEMIOLOGY, 1999, 52 (10) : 915 - 915
  • [9] The selection from multiple data sources in epidemiological capture-recapture studies
    Hay, G
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES D-THE STATISTICIAN, 1997, 46 (04) : 515 - 520
  • [10] Bayesian model discrimination for multiple strata capture-recapture data
    King, R
    Brooks, SP
    [J]. BIOMETRIKA, 2002, 89 (04) : 785 - 806