A hierarchical Bayesian approach for handling missing classification data

被引:5
|
作者
Ketz, Alison C. [1 ,2 ]
Johnson, Therese L. [3 ]
Hooten, Mevin B. [4 ,5 ,6 ]
Hobbs, N. Thompson [1 ,2 ]
机构
[1] Colorado State Univ, Dept Ecosyst Sci & Sustainabil, Nat Resource Ecol Lab, Ft Collins, CO 80523 USA
[2] Colorado State Univ, Grad Degree Program Ecol, Ft Collins, CO 80523 USA
[3] Nat Pk Serv, Rocky Mt Natl Pk, Estes Pk, CO USA
[4] Colorado State Univ, US Geol Survey, Colorado Cooperat Fish & Wildlife Res Unit, Ft Collins, CO 80523 USA
[5] Colorado State Univ, Dept Fish Wildlife & Conservat Biol, Ft Collins, CO 80523 USA
[6] Colorado State Univ, Dept Stat, Ft Collins, CO 80523 USA
来源
ECOLOGY AND EVOLUTION | 2019年 / 9卷 / 06期
基金
美国国家科学基金会;
关键词
Cervus elaphus nelsoni; classification data; demographic ratio; elk; hierarchical Bayesian statistics; missing not at random data; multinomial distribution; proportion estimation; sex ratio; Wildlife Management; MARK-RECAPTURE; LIFE-HISTORY; DISEASE PROGRESSION; DEMOGRAPHIC DRIVERS; SEXUAL SEGREGATION; AGE RATIOS; POPULATION; CAPTURE; ABUNDANCE; INFERENCE;
D O I
10.1002/ece3.4927
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
Ecologists use classifications of individuals in categories to understand composition of populations and communities. These categories might be defined by demographics, functional traits, or species. Assignment of categories is often imperfect, but frequently treated as observations without error. When individuals are observed but not classified, these "partial" observations must be modified to include the missing data mechanism to avoid spurious inference. We developed two hierarchical Bayesian models to overcome the assumption of perfect assignment to mutually exclusive categories in the multinomial distribution of categorical counts, when classifications are missing. These models incorporate auxiliary information to adjust the posterior distributions of the proportions of membership in categories. In one model, we use an empirical Bayes approach, where a subset of data from one year serves as a prior for the missing data the next. In the other approach, we use a small random sample of data within a year to inform the distribution of the missing data. We performed a simulation to show the bias that occurs when partial observations were ignored and demonstrated the altered inference for the estimation of demographic ratios. We applied our models to demographic classifications of elk (Cervus elaphus nelsoni) to demonstrate improved inference for the proportions of sex and stage classes. We developed multiple modeling approaches using a generalizable nested multinomial structure to account for partially observed data that were missing not at random for classification counts. Accounting for classification uncertainty is important to accurately understand the composition of populations and communities in ecological studies.
引用
下载
收藏
页码:3130 / 3140
页数:11
相关论文
共 50 条
  • [1] Hierarchical Bayesian networks: An approach to classification and learning for structured data
    Gyftodimos, E
    Flach, PA
    METHODS AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, 3025 : 291 - 300
  • [2] Bayesian Hierarchical Models for Ordinal and Missing Data
    Zhao Qiang
    You Haiyan
    DATA PROCESSING AND QUANTITATIVE ECONOMY MODELING, 2010, : 464 - +
  • [3] Classification using a hierarchical Bayesian approach
    Mathis, C
    Breuel, T
    16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITON, VOL IV, PROCEEDINGS, 2002, : 103 - 106
  • [4] A Bayesian approach for analyzing hierarchical data with missing outcomes through structural equation models
    Song, Xin-Yuan
    Lee, Sik-Yum
    STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2008, 15 (02) : 272 - 300
  • [6] Hierarchical Bayesian Analysis of Repeated Binary Data with Missing Covariates
    Yu, Fang
    Chen, Ming-Hui
    Huang, Lan
    Anderson, Gregory J.
    TOPICS IN APPLIED STATISTICS, 2013, 55 : 311 - 322
  • [7] A Bayesian Hierarchical Selection Model for Academic Growth With Missing Data
    Allen, Jeff
    APPLIED MEASUREMENT IN EDUCATION, 2017, 30 (02) : 147 - 162
  • [8] A method of handling missing data in the context of learning Bayesian network structure
    Chen, Chong
    Yu, Hua
    Wang, Juyun
    APPLIED SCIENCE AND PRECISION ENGINEERING INNOVATION, PTS 1 AND 2, 2014, 479-480 : 906 - +
  • [9] A kernel PLS based classification method with missing data handling
    Thuy Tuong Nguyen
    Yury Tsoy
    Statistical Papers, 2017, 58 : 211 - 225
  • [10] A kernel PLS based classification method with missing data handling
    Thuy Tuong Nguyen
    Tsoy, Yury
    STATISTICAL PAPERS, 2017, 58 (01) : 211 - 225