A Bayesian approach for analysis of ordered categorical responses subject to misclassification

被引:2
|
作者
Ling, Ashley [1 ]
Hay, El Hamidi [2 ]
Aggrey, Samuel E. [3 ,4 ]
Rekaya, Romdhane [1 ,3 ,5 ]
机构
[1] Univ Georgia, Dept Anismal & Dairy Sci, Athens, GA 30602 USA
[2] USDA ARS, Ft Keogh Livestock & Range Res Lab, Miles City, MT 59301 USA
[3] Univ Georgia, Inst Bioinformat, Athens, GA 30602 USA
[4] Univ Georgia, Dept Poultry Sci, Athens, GA 30602 USA
[5] Univ Georgia, Dept Stat, Athens, GA 30602 USA
来源
PLOS ONE | 2018年 / 13卷 / 12期
基金
美国农业部;
关键词
BINARY RESPONSES; LOGISTIC-REGRESSION; GENETIC-PARAMETERS; COMPOSITE LINE; CALVING EASE; BEEF-CATTLE; BIAS; MODELS; ERRORS;
D O I
10.1371/journal.pone.0208433
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Ordinal categorical responses are frequently collected in survey studies, human medicine, and animal and plant improvement programs, just to mention a few. Errors in this type of data are neither rare nor easy to detect. These errors tend to bias the inference, reduce the statistical power and ultimately the efficiency of the decision-making process. Contrarily to the binary situation where misclassification occurs between two response classes, noise in ordinal categorical data is more complex due to the increased number of categories, diversity and asymmetry of errors. Although several approaches have been presented for dealing with misclassification in binary data, only limited practical methods have been proposed to analyze noisy categorical responses. A latent variable model implemented within a Bayesian framework was proposed to analyze ordinal categorical data subject to misclassification using simulated and real datasets. The simulated scenario consisted of a discrete response with three categories and a symmetric error rate of 5% between any two classes. The real data consisted of calving ease records of beef cows. Using real and simulated data, ignoring misclassification resulted in substantial bias in the estimation of genetic parameters and reduction of the accuracy of predicted breeding values. Using our proposed approach, a significant reduction in bias and increase in accuracy ranging from 11% to 17% was observed. Furthermore, most of the misclassified observations (in the simulated data) were identified with a substantially higher probability. Similar results were observed for a scenario with asymmetric misclassification. While the extension to traits with more categories between adjacent classes is straightforward, it could be computationally costly. For traits with high heritability, the performance of the methodology would be expected to improve.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] A Bayesian approach to obtain confidence intervals for binomial proportion in a double sampling scheme subject to false-positive misclassification
    Lee, Seung-Chun
    Byun, Jong-Seok
    [J]. JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2008, 37 (04) : 393 - 403
  • [42] Latent Ornstein-Uhlenbeck models for Bayesian analysis of multivariate longitudinal categorical responses
    Tran, Trung Dung
    Lesaffre, Emmanuel
    Verbeke, Geert
    Duyck, Joke
    [J]. BIOMETRICS, 2021, 77 (02) : 689 - 701
  • [43] A semi-Markov model for binary longitudinal responses subject to misclassification
    Rosychuk, RJ
    Thompson, ME
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2001, 29 (03): : 395 - 404
  • [44] Bayesian inference for categorical data analysis
    Agresti A.
    Hitchcock D.B.
    [J]. Statistical Methods and Applications, 2005, 14 (3): : 297 - 330
  • [45] Benchmark dose calculation for ordered categorical responses with multiple endpoints
    Chu-Chih Chen
    Yin-Han Wang
    [J]. Stochastic Environmental Research and Risk Assessment, 2019, 33 : 535 - 543
  • [46] A two armed optimal response adaptive randomization for ordinal categorical responses with possible misclassification
    Das, Soumyadeep
    [J]. SEQUENTIAL ANALYSIS-DESIGN METHODS AND APPLICATIONS, 2024, 43 (02): : 131 - 146
  • [47] Benchmark dose calculation for ordered categorical responses with multiple endpoints
    Chen, Chu-Chih
    Wang, Yin-Han
    [J]. STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2019, 33 (02) : 535 - 543
  • [48] Modeling stage-sequential change in ordered categorical responses
    Kim, JS
    Böckenholt, U
    [J]. PSYCHOLOGICAL METHODS, 2000, 5 (03) : 380 - 400
  • [49] A Maximum Test for the Analysis of Ordered Categorical Data
    Neuhaeeuser, Markus
    [J]. JOURNAL OF MODERN APPLIED STATISTICAL METHODS, 2009, 8 (02) : 592 - 593
  • [50] Bayesian Analysis of Lung Cancer Mortality in the Presence of Misclassification
    Vahedi, Mohsen
    Pourhoseingholi, Mohamad Amin
    Baghestani, Ahmadreza
    Abadi, Alireza
    Sobhi, Sara
    Fazeli, Zeinab
    [J]. IRANIAN JOURNAL OF CANCER PREVENTION, 2013, 6 : 1 - 5