A latent scale model to minimize subjectivity in the analysis of visual rating data for the National Turfgrass Evaluation Program

Cited by: 1
Authors
Qu, Yuanshuo [1]
Kne, Len [2]
Graham, Steve [3]
Watkins, Eric [4]
Morris, Kevin [1]
Affiliations
[1] Natl Turfgrass Evaluat Program, Beltsville, MD 20705 USA
[2] Univ Minnesota, U Spatial, Minneapolis, MN USA
[3] Univ Minnesota, U Spatial, Duluth, MN USA
[4] Univ Minnesota, Dept Hort Sci, St Paul, MN USA
Source
Keywords
NTEP; visual ratings; cultivar evaluation; subjectivity minimization; Bayesian model; multiplicative interaction; performance trials
DOI
10.3389/fpls.2023.1135918
Chinese Library Classification number
Q94 [Botany]
Discipline classification code
071001
Abstract
Introduction: The traditional evaluation procedure in the National Turfgrass Evaluation Program (NTEP) relies on visually assessing replicated turf plots at multiple testing locations. This process yields ordinal data; however, the subsequent analysis has almost exclusively applied statistical models that wrongly treat these ratings as interval or ratio data. This practice raises concerns about procedural subjectivity and prevents objective comparisons of cultivars across different test locations. It may also lead to serious errors, such as increased false alarms, failures to detect effects, and even inversions of differences among groups.
Methods: We reviewed this problem, identified sources of subjectivity, and presented a model-based approach that minimizes subjectivity, allowing objective comparisons of cultivars across different locations and better monitoring of the evaluation procedure. We demonstrate how to fit the described model in a Bayesian framework with Stan, using datasets on overall turf quality ratings from the 2017 NTEP Kentucky bluegrass trials at seven testing locations.
Results: Compared with the existing method, ours allows the estimation of additional parameters (category thresholds, rater severity, and within-field spatial variation) and provides better separation of cultivar means and more realistic standard deviations.
Discussion: Implementing the proposed model requires additional information on rater identification, trial layout, and rating date. Given the model assumptions, we recommend small trials to reduce rater fatigue. For large trials, each replication can be rated on a separate occasion instead of rating all replications at once. To minimize subjectivity, multiple raters are required. We also proposed new ideas on temporal analysis that incorporate existing knowledge of turfgrass.
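To make the idea of category thresholds and rater severity concrete, the following is a minimal Python sketch of a generic cumulative-logit (ordered logistic) rating model. It is an illustration under stated assumptions, not the authors' Stan model: the function names, the nine-category scale, the threshold values, and the way severity is subtracted from the latent quality are all hypothetical choices made for this example, and the within-field spatial term is left out.

```python
import math


def logistic(x):
    """Standard logistic function."""
    return 1.0 / (1.0 + math.exp(-x))


def category_probabilities(latent_quality, rater_severity, thresholds):
    """Probability of each ordinal category under a cumulative-logit model.

    Assumption for this sketch: a more severe rater shifts the plot's
    latent quality downward, so eta = latent_quality - rater_severity.
    `thresholds` must be sorted in increasing order; K thresholds
    define K + 1 ordered categories.
    """
    eta = latent_quality - rater_severity
    # P(rating <= k) at each threshold, then 1.0 for the top category.
    cumulative = [logistic(t - eta) for t in thresholds] + [1.0]
    probs = []
    previous = 0.0
    for c in cumulative:
        probs.append(c - previous)
        previous = c
    return probs


def expected_rating(probs):
    """Mean rating when categories are labelled 1, 2, ..., len(probs)."""
    return sum((k + 1) * p for k, p in enumerate(probs))


# Hypothetical thresholds: 8 cut points give 9 ordered categories.
thresholds = [-3.0, -2.0, -1.0, 0.0, 1.0, 2.0, 3.0, 4.0]

# The same plot (latent quality 1.5) seen by a lenient and a severe rater.
lenient = category_probabilities(1.5, -0.5, thresholds)
severe = category_probabilities(1.5, 0.5, thresholds)
```

In this sketch the same plot receives a different distribution over rating categories depending on who rates it, which is the kind of rater effect a latent scale model can separate from cultivar differences when rater identity is recorded.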
Pages: 9