Semi-parametric Bayesian Inference for Multi-Season Baseball Data

被引:5
|
作者
Quintana, Fernando A. [1 ]
Mueller, Peter [2 ]
Rosner, Gary L. [2 ]
Munsell, Mark [2 ]
机构
[1] Pontificia Univ Catolica Chile, Dept Estadist, Santiago, Chile
[2] Univ Texas MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
来源
BAYESIAN ANALYSIS | 2008年 / 3卷 / 02期
关键词
Dirichlet Process; Partial Exchangeability; Semiparametric Random Effects;
D O I
10.1214/08-BA312
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
We analyze complete sequences of successes (hits, walks, and sacrifices) for a group of players from the American and National Leagues, collected over 4 seasons. The goal is to describe how players' performance vary from season to season. In particular, we wish to assess and compare the effect of available occasion-specific covariates over seasons. The data are binary sequences for each player and each season. We model dependence in the binary sequence by an autoregressive logistic model. The model includes lagged terms up to a fixed order. For each player and season we introduce a different set of autologistic regression coefficients, i.e., the regression coefficients are random effects that are specific of each season and player. We use a nonparametric approach to define a random effects distribution. The nonparametric model is defined as a mixture with a Dirichlet process prior for the mixing measure. The described model is justified by a representation theorem for order-k exchangeable sequences. Besides the repeated measurements for each season and player, multiple seasons within a given player define an additional level of repeated measurements. We introduce dependence at this level of repeated measurements by relating the season-specific random effects vectors in an autoregressive fashion. We ultimately conclude that while some covariates like the ERA of the opposing pitcher are always relevant, others like an indicator for the game being into the seventh inning may be significant only for certain season, and some others, like the score of the game, can safely be ignored.
引用
收藏
页码:317 / 338
页数:22
相关论文
共 50 条
  • [1] Bivariate Semi-Parametric Model: Bayesian Inference
    Debashis Samanta
    Debasis Kundu
    Methodology and Computing in Applied Probability, 2023, 25
  • [2] Bivariate Semi-Parametric Model: Bayesian Inference
    Samanta, Debashis
    Kundu, Debasis
    METHODOLOGY AND COMPUTING IN APPLIED PROBABILITY, 2023, 25 (04)
  • [3] SEMI-PARAMETRIC INFERENCE FOR COPULA MODELS FOR TRUNCATED DATA
    Emura, Takeshi
    Wang, Weijing
    Hung, Hui-Nien
    STATISTICA SINICA, 2011, 21 (01) : 349 - 367
  • [4] Semi-parametric and Parametric Inference of Extreme Value Models for Rainfall Data
    AghaKouchak, Amir
    Nasrollahi, Nasrin
    WATER RESOURCES MANAGEMENT, 2010, 24 (06) : 1229 - 1249
  • [5] Semi-parametric and Parametric Inference of Extreme Value Models for Rainfall Data
    Amir AghaKouchak
    Nasrin Nasrollahi
    Water Resources Management, 2010, 24 : 1229 - 1249
  • [6] A semi-parametric Bayesian model for semi-continuous longitudinal data
    Ren, Junting
    Tapert, Susan
    Fan, Chun Chieh
    Thompson, Wesley K.
    STATISTICS IN MEDICINE, 2022, 41 (13) : 2354 - 2374
  • [7] Inference in semi-parametric spline mixed models for longitudinal data
    Sinha S.K.
    Sattar A.
    METRON, 2015, 73 (3) : 377 - 395
  • [8] INFERENCE IN SEMI-PARAMETRIC DYNAMIC MODELS FOR REPEATED COUNT DATA
    Sutradhar, Brajendra C.
    Warriyar, K. V. Vineetha
    Zheng, Nan
    AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2016, 58 (03) : 397 - 434
  • [9] Bayesian semi-parametric ROC analysis
    Erkanli, Alaattin
    Sung, Minje
    Costello, E. Jane
    Angold, Adrian
    STATISTICS IN MEDICINE, 2006, 25 (22) : 3905 - 3928
  • [10] Semi-parametric analysis of multi-rater data
    Rogers, Simon
    Girolami, Mark
    Polajnar, Tamara
    STATISTICS AND COMPUTING, 2010, 20 (03) : 317 - 334