Semi-parametric Bayesian Inference for Multi-Season Baseball Data

被引:5
|
作者
Quintana, Fernando A. [1 ]
Mueller, Peter [2 ]
Rosner, Gary L. [2 ]
Munsell, Mark [2 ]
机构
[1] Pontificia Univ Catolica Chile, Dept Estadist, Santiago, Chile
[2] Univ Texas MD Anderson Canc Ctr, Dept Biostat, Houston, TX 77030 USA
来源
BAYESIAN ANALYSIS | 2008年 / 3卷 / 02期
关键词
Dirichlet Process; Partial Exchangeability; Semiparametric Random Effects;
D O I
10.1214/08-BA312
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
We analyze complete sequences of successes (hits, walks, and sacrifices) for a group of players from the American and National Leagues, collected over 4 seasons. The goal is to describe how players' performance vary from season to season. In particular, we wish to assess and compare the effect of available occasion-specific covariates over seasons. The data are binary sequences for each player and each season. We model dependence in the binary sequence by an autoregressive logistic model. The model includes lagged terms up to a fixed order. For each player and season we introduce a different set of autologistic regression coefficients, i.e., the regression coefficients are random effects that are specific of each season and player. We use a nonparametric approach to define a random effects distribution. The nonparametric model is defined as a mixture with a Dirichlet process prior for the mixing measure. The described model is justified by a representation theorem for order-k exchangeable sequences. Besides the repeated measurements for each season and player, multiple seasons within a given player define an additional level of repeated measurements. We introduce dependence at this level of repeated measurements by relating the season-specific random effects vectors in an autoregressive fashion. We ultimately conclude that while some covariates like the ERA of the opposing pitcher are always relevant, others like an indicator for the game being into the seventh inning may be significant only for certain season, and some others, like the score of the game, can safely be ignored.
引用
收藏
页码:317 / 338
页数:22
相关论文
共 50 条
  • [21] Bayesian high-dimensional semi-parametric inference beyond sub-Gaussian errors
    Kyoungjae Lee
    Minwoo Chae
    Lizhen Lin
    Journal of the Korean Statistical Society, 2021, 50 : 511 - 527
  • [22] Empirical likelihood inference for semi-parametric estimating equations
    Wang ShanShan
    Cui HengJian
    Li RunZe
    SCIENCE CHINA-MATHEMATICS, 2013, 56 (06) : 1247 - 1262
  • [23] Empirical likelihood inference for semi-parametric estimating equations
    ShanShan Wang
    HengJian Cui
    RunZe Li
    Science China Mathematics, 2013, 56 : 1247 - 1262
  • [24] Semi-parametric inference in a bivariate (multivariate) mixture model
    Leung, DHY
    Qin, J
    STATISTICA SINICA, 2006, 16 (01) : 153 - 163
  • [25] Empirical likelihood inference for semi-parametric estimating equations
    WANG ShanShan
    CUI HengJian
    LI RunZe
    Science China(Mathematics), 2013, 56 (06) : 1243 - 1258
  • [26] A Bayesian semi-parametric model for thermal proteome profiling
    Fang, Siqi
    Kirk, Paul D. W.
    Bantscheff, Marcus
    Lilley, Kathryn S.
    Crook, Oliver M.
    COMMUNICATIONS BIOLOGY, 2021, 4 (01)
  • [27] Bayesian non- and semi-parametric methods and applications
    Assmann, Christian
    JOURNAL OF ECONOMICS, 2015, 115 (02) : 195 - 197
  • [28] A Bayesian semi-parametric approach to the ordinal calibration problem
    Paz Casanova, Maria
    Orellana, Yasna
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2016, 45 (22) : 6596 - 6610
  • [29] Bayesian modeling and optimization based on semi-parametric hierarchy
    Chen X.
    Wang J.
    Yang S.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2023, 45 (05): : 1580 - 1588
  • [30] A Bayesian semi-parametric model for colorectal cancer incidences
    Zhang, S
    Sun, DC
    He, CZ
    Schootman, M
    STATISTICS IN MEDICINE, 2006, 25 (02) : 285 - 309