Using random forests to estimate win probability before each play of an NFL game

被引:27
|
作者
Lock, Dennis [1 ]
Nettleton, Dan [1 ]
机构
[1] Iowa State Univ, Dept Stat, Ames, IA 50011 USA
关键词
random forest; NFL; win probability;
D O I
10.1515/jqas-2013-0100
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Before any play of a National Football League (NFL) game, the probability that a given team will win depends on many situational variables (such as time remaining, yards to go for a first down, field position and current score) as well as the relative quality of the two teams as quantified by the Las Vegas point spread. We use a random forest method to combine pre-play variables to estimate Win Probability (WP) before any play of an NFL game. When a subset of NFL play-by-play data for the 12 seasons from 2001 to 2012 is used as a training dataset, our method provides WP estimates that resemble true win probability and accurately predict game outcomes, especially in the later stages of games. In addition to being intrinsically interesting in real time to observers of an NFL football game, our WP estimates can provide useful evaluations of plays and, in some cases, coaching decisions.
引用
收藏
页码:197 / 205
页数:9
相关论文
共 8 条
  • [1] Time series classification using deep residual probability random forests
    Liu, Ying
    Li, Xu
    Lv, Zheng
    Zhao, Jun
    Wang, Wei
    [J]. Kongzhi yu Juece/Control and Decision, 2024, 39 (07): : 2315 - 2324
  • [2] Using Random Forests for Consistent Probability Estimation in Whole Genome Association Studies
    Kruppa, Jochen
    Konig, Inke R.
    Ziegler, Andreas
    [J]. GENETIC EPIDEMIOLOGY, 2012, 36 (07) : 752 - 752
  • [3] Using Random Forests for consistent probability estimation in whole genome association studies
    Kruppa, Jochen
    Koenig, Inke R.
    Ziegler, Andreas
    [J]. ANNALS OF HUMAN GENETICS, 2012, 76 : 426 - 426
  • [4] PItcHPERFeCT: Primary Intracranial Hemorrhage Probability Estimation using Random Forests on CT
    Muschelli, John
    Sweeney, Elizabeth M.
    Ullman, Natalie L.
    Vespa, Paul
    Hanley, Daniel F.
    Crainiceanu, Ciprian M.
    [J]. NEUROIMAGE-CLINICAL, 2017, 14 : 379 - 390
  • [5] Predicting a Multi-Parametric Probability Map of Active Tumor Extent Using Random Forests
    Prior, Fred W.
    Fouke, Sarah J.
    Benzinger, Tammie
    Boyd, Alicia
    Chicoine, Michael
    Cholleti, Sharath
    Kelsey, Matthew
    Keogh, Bart
    Kim, Lauren
    Milchenko, Mikhail
    Politte, David G.
    Tyree, Stephen
    Weinberger, Kilian
    Marcus, Daniel
    [J]. 2013 35TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2013, : 6478 - 6481
  • [6] Parameter mining using the out-of-bootstrap generalisation error estimate for Stochastic Discrimination and Random Forests
    Prior, M.
    Windeatt, T.
    [J]. 18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2006, : 498 - +
  • [7] Random forests-based error-correction of streamflow from a large-scale hydrological model: Using model state variables to estimate error terms
    Shen, Youchen
    Ruijsch, Jessica
    Lu, Meng
    Sutanudjaja, Edwin H.
    Karssenberg, Derek
    [J]. COMPUTERS & GEOSCIENCES, 2022, 159
  • [8] Random survival forests using linked data to measure illness burden among individuals before or after a cancer diagnosis: Development and internal validation of the SEER-CAHPS illness burden index
    Lines, Lisa M.
    Cohen, Julia
    Kirschner, Justin
    Halpern, Michael T.
    Kent, Erin E.
    Mollica, Michelle A.
    Smith, Ashley Wilder
    [J]. INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2021, 145