Beyond Point Prediction: Capturing Zero-Inflated & Heavy-Tailed Spatiotemporal Data with Deep Extreme Mixture Models

被引:2
|
作者
Wilson, Tyler [1 ]
McDonald, Andrew [1 ]
Galib, Asadullah Hill [1 ]
Tan, Pang-Ning [1 ]
Luo, Lifeng [1 ]
机构
[1] Michigan State Univ, E Lansing, MI 48824 USA
基金
美国国家科学基金会;
关键词
Zero-Inflated; Heavy-Tailed; Spatiotemporal Modeling; Extreme Value Theory; Deep Mixture Model;
D O I
10.1145/3534678.3539464
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Zero-inflated, heavy-tailed spatiotemporal data is common across science and engineering, from climate science to meteorology and seismology. A central modeling objective in such settings is to forecast the intensity, frequency, and timing of extreme and non-extreme events-yet in the context of deep learning, this objective presents several key challenges. First, a deep learning framework applied to such data must unify a mixture of distributions characterizing the zero events, moderate events, and extreme events. Second, the framework must be capable of enforcing parameter constraints across each component of the mixture distribution. Finally, the framework must be flexible enough to accommodate for any changes in the threshold used to define an extreme event after training. To address these challenges, we propose Deep Extreme Mixture Model (DEMM), fusing a deep learning-based hurdle model with extreme value theory to enable point and distribution prediction of zero-inflated, heavy-tailed spatiotemporal variables. The framework enables users to dynamically set a threshold for defining extreme events at inference-time without the need for retraining. We present an extensive experimental analysis applying DEMM to precipitation forecasting, and observe significant improvements in point and distribution prediction All code is available at https: //github.com/andrewmcdonald27/DeepExtremeMixtureModel.
引用
收藏
页码:2020 / 2028
页数:9
相关论文
共 11 条
  • [1] Hierarchical Mixture Models for Zero-inflated Correlated Count Data
    Chen, Xue-dong
    Shi, Hong-xing
    Wang, Xue-ren
    ACTA MATHEMATICAE APPLICATAE SINICA-ENGLISH SERIES, 2016, 32 (02): : 373 - 384
  • [2] Hierarchical Mixture Models for Zero-inflated Correlated Count Data
    Xue-dong CHEN
    Hong-xing SHI
    Xue-ren WANG
    Acta Mathematicae Applicatae Sinica, 2016, 32 (02) : 373 - 384
  • [3] Hierarchical mixture models for zero-inflated correlated count data
    Xue-dong Chen
    Hong-xing Shi
    Xue-ren Wang
    Acta Mathematicae Applicatae Sinica, English Series, 2016, 32 : 373 - 384
  • [4] Extreme Value Analysis for Mixture Models with Heavy-Tailed Impurity
    Morozova, Ekaterina
    Panov, Vladimir
    MATHEMATICS, 2021, 9 (18)
  • [5] Modelling count data with excessive zeros: The need for class prediction in zero-inflated models and the issue of data generation in choosing between zero-inflated and generic mixture models for dental caries data
    Gilthorpe, Mark S.
    Frydenberg, Morten
    Cheng, Yaping
    Baelum, Vibeke
    STATISTICS IN MEDICINE, 2009, 28 (28) : 3539 - 3553
  • [6] A framework of zero-inflated bayesian negative binomial regression models for spatiotemporal data
    He, Qing
    Huang, Hsin-Hsiung
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2024, 229
  • [7] Bayesian zero-inflated growth mixture models with application to health risk behavior data
    Yang, Si
    Puggioni, Gavino
    STATISTICS AND ITS INTERFACE, 2021, 14 (02) : 151 - 163
  • [8] Mixture model framework facilitates understanding of zero-inflated and hurdle models for count data
    Baughman, A. L.
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2007, 17 (05) : 943 - 946
  • [9] Spatiotemporal hurdle models for zero-inflated count data: Exploring trends in emergency department visits
    Neelon, Brian
    Chang, Howard H.
    Ling, Qiang
    Hastings, Nicole S.
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2016, 25 (06) : 2558 - 2576
  • [10] Pattern-Mixture Zero-Inflated Mixed Models for Longitudinal Unbalanced Count Data with Excessive Zeros
    Hasan, M. Tariqul
    Sneddon, Gary
    Ma, Renjun
    BIOMETRICAL JOURNAL, 2009, 51 (06) : 946 - 960