Heavy-tailed longitudinal data modeling using copulas

被引:57
|
作者
Sun, Jiafeng [1 ]
Frees, Edward W. [1 ]
Rosenberg, Marjorie A. [1 ]
机构
[1] Univ Wisconsin, Sch Business, Dept Actuarial Sci Risk Management & Insurance, Madison, WI 53706 USA
来源
INSURANCE MATHEMATICS & ECONOMICS | 2008年 / 42卷 / 02期
基金
美国国家科学基金会; 美国医疗保健研究与质量局;
关键词
healthcare costs; predictive modeling;
D O I
10.1016/j.insmatheco.2007.09.009
中图分类号
F [经济];
学科分类号
02 ;
摘要
In this paper, we consider "heavy-tailed" data, that is, data where extreme values are likely to occur. Heavy-tailed data have been analyzed using flexible distributions such as the generalized beta of the second kind, the generalized gamma and the Burr. These distributions allow us to handle data with either positive or negative skewness, as well as heavy tails. Moreover, it has been shown that they can also accommodate cross-sectional regression models by allowing functions of explanatory variables to serve as distribution parameters. The objective of this paper is to extend this literature to accommodate longitudinal data, where one observes repeated observations of cross-sectional data. Specifically, we use copulas to model the dependencies over time, and heavy-tailed regression models to represent the marginal distributions. We also introduce model exploration techniques to help us with the initial choice of the copula and a goodness-of-fit test of elliptical copulas for model validation. In a longitudinal data context, we argue that elliptical copulas will be typically preferred to the Archimedean copulas. To illustrate our methods, Wisconsin nursing homes utilization data from 1995 to 2001 are analyzed. These data exhibit long tails and negative skewness and so help us to motivate the need for our new techniques. We find that time and the nursing home facility size as measured through the number of beds and square footage are important predictors of future utilization. Moreover, using our parametric model, we provide not only point predictions but also an entire predictive distribution. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:817 / 830
页数:14
相关论文
共 50 条
  • [41] Renewal reward processes with heavy-tailed inter-renewal times and heavy-tailed rewards
    Levy, JB
    Taqqu, MS
    [J]. BERNOULLI, 2000, 6 (01) : 23 - 44
  • [42] Heavy-tailed log hydraulic conductivity distributions imply heavy-tailed log velocity distributions
    Kohlbecker, MV
    Wheatcraft, SW
    Meerschaert, MM
    [J]. WATER RESOURCES RESEARCH, 2006, 42 (04)
  • [43] Heavy-tailed Representations, Text Polarity Classification & Data Augmentation
    Jalalzai, Hamid
    Colombo, Pierre
    Clavel, Chloe
    Gaussier, Eric
    Varni, Giovanna
    Vignon, Emmanuel
    Sabourin, Anne
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [44] Calibrating Spatial Stratified Heterogeneity for Heavy-Tailed Distributed Data
    Hu, Bisong
    Wu, Tingting
    Yin, Qian
    Wang, Jinfeng
    Jiang, Bin
    Luo, Jin
    [J]. ANNALS OF THE AMERICAN ASSOCIATION OF GEOGRAPHERS, 2024, 114 (07) : 1568 - 1586
  • [45] A heavy-tailed empirical Bayes method for replicated microarray data
    Salas-Gonzalez, Diego
    Kuruoglu, Ercan E.
    Ruiz, Diego P.
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2009, 53 (05) : 1535 - 1546
  • [46] Trimmed extreme value estimators for censored heavy-tailed data
    Bladt, Martin
    Albrecher, Hansjorg
    Beirlant, Jan
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2021, 15 (01): : 3112 - 3136
  • [47] Multivariate Denoising and Missing Data Estimation for Heavy-Tailed Signals
    Gorji, Ferdos
    Aminghafari, Mina
    [J]. FLUCTUATION AND NOISE LETTERS, 2019, 18 (03):
  • [48] A new probability model with application to heavy-tailed hydrological data
    Tassaddaq Hussain
    Hassan S. Bakouch
    Christophe Chesneau
    [J]. Environmental and Ecological Statistics, 2019, 26 : 127 - 151
  • [49] The modelling of ethernet data and of signals that are heavy-tailed with infinite variance
    Taqqu, MS
    [J]. SCANDINAVIAN JOURNAL OF STATISTICS, 2002, 29 (02) : 273 - 295
  • [50] A new probability model with application to heavy-tailed hydrological data
    Hussain, Tassaddaq
    Bakouch, Hassan S.
    Chesneau, Christophe
    [J]. ENVIRONMENTAL AND ECOLOGICAL STATISTICS, 2019, 26 (02) : 127 - 151