Heavy-tailed longitudinal data modeling using copulas

Cited by: 57
Authors
Sun, Jiafeng [1]
Frees, Edward W. [1]
Rosenberg, Marjorie A. [1]
Affiliations
[1] Univ Wisconsin, Sch Business, Dept Actuarial Sci Risk Management & Insurance, Madison, WI 53706 USA
Source
INSURANCE MATHEMATICS & ECONOMICS | 2008 / Vol. 42 / Issue 02
Funding
U.S. National Science Foundation; U.S. Agency for Healthcare Research and Quality;
Keywords
healthcare costs; predictive modeling;
DOI
10.1016/j.insmatheco.2007.09.009
Chinese Library Classification number
F [Economics];
Discipline classification code
02;
Abstract
In this paper, we consider "heavy-tailed" data, that is, data where extreme values are likely to occur. Heavy-tailed data have been analyzed using flexible distributions such as the generalized beta of the second kind, the generalized gamma and the Burr. These distributions allow us to handle data with either positive or negative skewness, as well as heavy tails. Moreover, it has been shown that they can also accommodate cross-sectional regression models by allowing functions of explanatory variables to serve as distribution parameters. The objective of this paper is to extend this literature to accommodate longitudinal data, where one observes repeated observations of cross-sectional data. Specifically, we use copulas to model the dependencies over time, and heavy-tailed regression models to represent the marginal distributions. We also introduce model exploration techniques to help us with the initial choice of the copula and a goodness-of-fit test of elliptical copulas for model validation. In a longitudinal data context, we argue that elliptical copulas will be typically preferred to the Archimedean copulas. To illustrate our methods, Wisconsin nursing homes utilization data from 1995 to 2001 are analyzed. These data exhibit long tails and negative skewness and so help us to motivate the need for our new techniques. We find that time and the nursing home facility size as measured through the number of beds and square footage are important predictors of future utilization. Moreover, using our parametric model, we provide not only point predictions but also an entire predictive distribution. (C) 2007 Elsevier B.V. All rights reserved.
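The modeling idea in the abstract, a copula for dependence over time combined with heavy-tailed regression marginals, can be illustrated with a minimal simulation sketch. It is not the authors' implementation. It assumes a Gaussian copula (one member of the elliptical family) with an AR(1) correlation structure, Burr XII marginals, and a log link from a single covariate to the scale parameter. The function names, the link function, and all parameter values are hypothetical choices made for illustration only.

```python
import math
import random
from statistics import NormalDist


def burr_quantile(u, c, k, scale=1.0):
    """Inverse CDF of the Burr XII law F(x) = 1 - (1 + (x/scale)**c)**(-k)."""
    return scale * ((1.0 - u) ** (-1.0 / k) - 1.0) ** (1.0 / c)


def burr_cdf(x, c, k, scale=1.0):
    """CDF of the Burr XII law, used here only to check the quantile function."""
    return 1.0 - (1.0 + (x / scale) ** c) ** (-k)


def simulate_subject(covariates, rho, c, k, beta0, beta1, rng):
    """Simulate one subject's repeated observations.

    Dependence over time: Gaussian copula with AR(1) correlation rho
    (an assumed structure, chosen for simplicity).
    Marginals: Burr XII whose scale is exp(beta0 + beta1 * x_t)
    (a hypothetical regression link).
    """
    std_normal = NormalDist()
    z = rng.gauss(0.0, 1.0)
    series = []
    for t, x_t in enumerate(covariates):
        if t > 0:
            # AR(1) update keeps each z marginally standard normal.
            z = rho * z + math.sqrt(1.0 - rho ** 2) * rng.gauss(0.0, 1.0)
        # Probability integral transform; clamp to avoid u == 0 or u == 1.
        u = min(max(std_normal.cdf(z), 1e-12), 1.0 - 1e-12)
        scale = math.exp(beta0 + beta1 * x_t)
        series.append(burr_quantile(u, c, k, scale))
    return series


if __name__ == "__main__":
    rng = random.Random(42)
    # Seven periods, echoing the 1995 to 2001 span; covariate values are made up.
    beds = [1.0, 1.0, 1.1, 1.1, 1.2, 1.2, 1.2]
    print(simulate_subject(beds, rho=0.8, c=2.0, k=1.5,
                           beta0=0.0, beta1=0.5, rng=rng))
```

Repeating the simulation many times for a fixed covariate path gives an empirical predictive distribution rather than a single point forecast, which is the kind of output the abstract describes.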
Pages: 817-830
Number of pages: 14