Heavy-tailed longitudinal data modeling using copulas

被引:57
|
作者
Sun, Jiafeng [1 ]
Frees, Edward W. [1 ]
Rosenberg, Marjorie A. [1 ]
机构
[1] Univ Wisconsin, Sch Business, Dept Actuarial Sci Risk Management & Insurance, Madison, WI 53706 USA
来源
INSURANCE MATHEMATICS & ECONOMICS | 2008年 / 42卷 / 02期
基金
美国医疗保健研究与质量局; 美国国家科学基金会;
关键词
healthcare costs; predictive modeling;
D O I
10.1016/j.insmatheco.2007.09.009
中图分类号
F [经济];
学科分类号
02 ;
摘要
In this paper, we consider "heavy-tailed" data, that is, data where extreme values are likely to occur. Heavy-tailed data have been analyzed using flexible distributions such as the generalized beta of the second kind, the generalized gamma and the Burr. These distributions allow us to handle data with either positive or negative skewness, as well as heavy tails. Moreover, it has been shown that they can also accommodate cross-sectional regression models by allowing functions of explanatory variables to serve as distribution parameters. The objective of this paper is to extend this literature to accommodate longitudinal data, where one observes repeated observations of cross-sectional data. Specifically, we use copulas to model the dependencies over time, and heavy-tailed regression models to represent the marginal distributions. We also introduce model exploration techniques to help us with the initial choice of the copula and a goodness-of-fit test of elliptical copulas for model validation. In a longitudinal data context, we argue that elliptical copulas will be typically preferred to the Archimedean copulas. To illustrate our methods, Wisconsin nursing homes utilization data from 1995 to 2001 are analyzed. These data exhibit long tails and negative skewness and so help us to motivate the need for our new techniques. We find that time and the nursing home facility size as measured through the number of beds and square footage are important predictors of future utilization. Moreover, using our parametric model, we provide not only point predictions but also an entire predictive distribution. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:817 / 830
页数:14
相关论文
共 50 条
  • [21] Hierarchical clustering of heavy-tailed data using a new similarity measure
    Seidpisheh, Mohammad
    Mohammadpour, Adel
    INTELLIGENT DATA ANALYSIS, 2018, 22 (03) : 569 - 579
  • [22] Heavy-tailed densities
    Rojo, Javier
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2013, 5 (01): : 30 - 40
  • [23] A DISCRETE-TIME APPROACH FOR HEAVY-TAILED MODELING
    Vargas, Cesar
    Munoz, David
    Rodriguez, Oscar
    Antonio, Michel Z.
    STOCHASTIC MODELS, 2008, 24 : 270 - 280
  • [24] MODELING HEAVY-TAILED STOCK INDEX RETURNS USING THE GENERALIZED HYPERBOLIC DISTRIBUTION
    Necula, Ciprian
    ROMANIAN JOURNAL OF ECONOMIC FORECASTING, 2009, 10 (02): : 118 - 131
  • [25] Minimum of heavy-tailed random variables is not heavy tailed
    Leipus, Remigijus
    Siaulys, Jonas
    Konstantinides, Dimitrios
    AIMS MATHEMATICS, 2023, 8 (06): : 13066 - 13072
  • [26] Type-II Generalized Crack Distribution with Application to Heavy-Tailed Data Modeling
    Taehan Bae
    Andrei Volodin
    Journal of Statistical Theory and Practice, 2022, 16
  • [27] Applying the Heavy-Tailed Kernel to the Gaussian Process Regression for Modeling Point of Sale Data
    Yang, Rui
    Ohsawa, Yukio
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 705 - 712
  • [28] Type-II Generalized Crack Distribution with Application to Heavy-Tailed Data Modeling
    Bae, Taehan
    Volodin, Andrei
    JOURNAL OF STATISTICAL THEORY AND PRACTICE, 2022, 16 (03)
  • [29] Support Vector Machine with Heavy-tailed Distribution Data
    Kim, Chansoo
    Choi, ByoungSecon
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCES IN SIGNAL PROCESSING AND ARTIFICIAL INTELLIGENCE, ASPAI' 2020, 2020, : 197 - 198
  • [30] On Empirical Risk Minimization with Dependent and Heavy-Tailed Data
    Roy, Abhishek
    Balasubramanian, Krishnakumar
    Erdogdu, Murat A.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34