Loss amount prediction from textual data using a double GLM with shrinkage and selection

被引:0
|
作者
Scott Manski
Kaixu Yang
Gee Y. Lee
Tapabrata Maiti
机构
[1] Pfizer Inc.,Department of Statistics and Probability
[2] LinkedIn Corporation,Department of Mathematics
[3] Michigan State University,undefined
[4] Michigan State University,undefined
来源
关键词
Insurance analytics; Claims prediction; Loss reserving; Word2vec; Word embedding matrix; Gamma double group lasso;
D O I
暂无
中图分类号
学科分类号
摘要
The Gamma model has been widely utilized in a variety of fields, including actuarial science, where it has important applications in insurance loss predictions. Meanwhile, high dimensional models and their applications have become more common in the statistics literature in recent years. The availability of such high dimensional models have allowed the analysis of non-traditional data, including those containing textual descriptions of the response. In the models used in such applications, the dispersion may be designed to be related to a set of covariates, as opposed to being a single fixed value for the entire population. Following this approach, we incorporate a group Lasso type penalty in both the dispersion and the mean parameterization for a Gamma model, and illustrate its use in a predictive analytics application in actuarial science. In particular, we apply the method to an insurance claim prediction problem involving textual data analysis methods. Simulations are conducted to illustrate the variable selection and model fitting performance of our method.
引用
收藏
页码:503 / 528
页数:25
相关论文
共 50 条
  • [21] Can metafeatures help improve explanations of prediction models when using behavioral and textual data?
    Ramon, Yanou
    Martens, David
    Evgeniou, Theodoros
    Praet, Stiene
    [J]. MACHINE LEARNING, 2024, 113 (07) : 4245 - 4284
  • [22] Blood Loss Severity Prediction using Game Theoretic Based Feature Selection
    Razi, Abolfazl
    Afghah, Fatemeh
    Belle, Ashwin
    Ward, Kevin
    Najarian, Kayvan
    [J]. 2014 IEEE-EMBS INTERNATIONAL CONFERENCE ON BIOMEDICAL AND HEALTH INFORMATICS (BHI), 2014, : 776 - 780
  • [23] Return prediction and stock selection from unidentified historical data
    Sonsino, Doron
    Shavit, Tal
    [J]. QUANTITATIVE FINANCE, 2014, 14 (04) : 641 - 655
  • [24] Prediction of Brain Connectivity Map in Resting-State fMRI Data Using Shrinkage Estimator
    Nazari, Atiye
    Alavimajd, Hamid
    Shakeri, Nezhat
    Bakhshandeh, Mohsen
    Faghihzadeh, Elham
    Marzbani, Hengameh
    [J]. BASIC AND CLINICAL NEUROSCIENCE, 2019, 10 (02) : 147 - 156
  • [25] Prediction of restrained shrinkage crack width of slag mortar composites using data mining techniques
    Martins, Francisco Ferreira
    Camoes, Aires
    [J]. MATERIA-RIO DE JANEIRO, 2019, 24 (04):
  • [26] Comparison of selection based on phenotype, selection index and best linear unbiased prediction using data from a closed broiler line
    Morris, AJ
    Pollott, GE
    [J]. BRITISH POULTRY SCIENCE, 1997, 38 (03) : 249 - 254
  • [27] Audit lead selection and yield prediction from historical tax data using artificial neural networks
    Chan, Trevor
    Tan, Cheng-En
    Tagkopoulos, Ilias
    [J]. PLOS ONE, 2022, 17 (11):
  • [28] Random forest prediction of Alzheimer's disease using pairwise selection from time series data
    Moore, P. J.
    Lyons, T. J.
    Gallacher, J.
    Weiner, Michael W.
    Aisen, Paul
    Petersen, Ronald
    Jack, Clifford R., Jr.
    Jagust, William
    Trojanowki, John Q.
    Toga, Arthur W.
    Beckett, Laurel
    Green, Robert C.
    Saykin, Andrew J.
    Morris, John
    Shaw, Leslie M.
    Khachaturian, Zaven
    Sorensen, Greg
    Carrillo, Maria
    Kuller, Lew
    Raichle, Marc
    Paul, Steven
    Davies, Peter
    Fillit, Howard
    Hefti, Franz
    Holtzman, David
    Mesulam, M. Marcel
    Potter, William
    Snyder, Peter
    Montine, Tom
    Jimenez, Gustavo
    Donohue, Michael
    Gessert, Devon
    Harless, Kelly
    Salazar, Jennifer
    Cabrera, Yuliana
    Walter, Sarah
    Hergesheimer, Lindsey
    Harvey, Danielle
    Donohue, Michael
    Bernstein, Matthew
    Fox, Nick
    Thompson, Paul
    Schuff, Norbert
    DeCArli, Charles
    Borowski, Bret
    Gunter, Jeff
    Senjem, Matt
    Vemuri, Prashanthi
    Jones, David
    Kantarci, Kejal
    [J]. PLOS ONE, 2019, 14 (02):
  • [29] Prediction of plume migration using injection data and a model selection approach
    Bhowmik, Sayantan
    Srinivasan, Sanjay
    Bryant, Steven
    [J]. GHGT-11, 2013, 37 : 3672 - 3679
  • [30] Data partition and variable selection for time series prediction using wrappers
    Puma-Villanueva, Wilfredo J.
    dos Santos, Euripedes P.
    Von Zuben, Fernando J.
    [J]. 2006 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORK PROCEEDINGS, VOLS 1-10, 2006, : 4740 - +