Clusterwise multivariate regression of mixed-type panel data

被引:0
|
作者
Vavra, Jan [1 ]
Komarek, Arnost [1 ]
Gruen, Bettina [2 ]
Malsiner-Walli, Gertraud [2 ]
机构
[1] Charles Univ Prague, Fac Math & Phys, Sokolovska 49-83, Prague 18675, Czech Republic
[2] Vienna Univ Econ & Business, Inst Stat & Math, Welthandels pl 1, A-1020 Vienna, Austria
基金
奥地利科学基金会;
关键词
Multivariate longitudinal data; Mixed type outcome; Generalised linear mixed model (GLMM); Model-based clustering; Classification; Sparse finite mixture; EU-SILC; MODELS; MIXTURES; PROFILES;
D O I
10.1007/s11222-023-10304-5
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Multivariate panel data of mixed type are routinely collected in many different areas of application, often jointly with additional covariates which complicate the statistical analysis. Moreover, it is often of interest to identify unknown groups of subjects in a study population using such data structure, i.e., to perform clustering. In the Bayesian framework, we propose a finite mixture of multivariate generalised linear mixed effects regression models to cluster numeric, binary, ordinal and categorical panel outcomes jointly. The specification of suitable priors on the model parameters allows for convenient posterior inference based on Markov chain Monte Carlo (MCMC) sampling with data augmentation. This approach allows to classify subjects in the data and new subjects as well as to characterise the cluster-specific models. Model estimation and selection of the number of data clusters are simultaneously performed when approximating the posterior for a single model using MCMC sampling without resorting to multiple model estimations. The performance of the proposed methodology is evaluated in a simulation study. Its application is illustrated on two data sets, one from a longitudinal patient study to infer prognosis groups, and a second one from the Czech part of the EU-SILC survey where households are annually interviewed to obtain insights into changes in their financial capability.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Clusterwise multivariate regression of mixed-type panel data
    Jan Vávra
    Arnošt Komárek
    Bettina Grün
    Gertraud Malsiner-Walli
    [J]. Statistics and Computing, 2024, 34
  • [2] Mixed-type multivariate response regression with covariance estimation
    Ekvall, Karl Oskar
    Molstad, Aaron J.
    [J]. STATISTICS IN MEDICINE, 2022, 41 (15) : 2768 - 2785
  • [3] Causal Inference on Multivariate and Mixed-Type Data
    Marx, Alexander
    Vreeken, Jilles
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2018, PT II, 2019, 11052 : 655 - 671
  • [4] Multivariate semiparametric control charts for mixed-type data
    Sofikitou, Elisavet M.
    Markatou, Marianthi
    Koutras, Markos, V
    [J]. STATISTICAL METHODS IN MEDICAL RESEARCH, 2023, 32 (04) : 671 - 690
  • [5] A multivariate sign chart for monitoring dependence among mixed-type data
    Wang, Junjie
    Su, Qin
    Fang, Yue
    Zhang, Pengwei
    [J]. COMPUTERS & INDUSTRIAL ENGINEERING, 2018, 126 : 625 - 636
  • [6] Joint regression analysis of mixed-type outcome data via efficient scores
    Marchese, Scott
    Diao, Guoqing
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2018, 125 : 156 - 170
  • [7] Review and evaluation of imputation methods for multivariate longitudinal data with mixed-type incomplete variables
    Cao, Yi
    Allore, Heather
    Vander Wyk, Brent
    Gutman, Roee
    [J]. STATISTICS IN MEDICINE, 2022, 41 (30) : 5844 - 5876
  • [8] Spectral Clustering of Mixed-Type Data
    Mbuga, Felix
    Tortora, Cristina
    [J]. STATS, 2022, 5 (01): : 1 - 11
  • [9] DETERMINATION OF THE FACTORS AFFECTING AIR POLLUTION BY ANALYSIS OF CLUSTERED MIXED-TYPE PANEL DATA
    Akay, Ozlem
    Yuksel, Guzin
    [J]. FRESENIUS ENVIRONMENTAL BULLETIN, 2020, 29 (01): : 659 - 671
  • [10] Regression analysis of multivariate panel count data
    He, Xin
    Tong, Xingwei
    Sun, Jianguo
    Cook, Richard J.
    [J]. BIOSTATISTICS, 2008, 9 (02) : 234 - 248