A flexible Bayesian variable selection approach for modeling interval data

被引:0
|
作者
Sen, Shubhajit [1 ,3 ]
Kundu, Damitri [1 ]
Das, Kiranmoy [1 ,2 ]
机构
[1] Indian Stat Inst, Appl Stat Div, Kolkata, India
[2] Yanqi Lake Beijing Inst Math Sci & Applicat, Beijing, Peoples R China
[3] North Carolina State Univ, Dept Stat, Raleigh, NC USA
来源
STATISTICAL METHODS AND APPLICATIONS | 2024年 / 33卷 / 01期
关键词
Global-local shrinkage prior; Interval data; Joint model; MCMC; Spike and slab prior; REGRESSION; ESTIMATOR;
D O I
10.1007/s10260-023-00727-9
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Interval datasets are not uncommon in many disciplines including medical experiments, econometric studies, environmental studies etc. For modeling interval data traditionally separate models are used for modeling the center and the radius of the response variable. In this article, we consider a Bayesian regression framework for jointly modeling the center and the radius of the intervals corresponding to the response, and then use appropriate priors for variable selection. Unlike the traditional setting, both the centres and the radii of all the predictors are used for modeling the center and the radius of response. We consider spike and slab priors for the regression coefficients corresponding to the centers (radii) of the predictors while modeling the center (radius) of the response, and global-local shrinkage prior for the coefficients corresponding to the radii (centers) of the predictors. Through extensive simulation studies, we illustrate the effectiveness of our proposed variable selection approach for the analysis and prediction of interval datasets. Finally, we analyze a real dataset from a clinical trial related to the Acute Lymphocytic Leukemia (ALL), and then select the important set of predictors for modeling the lymphocyte count which is an important biomarker for ALL. Our numerical studies show that the proposed approach is efficient, and it provides a powerful statistical inference for handling interval datasets.
引用
收藏
页码:267 / 286
页数:20
相关论文
共 50 条
  • [1] Bayesian approach for flexible modeling of semicompeting risks data
    Han, Baoguang
    Yu, Menggang
    Dignam, James J.
    Rathouz, Paul J.
    STATISTICS IN MEDICINE, 2014, 33 (29) : 5111 - 5125
  • [2] Gene selection: a Bayesian variable selection approach
    Lee, KE
    Sha, NJ
    Dougherty, ER
    Vannucci, M
    Mallick, BK
    BIOINFORMATICS, 2003, 19 (01) : 90 - 97
  • [3] Integration of survival and binary data for variable selection and prediction: a Bayesian approach
    Maity, Arnab Kumar
    Carroll, Raymond J.
    Mallick, Bani K.
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2019, 68 (05) : 1577 - 1595
  • [4] Variable selection in a flexible parametric mixture cure model with interval-censored data
    Scolas, Sylvie
    El Ghouch, Anouar
    Legrand, Catherine
    Oulhaj, Abderrahim
    STATISTICS IN MEDICINE, 2016, 35 (07) : 1210 - 1225
  • [5] Dirichlet Lasso: A Bayesian approach to variable selection
    Das, Kiranmoy
    Sobel, Marc
    STATISTICAL MODELLING, 2015, 15 (03) : 215 - 232
  • [6] EMVS: The EM Approach to Bayesian Variable Selection
    Rockova, Veronika
    George, Edward I.
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2014, 109 (506) : 828 - 846
  • [7] Flexible variable selection in the presence of missing data
    Williamson, Brian D.
    Huang, Ying
    INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2024, 20 (02): : 347 - 359
  • [8] Online Bayesian Variable Selection and Bayesian Model Averaging for Streaming Data
    Ghosh, Joyee
    Tan, Aixin
    Luo, Lan
    STAT, 2025, 14 (01):
  • [9] A Bayesian decision theory approach to variable selection for discrimination
    T. Fearn
    P. J. Brown
    P. Besbeas
    Statistics and Computing, 2002, 12 : 253 - 260
  • [10] Bayesian variable selection for time series count data
    Ibrahim, JG
    Chen, MH
    Ryan, LM
    STATISTICA SINICA, 2000, 10 (03) : 971 - 987