Bayesian analysis of two-part nonlinear latent variable model: Semiparametric method

被引:3
|
作者
Gou, Jian-Wei [1 ]
Xia, Ye-Mao [1 ]
Jiang, De-Peng [2 ]
机构
[1] Nanjing Forestry Univ, Sch Sci, Dept Appl Math, Nanjing 210037, Jiangsu, Peoples R China
[2] Univ Manitoba, Dept Community Hlth Sci, Winnipeg, MB, Canada
关键词
Markov Chains Monte Carlo; Semi-parametric Bayesian methods; semi-continuous data; truncated Dirichlet process; two-part nonlinear latent variable model; FINITE MIXTURES; DIRICHLET; COCAINE; TRAIT; DISTRIBUTIONS; TUTORIAL;
D O I
10.1177/1471082X211059233
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Two-part model (TPM) is a widely appreciated statistical method for analyzing semi-continuous data. Semi-continuous data can be viewed as arising from two distinct stochastic processes: one governs the occurrence or binary part of data and the other determines the intensity or continuous part. In the regression setting with the semi-continuous outcome as functions of covariates, the binary part is commonly modelled via logistic regression and the continuous component via a log-normal model. The conventional TPM, still imposes assumptions such as log-normal distribution of the continuous part, with no unobserved heterogeneity among the response, and no collinearity among covariates, which are quite often unrealistic in practical applications. In this article, we develop a two-part nonlinear latent variable model (TPNLVM) with mixed multiple semi-continuous and continuous variables. The semi-continuous variables are treated as indicators of the latent factor analysis along with other manifest variables. This reduces the dimensionality of the regression model and alleviates the potential multicollinearity problems. Our TPNLVM can accommodate the nonlinear relationships among latent variables extracted from the factor analysis. To downweight the influence of distribution deviations and extreme observations, we develop a Bayesian semiparametric analysis procedure. The conventional parametric assumptions on the related distributions are relaxed and the Dirichlet process (DP) prior is used to improve model fitting. By taking advantage of the discreteness of DP, our method is effective in capturing the heterogeneity underlying population. Within the Bayesian paradigm, posterior inferences including parameters estimates and model assessment are carried out through Markov Chains Monte Carlo (MCMC) sampling method. To facilitate posterior sampling, we adapt the Polya-Gamma stochastic representation for the logistic model. Using simulation studies, we examine properties and merits of our proposed methods and illustrate our approach by evaluating the effect of treatment on cocaine use and examining whether the treatment effect is moderated by psychiatric problems.
引用
收藏
页码:376 / 399
页数:24
相关论文
共 50 条
  • [31] A two-part model of alcohol sensitivity
    Williams, M. A.
    Sher, K. J.
    Wood, P. K.
    Bartholow, B. D.
    ALCOHOLISM-CLINICAL AND EXPERIMENTAL RESEARCH, 2007, 31 (06) : 188A - 188A
  • [32] Bayesian marginalized two-part mixed effects model based on generalized gamma distribution
    Kwon, Yongtae
    Lee, Keunbaik
    KOREAN JOURNAL OF APPLIED STATISTICS, 2023, 36 (03) : 225 - 243
  • [33] Variable selection for random effects two-part models
    Han, Dongxiao
    Liu, Lei
    Su, Xiaogang
    Johnson, Bankole
    Sun, Liuquan
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2019, 28 (09) : 2697 - 2709
  • [34] Bayesian Analysis of Nonnegative Data Using Dependency-Extended Two-Part Models
    Mariana Rodrigues-Motta
    Johannes Forkman
    Journal of Agricultural, Biological and Environmental Statistics, 2022, 27 : 201 - 221
  • [35] Bayesian Analysis of Nonnegative Data Using Dependency-Extended Two-Part Models
    Rodrigues-Motta, Mariana
    Forkman, Johannes
    JOURNAL OF AGRICULTURAL BIOLOGICAL AND ENVIRONMENTAL STATISTICS, 2022, 27 (02) : 201 - 221
  • [36] Bayesian Analysis of Semiparametric Hidden Markov Models With Latent Variables
    Song, Xinyuan
    Kang, Kai
    Ouyang, Ming
    Jiang, Xuejun
    Cai, Jingheng
    STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2018, 25 (01) : 1 - 20
  • [38] Latent variable model and its application to Bayesian operational modal analysis
    Zhu, Wei
    Li, Bin-Bin
    Xie, Yan-Long
    Chen, Xiao-Yu
    Zhendong Gongcheng Xuebao/Journal of Vibration Engineering, 2024, 37 (09): : 1476 - 1484
  • [39] Bayesian Analysis of ARCH-M model with a dynamic latent variable
    Song, Zefang
    Song, Xinyuan
    Li, Yuan
    ECONOMETRICS AND STATISTICS, 2023, 28 : 47 - 62
  • [40] Bayesian inference for a two-part hierarchical model: An application to profiling providers in managed health care
    Zhang, Min
    Strawderman, Robert L.
    Cowen, Mark E.
    T Wells, Martin
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2006, 101 (475) : 934 - 945