Bayesian analysis of two-part nonlinear latent variable model: Semiparametric method

被引:3
|
作者
Gou, Jian-Wei [1 ]
Xia, Ye-Mao [1 ]
Jiang, De-Peng [2 ]
机构
[1] Nanjing Forestry Univ, Sch Sci, Dept Appl Math, Nanjing 210037, Jiangsu, Peoples R China
[2] Univ Manitoba, Dept Community Hlth Sci, Winnipeg, MB, Canada
关键词
Markov Chains Monte Carlo; Semi-parametric Bayesian methods; semi-continuous data; truncated Dirichlet process; two-part nonlinear latent variable model; FINITE MIXTURES; DIRICHLET; COCAINE; TRAIT; DISTRIBUTIONS; TUTORIAL;
D O I
10.1177/1471082X211059233
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Two-part model (TPM) is a widely appreciated statistical method for analyzing semi-continuous data. Semi-continuous data can be viewed as arising from two distinct stochastic processes: one governs the occurrence or binary part of data and the other determines the intensity or continuous part. In the regression setting with the semi-continuous outcome as functions of covariates, the binary part is commonly modelled via logistic regression and the continuous component via a log-normal model. The conventional TPM, still imposes assumptions such as log-normal distribution of the continuous part, with no unobserved heterogeneity among the response, and no collinearity among covariates, which are quite often unrealistic in practical applications. In this article, we develop a two-part nonlinear latent variable model (TPNLVM) with mixed multiple semi-continuous and continuous variables. The semi-continuous variables are treated as indicators of the latent factor analysis along with other manifest variables. This reduces the dimensionality of the regression model and alleviates the potential multicollinearity problems. Our TPNLVM can accommodate the nonlinear relationships among latent variables extracted from the factor analysis. To downweight the influence of distribution deviations and extreme observations, we develop a Bayesian semiparametric analysis procedure. The conventional parametric assumptions on the related distributions are relaxed and the Dirichlet process (DP) prior is used to improve model fitting. By taking advantage of the discreteness of DP, our method is effective in capturing the heterogeneity underlying population. Within the Bayesian paradigm, posterior inferences including parameters estimates and model assessment are carried out through Markov Chains Monte Carlo (MCMC) sampling method. To facilitate posterior sampling, we adapt the Polya-Gamma stochastic representation for the logistic model. Using simulation studies, we examine properties and merits of our proposed methods and illustrate our approach by evaluating the effect of treatment on cocaine use and examining whether the treatment effect is moderated by psychiatric problems.
引用
收藏
页码:376 / 399
页数:24
相关论文
共 50 条
  • [1] Variational Bayesian analysis for two-part latent variable model
    Yemao Xia
    Jinye Chen
    Depeng Jiang
    Computational Statistics, 2024, 39 : 2259 - 2290
  • [2] Variational Bayesian analysis for two-part latent variable model
    Xia, Yemao
    Chen, Jinye
    Jiang, Depeng
    COMPUTATIONAL STATISTICS, 2024, 39 (04) : 2259 - 2290
  • [3] Bayesian Analysis of Two-Part Latent Variable Model with Mixed Data
    Xiong, Shuang-Can
    Xia, Ye-Mao
    Lu, Bin
    COMMUNICATIONS IN MATHEMATICS AND STATISTICS, 2023,
  • [4] Bayesian analysis for two-part latent variable model with application to fractional data
    Chen, Jinye
    Zheng, Linyi
    Xia, Yemao
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2024, 53 (21) : 7760 - 7788
  • [5] Bayesian Feature Extraction for Two-Part Latent Variable Model with Polytomous Manifestations
    Zhang, Qi
    Zhang, Yihui
    Xia, Yemao
    MATHEMATICS, 2024, 12 (05)
  • [6] Inference on Two-Part Latent Variable Analysis Model With Multivariate Longitudinal Data
    Xia, Ye-Mao
    Lu, Bin
    Tang, Nian-Sheng
    STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2019, 26 (05) : 685 - 709
  • [7] Financial literacy and household finances: A Bayesian two-part latent variable modeling approach
    Feng, Xiangnan
    Lu, Bin
    Song, Xinyuan
    Ma, Shuang
    JOURNAL OF EMPIRICAL FINANCE, 2019, 51 : 119 - 137
  • [8] A Bayesian Semiparametric Latent Variable Model for Mixed Responses
    Ludwig Fahrmeir
    Alexander Raach
    Psychometrika, 2007, 72 : 327 - 346
  • [9] A bayesian semiparametric latent variable model for mixed responses
    Fahrmeir, Ludwig
    Raach, Alexander
    PSYCHOMETRIKA, 2007, 72 (03) : 327 - 346
  • [10] Bayesian semiparametric latent variable model with DP prior for joint analysis: Implementation with nimble
    Ma, Zhihua
    Chen, Guanghui
    STATISTICAL MODELLING, 2020, 20 (01) : 71 - 95