Bayesian analysis of two-part nonlinear latent variable model: Semiparametric method

被引:3
|
作者
Gou, Jian-Wei [1 ]
Xia, Ye-Mao [1 ]
Jiang, De-Peng [2 ]
机构
[1] Nanjing Forestry Univ, Sch Sci, Dept Appl Math, Nanjing 210037, Jiangsu, Peoples R China
[2] Univ Manitoba, Dept Community Hlth Sci, Winnipeg, MB, Canada
关键词
Markov Chains Monte Carlo; Semi-parametric Bayesian methods; semi-continuous data; truncated Dirichlet process; two-part nonlinear latent variable model; FINITE MIXTURES; DIRICHLET; COCAINE; TRAIT; DISTRIBUTIONS; TUTORIAL;
D O I
10.1177/1471082X211059233
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Two-part model (TPM) is a widely appreciated statistical method for analyzing semi-continuous data. Semi-continuous data can be viewed as arising from two distinct stochastic processes: one governs the occurrence or binary part of data and the other determines the intensity or continuous part. In the regression setting with the semi-continuous outcome as functions of covariates, the binary part is commonly modelled via logistic regression and the continuous component via a log-normal model. The conventional TPM, still imposes assumptions such as log-normal distribution of the continuous part, with no unobserved heterogeneity among the response, and no collinearity among covariates, which are quite often unrealistic in practical applications. In this article, we develop a two-part nonlinear latent variable model (TPNLVM) with mixed multiple semi-continuous and continuous variables. The semi-continuous variables are treated as indicators of the latent factor analysis along with other manifest variables. This reduces the dimensionality of the regression model and alleviates the potential multicollinearity problems. Our TPNLVM can accommodate the nonlinear relationships among latent variables extracted from the factor analysis. To downweight the influence of distribution deviations and extreme observations, we develop a Bayesian semiparametric analysis procedure. The conventional parametric assumptions on the related distributions are relaxed and the Dirichlet process (DP) prior is used to improve model fitting. By taking advantage of the discreteness of DP, our method is effective in capturing the heterogeneity underlying population. Within the Bayesian paradigm, posterior inferences including parameters estimates and model assessment are carried out through Markov Chains Monte Carlo (MCMC) sampling method. To facilitate posterior sampling, we adapt the Polya-Gamma stochastic representation for the logistic model. Using simulation studies, we examine properties and merits of our proposed methods and illustrate our approach by evaluating the effect of treatment on cocaine use and examining whether the treatment effect is moderated by psychiatric problems.
引用
收藏
页码:376 / 399
页数:24
相关论文
共 50 条
  • [41] Analysis of longitudinal semicontinuous data using marginalized two-part model
    Jaffa, Miran A.
    Gebregziabher, Mulugeta
    Garrett, Sara M.
    Luttrell, Deirdre K.
    Lipson, Kenneth E.
    Luttrell, Louis M.
    Jaffa, Ayad A.
    JOURNAL OF TRANSLATIONAL MEDICINE, 2018, 16
  • [42] Analysis of longitudinal semicontinuous data using marginalized two-part model
    Miran A. Jaffa
    Mulugeta Gebregziabher
    Sara M. Garrett
    Deirdre K. Luttrell
    Kenneth E. Lipson
    Louis M. Luttrell
    Ayad A. Jaffa
    Journal of Translational Medicine, 16
  • [43] A Two-Part Tariff Model for Energy Intermediation
    Srinivasan, Sunderasan
    ENGINEERING ECONOMIST, 2013, 58 (04): : 265 - 281
  • [44] A note on optimal designs for a two-part model
    Han, C
    STATISTICS & PROBABILITY LETTERS, 2003, 65 (04) : 343 - 351
  • [45] DEPRESSION AND CPAP ADHERENCE: A TWO-PART MODEL
    Wohlgemuth, William
    Tutek, Joshua
    Wallace, Douglas
    Fins, Ana
    Martinez-Garcia, Ana-Maria
    Satyanarayana, Satya
    Gonzalez, Alexandria
    PSYCHOSOMATIC MEDICINE, 2020, 82 (06): : A73 - A73
  • [46] A marginalized two-part model for semicontinuous data
    Smith, Valerie A.
    Preisser, John S.
    Neelon, Brian
    Maciejewski, Matthew L.
    STATISTICS IN MEDICINE, 2014, 33 (28) : 4891 - 4903
  • [47] Bayesian Semiparametric Analysis of Multivariate Continuous Responses, With Variable Selection
    Papageorgiou, Georgios
    Marshall, Benjamin C.
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2020, 29 (04) : 896 - 909
  • [48] When Two-Part Tariffs are Not Enough: Mixing with Nonlinear Pricing
    Hoernig, Steffen
    Valletti, Tommaso M.
    B E JOURNAL OF THEORETICAL ECONOMICS, 2011, 11 (01):
  • [49] Study on calculation method of Two-part heat price
    Wang, Limei
    Wang, Zhenhai
    AUTOMATION EQUIPMENT AND SYSTEMS, PTS 1-4, 2012, 468-471 : 2573 - +
  • [50] Marijuana Use among Juvenile Arrestees: A Two-Part Growth Model Analysis
    Dembo, Richard
    Wareham, Jennifer
    Greenbaum, Paul E.
    Childs, Kristina
    Schmeidler, James
    JOURNAL OF CHILD & ADOLESCENT SUBSTANCE ABUSE, 2009, 18 (04) : 379 - 397