Decomposition of variation of mixed variables by a latent mixed Gaussian copula model

被引:1
|
作者
Liu, Yutong [1 ]
Darville, Toni [2 ]
Zheng, Xiaojing [1 ,2 ]
Li, Quefeng [1 ]
机构
[1] Univ N Carolina, Dept Biostat, Chapel Hill, NC 27515 USA
[2] Univ N Carolina, Dept Pediat, Chapel Hill, NC 27515 USA
基金
美国国家卫生研究院;
关键词
high-dimensional matrix estimation; Kendall's tau; latent Gaussian copula model; variation decomposition; GRAPHICAL MODEL; INFECTION; NUMBER; CELLS; JOINT; AXIS; SETS;
D O I
10.1111/biom.13660
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Many biomedical studies collect data of mixed types of variables from multiple groups of subjects. Some of these studies aim to find the group-specific and the common variation among all these variables. Even though similar problems have been studied by some previous works, their methods mainly rely on the Pearson correlation, which cannot handle mixed data. To address this issue, we propose a latent mixed Gaussian copula (LMGC) model that can quantify the correlations among binary, ordinal, continuous, and truncated variables in a unified framework. We also provide a tool to decompose the variation into the group-specific and the common variation over multiple groups via solving a regularized M-estimation problem. We conduct extensive simulation studies to show the advantage of our proposed method over the Pearson correlation-based methods. We also demonstrate that by jointly solving the M-estimation problem over multiple groups, our method is better than decomposing the variation group by group. We also apply our method to a Chlamydia trachamatis genital tract infection study to demonstrate how it can be used to discover informative biomarkers that differentiate patients.
引用
收藏
页码:1187 / 1200
页数:14
相关论文
共 50 条
  • [31] Inclusion of latent variables in Mixed Logit models: Modelling and forecasting
    Yanez, M. F.
    Raveau, S.
    Ortuzar, J. de D.
    TRANSPORTATION RESEARCH PART A-POLICY AND PRACTICE, 2010, 44 (09) : 744 - 753
  • [32] Factor analysis with (mixed) observed and latent variables in the exponential family
    Michel Wedel
    Wagner A. Kamakura
    Psychometrika, 2001, 66 : 515 - 530
  • [33] Joint modelling of mixed outcome types using latent variables
    McCulloch, Charles
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2008, 17 (01) : 53 - 73
  • [34] Factor analysis with (mixed) observed and latent variables in the exponential family
    Wedel, M
    Kamakura, WA
    PSYCHOMETRIKA, 2001, 66 (04) : 515 - 530
  • [35] Selection of Latent Variables for Multiple Mixed-outcome Models
    Zhou, Ling
    Lin, Huazhen
    Song, Xinyuan
    Li, Yi
    SCANDINAVIAN JOURNAL OF STATISTICS, 2014, 41 (04) : 1064 - 1082
  • [36] Robust parameter design of mixed multiple responses based on a latent variable Gaussian process model
    Zhai, Cuihong
    Wang, Jianjun
    Feng, Zebiao
    Ma, Yan
    Deng, Haisong
    ENGINEERING OPTIMIZATION, 2023, 55 (10) : 1760 - 1777
  • [37] A choice model for mixed decision variables
    Lee, Sanghak
    Kim, Hyowon
    Kim, Jaehwan
    Allenby, Greg M.
    JOURNAL OF CHOICE MODELLING, 2018, 28 : 82 - 96
  • [38] A general Isserlis theorem for mixed-Gaussian random variables
    Michalowicz, J. V.
    Nichols, J. M.
    Bucholtz, F.
    Olson, C. C.
    STATISTICS & PROBABILITY LETTERS, 2011, 81 (08) : 1233 - 1240
  • [39] Analysing demand for suburban trips:: A mixed RP/SP model with latent variables and interaction effects
    Espino, R
    Román, C
    Ortúzar, JD
    TRANSPORTATION, 2006, 33 (03) : 241 - 261
  • [40] Analysing Demand for Suburban Trips: A Mixed RP/SP Model with Latent Variables and Interaction Effects
    Raquel Espino
    Concepción Román
    Juan Dios De Ortúzar
    Transportation, 2006, 33 : 241 - 261