Joint latent space models for network data with high-dimensional node variables

被引:8
|
作者
Zhang, Xuefei [1 ]
Xu, Gongjun [1 ]
Zhu, Ji [1 ]
机构
[1] Univ Michigan, Dept Stat, 1085 South Univ Ave, Ann Arbor, MI 48109 USA
基金
美国国家科学基金会;
关键词
High-dimensional data; Latent space model; Network analysis; COMMUNITY DETECTION;
D O I
10.1093/biomet/asab063
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Network latent space models assume that each node is associated with an unobserved latent position in a Euclidean space, and such latent variables determine the probability of two nodes connecting with each other. In many applications, nodes in the network are often observed along with high-dimensional node variables, and these node variables provide important information for understanding the network structure. However, classical network latent space models have several limitations in incorporating node variables. In this paper, we propose a joint latent space model where we assume that the latent variables not only explain the network structure, but are also informative for the multivariate node variables. We develop a projected gradient descent algorithm that estimates the latent positions using a criterion incorporating both network structure and node variables. We establish theoretical properties of the estimators and provide insights into how incorporating high-dimensional node variables could improve the estimation accuracy of the latent positions. We demonstrate the improvement in latent variable estimation and the improvements in associated downstream tasks, such as missing value imputation for node variables, by simulation studies and an application to a Facebook data example.
引用
收藏
页码:707 / 720
页数:14
相关论文
共 50 条
  • [1] High-dimensional factor copula models with estimation of latent variables
    Fan, Xinyao
    Joe, Harry
    JOURNAL OF MULTIVARIATE ANALYSIS, 2024, 201
  • [2] Latent class models for joint analysis of disease prevalence and high-dimensional semicontinuous biomarker data
    Zhang, Bo
    Chen, Zhen
    Albert, Paul S.
    BIOSTATISTICS, 2012, 13 (01) : 74 - 88
  • [3] Joint latent space models for ranking data and social network
    Jiaqi Gu
    Philip L. H. Yu
    Statistics and Computing, 2022, 32
  • [4] Joint latent space models for ranking data and social network
    Gu, Jiaqi
    Yu, Philip L. H.
    STATISTICS AND COMPUTING, 2022, 32 (03)
  • [5] Supervised Bayesian latent class models for high-dimensional data
    Desantis, Stacia M.
    Houseman, E. Andres
    Coull, Brent A.
    Nutt, Catherine L.
    Betensky, Rebecca A.
    STATISTICS IN MEDICINE, 2012, 31 (13) : 1342 - 1360
  • [6] Joint hierarchical models for sparsely sampled high-dimensional LiDAR and forest variables
    Finley, Andrew O.
    Banerjee, Sudipto
    Zhou, Yuzhen
    Cook, Bruce D.
    Babcock, Chad
    REMOTE SENSING OF ENVIRONMENT, 2017, 190 : 149 - 161
  • [7] Regression analysis on high-dimensional, block diagonal structure data with focus on latent variables
    Seki, Shinei
    Nagata, Yasushi
    MATHEMATICAL METHODS AND COMPUTATIONAL TECHNIQUES IN SCIENCE AND ENGINEERING II, 2018, 1982
  • [8] On node models for high-dimensional road networks
    Wright, Matthew A.
    Gomes, Gabriel
    Horowitz, Roberto
    Kurzhanskiy, Alex A.
    TRANSPORTATION RESEARCH PART B-METHODOLOGICAL, 2017, 105 : 212 - 234
  • [9] High-dimensional conditionally Gaussian state space models with missing data
    Chan, Joshua C. C.
    Poon, Aubrey
    Zhu, Dan
    JOURNAL OF ECONOMETRICS, 2023, 236 (01)
  • [10] Network Infrastructure Visualisation Using High-Dimensional Node-Attribute Data
    Gibson, Helen
    Vickers, Paul
    2012 IEEE CONFERENCE ON VISUAL ANALYTICS SCIENCE AND TECHNOLOGY (VAST), 2012, : 293 - 294