Graph-Guided Bayesian Factor Model for Integrative Analysis of Multi-modal Data with Noisy Network Information

被引:0
|
作者
Li, Wenrui [1 ]
Zhang, Qiyiwen [2 ]
Qu, Kewen [3 ]
Long, Qi [3 ]
机构
[1] Univ Connecticut, Dept Stat, 215 Glenbrook Rd, Storrs, CT 06269 USA
[2] Univ Pittsburgh, Dept Med, 3550 Terrace St, Pittsburgh, PA 15261 USA
[3] Univ Penn, Dept Biostat Epidemiol & Informat, 423 Guardian Dr, Philadelphia, PA 19104 USA
关键词
Bayesian shrinkage; Factor analysis; Latent scale network model; MCMC algorithm; Noisy graph; INVERSE COVARIANCE ESTIMATION; VARIABLE SELECTION; GENES; JOINT;
D O I
10.1007/s12561-024-09452-7
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
There is a growing body of literature on factor analysis that can capture individual and shared structures in multi-modal data. However, few of these approaches incorporate biological knowledge such as functional genomics and functional metabolomics. Graph-guided statistical learning methods that can incorporate knowledge of underlying networks have been shown to improve predication and classification accuracy, and yield more interpretable results. Moreover, these methods typically use graphs extracted from existing databases or rely on subject matter expertise which are known to be incomplete and may contain false edges. To address this gap, we propose a graph-guided Bayesian factor model that can account for network noise and identify globally shared, partially shared and modality-specific latent factors in multi-modal data. Specifically, we use two sources of network information, including the noisy graph extracted from existing databases and the estimated graph from observed features in the dataset at hand, to inform the model for the true underlying network via a latent scale modeling framework. This model is coupled with the Bayesian factor analysis model with shrinkage priors to encourage feature-wise and modal-wise sparsity, thereby allowing feature selection and identification of factors of each type. We develop an efficient Markov chain Monte Carlo algorithm for posterior sampling. We demonstrate the advantages of our method over existing methods in simulations, and through analyses of gene expression and metabolomics datasets for Alzheimer's disease.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] A double-branch convolutional neural network model for species identification based on multi-modal data
    Sun, Yuxin
    Tian, Ye
    Zhang, Yiyi
    Yu, Mengting
    Su, Xiaoquan
    Wang, Qi
    Guo, Jinjia
    Lu, Yuan
    Ren, Lihui
    SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY, 2024, 318
  • [42] Multi-modal information analysis for fault diagnosis with time-series data from power transformer
    Xing, Zhikai
    He, Yigang
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2023, 144
  • [43] Examining Sustainable Overseas Investment Information-Sharing Model for Automobile Enterprises: A Multi-Modal Weight Network Approach
    Cheng, Yuan
    Chen, Xiaofang
    Lin, Changbo
    Ma, Sheqing
    Feng, Jie
    JOURNAL OF THE KNOWLEDGE ECONOMY, 2024, 15 (4) : 17705 - 17725
  • [44] M3GAT: A Multi-modal, Multi-task Interactive Graph Attention Network for Conversational Sentiment Analysis and Emotion Recognition
    Zhang, Yazhou
    Jia, Ao
    Wang, Bo
    Zhang, Peng
    Zhao, Dongming
    Li, Pu
    Hou, Yuexian
    Jin, Xiaojia
    Song, Dawei
    Qin, Jing
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2024, 42 (01)
  • [45] Outcome Prediction Using Multi-Modal Information: Integrating Large Language Model-Extracted Clinical Information and Image Analysis
    Sun, Di
    Hadjiiski, Lubomir
    Gormley, John
    Chan, Heang-Ping
    Caoili, Elaine
    Cohan, Richard
    Alva, Ajjai
    Bruno, Grace
    Mihalcea, Rada
    Zhou, Chuan
    Gulani, Vikas
    CANCERS, 2024, 16 (13)
  • [46] Alzheimer's disease diagnosis from multi-modal data via feature inductive learning and dual multilevel graph neural network
    Lei, Baiying
    Li, Yafeng
    Fu, Wanyi
    Yang, Peng
    Chen, Shaobin
    Wang, Tianfu
    Xiao, Xiaohua
    Niu, Tianye
    Fu, Yu
    Wang, Shuqiang
    Han, Hongbin
    Qin, Jing
    MEDICAL IMAGE ANALYSIS, 2024, 97
  • [47] Information quality mapping in resource-constrained multi-modal data fusion system over wireless sensor network with losses
    Tolstikov, Andrei
    Tham, Chen-Khong
    Xiao, Wendong
    Biswas, Jit
    2007 6TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS & SIGNAL PROCESSING, VOLS 1-4, 2007, : 1432 - +
  • [48] A fully Bayesian latent variable model for integrative clustering analysis of multi-type omics data
    Mo, Qianxing
    Shen, Ronglai
    Guo, Cui
    Vannucci, Marina
    Chan, Keith S.
    Hilsenbeck, Susan G.
    BIOSTATISTICS, 2018, 19 (01) : 71 - 86
  • [49] A Multi-Modal Convolutional Neural Network Model for Intelligent Analysis of the Influence of Music Genres on Children's Emotions
    Qian, Qingfang
    Chen, Xiaofeng
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [50] Calibrating a transit assignment model using smart card data in a large-scale multi-modal transit network
    Tavassoli, Ahmad
    Mesbah, Mahmoud
    Hickman, Mark
    TRANSPORTATION, 2020, 47 (05) : 2133 - 2156