Deep latent space fusion for adaptive representation of heterogeneous multi-omics data

被引:28
|
作者
Zhang, Chengming
Chen, Yabin
Zeng, Tao
Zhang, Chuanchao
Chen, Luonan
机构
[1] School of Mathematics and Statistics, Shandong University
[2] School of Life and Pharmaceutical Sciences, Dalian University of Technology
[3] Wuhan University, Wuhan
[4] The Huazhong University of Science and Technology, Wuhan
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
deep learning; latent space fusion; adaptive representation; omics data; complex disease; DATA INTEGRATION; VARIABLE MODEL; CANCER; NETWORK; BIOMARKERS; CLASSIFICATION; IDENTIFICATION; DISEASES; BREAST;
D O I
10.1093/bib/bbab600
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
The integration of multi-omics data makes it possible to understand complex biological organisms at the system level. Numerous integration approaches have been developed by assuming a common underlying data space. Due to the noise and heterogeneity of biological data, the performance of these approaches is greatly affected. In this work, we propose a novel deep neural network architecture, named Deep Latent Space Fusion (DLSF), which integrates the multi-omics data by learning consistent manifold in the sample latent space for disease subtypes identification. DLSF is built upon a cycle autoencoder with a shared self-expressive layer, which can naturally and adaptively merge nonlinear features at each omics level into one unified sample manifold and produce adaptive representation of heterogeneous samples at the multi-omics level. We have assessed DLSF on various biological and biomedical datasets to validate its effectiveness. DLSF can efficiently and accurately capture the intrinsic manifold of the sample structures or sample clusters compared with other state-of-the-art methods, and DLSF yielded more significant outcomes for biological significance, survival prognosis and clinical relevance in application of cancer study in The Cancer Genome Atlas. Notably, as a deep case study, we determined a new molecular subtype of kidney renal clear cell carcinoma that may benefit immunotherapy in the viewpoint of multi-omics, and we further found potential subtype-specific biomarkers from multiple omics data, which were validated by independent datasets. In addition, we applied DLSF to identify potential therapeutic agents of different molecular subtypes of chronic lymphocytic leukemia, demonstrating the scalability of DLSF in diverse omics data types and application scenarios.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Deep structure integrative representation of multi-omics data for cancer subtyping
    Yang, Bo
    Yang, Yan
    Su, Xueping
    BIOINFORMATICS, 2022,
  • [2] Deep structure integrative representation of multi-omics data for cancer subtyping
    Yang, Bo
    Yang, Yan
    Su, Xueping
    BIOINFORMATICS, 2022, 38 (13) : 3337 - 3342
  • [3] Representation Learning for the Clustering of Multi-Omics Data
    Viaud, Gautier
    Mayilvahanan, Prasanna
    Cournede, Paul-Henry
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2022, 19 (01) : 135 - 145
  • [4] AFEI: adaptive optimized vertical federated learning for heterogeneous multi-omics data integration
    Wang, Qingyong
    He, Minfan
    Guo, Longyi
    Chai, Hua
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (05)
  • [5] Capturing the latent space of an Autoencoder for multi-omics integration and cancer subtyping
    Madhumita
    Paul, Sushmita
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 148
  • [6] A benchmark study of deep learning-based multi-omics data fusion methods for cancer
    Dongjin Leng
    Linyi Zheng
    Yuqi Wen
    Yunhao Zhang
    Lianlian Wu
    Jing Wang
    Meihong Wang
    Zhongnan Zhang
    Song He
    Xiaochen Bo
    Genome Biology, 23
  • [7] A benchmark study of deep learning-based multi-omics data fusion methods for cancer
    Leng, Dongjin
    Zheng, Linyi
    Wen, Yuqi
    Zhang, Yunhao
    Wu, Lianlian
    Wang, Jing
    Wang, Meihong
    Zhang, Zhongnan
    He, Song
    Bo, Xiaochen
    GENOME BIOLOGY, 2022, 23 (01)
  • [8] An extension of latent unknown clustering integrating multi-omics data (LUCID) incorporating incomplete omics data
    Zhao, Yinqi
    Jia, Qiran
    Goodrich, Jesse
    Darst, Burcu
    Conti, David, V
    BIOINFORMATICS ADVANCES, 2024, 4 (01):
  • [9] Multi -view spectral clustering with latent representation learning for applications on multi-omics cancer subtyping
    Ge, Shuguang
    Liu, Jian
    Cheng, Yuhu
    Meng, Xiaojing
    Wang, Xuesong
    BRIEFINGS IN BIOINFORMATICS, 2023, 24 (01)
  • [10] Survey on Multi-omics, and Multi-omics Data Analysis, Integration and Application
    Shahrajabian, Mohamad Hesam
    Sun, Wenli
    CURRENT PHARMACEUTICAL ANALYSIS, 2023, 19 (04) : 267 - 281