Systematic characterization and efficient prediction of cobalamin C deficiency clinical phenotypes using network analysis and deep learning on multi-omics data

Cited by: 0
Authors
Li, Ze-Yu [1]
Liu, Xiao-Ying [1]
Xiao, Wen [2]
Yang, Jiang-Tao [2]
Jiang, Pan-Pan [2]
Wu, Ben-Qing [3]
Liu, Xiang-Ju [4]
Xue, Ming [4]
Lv, Hui-Jing [5]
Zhou, Shi-Hao [6]
Yang, Qin [1]
Xu, Lu [7]
Yang, Yan-Ling [8]
Affiliations
[1] Yangtze Univ, Sch Phys & Optoelect Engn, Jingzhou 434023, Peoples R China
[2] Shenzhen Aone Med Lab Co Ltd, Shenzhen Rare Dis Engn Res Ctr Metabol Precis Med, Shenzhen 518000, Peoples R China
[3] Shenzhen Guangming Dist Peoples Hosp, Dept Pediat, Shenzhen 518000, Peoples R China
[4] Taian Matern & Child Care Hosp, Genet Diagnost Lab, Tai An 271000, Peoples R China
[5] Xingtai Maternal & Child Hlth Hosp, Dept neonatal screening Ctr, Xingtai 054000, Peoples R China
[6] Hunan Normal Univ, Changsha Hosp Maternal & Child Hlth Care, Dept Genet Eugenics, Changsha 410007, Peoples R China
[7] Tongren Univ, Sch Sports & Hlth Sci, Tongren 554300, Peoples R China
[8] Peking Univ, Dept Pediat, Hosp 1, Beijing 100034, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Cobalamin C deficiency; Systematic characterization and efficient prediction of clinical phenotypes; CTD-based network analysis; Clinical phenotype-specific main disease module; Hybrid data structural representation; GCN-based multi-omics learning; METABOLIC SYNDROME; INBORN-ERRORS; PROTEOMICS
DOI
10.1016/j.microc.2024.112018
Chinese Library Classification (CLC) number
O65 [Analytical Chemistry]
Discipline classification codes
070302; 081704
Abstract
As a monogenic disease, cobalamin C (cblC) deficiency lacks a clear correlation between pathogenic gene mutations and its spectrum of disease phenotypes, which makes it necessary to understand the molecular mechanisms by which its diverse clinical phenotypes emerge. This work aimed to disentangle the phenotypic complexity of cblC deficiency via network analysis and deep learning on multi-omics (proteomics and metabolomics) data. For this purpose, a novel computational framework was developed to systematically characterize and efficiently predict the clinical phenotypes of cblC deficiency, combining a Connect the Dots (CTD)-based Hybrid data Structural Representation of each patient with graph convolutional network (GCN)-based Multi-Omics Learning (CTDHSR-GCNMOL). The CTD algorithm enabled the identification of relevant perturbed proteins or metabolites and the construction of a clinical phenotype-specific co-perturbation network. The GCN allowed efficient learning of subtle change patterns across clinical phenotypes, drawing not only on the hybrid feature description (Euclidean structure hybridized with non-Euclidean structure) of each patient but also on the interactions between patients captured by a sample similarity network. In an evaluation on three clinical phenotypes (epilepsy, developmental delay and metabolic syndrome), CTDHSR-GCNMOL identified subsets of perturbed proteins or metabolites highly specific to each clinical phenotype and established a main disease module (network) for each, enabling systematic characterization. For proteomics, epilepsy was characterized by dysregulation of the previously reported TAGLN2, SH3BGRL3 and LTA4H, and developmental delay by dysregulation of the previously reported HSP90AB1, PRDX1, GDI2, VIM, PNP and BLVRA, with high confidence (selection frequencies).
For untargeted metabolomics in negative ion mode, the disease status of metabolic syndrome could be well interpreted by the disturbed pathways of the 20 top-ranked perturbed metabolites, all of which have been reported in previous studies to be closely related to its pathogenesis. These disturbed pathways involved butanoate metabolism; purine metabolism; alanine, aspartate and glutamate metabolism; pyrimidine metabolism; fructose and mannose metabolism; galactose metabolism; amino sugar and nucleotide sugar metabolism; and steroid hormone biosynthesis. By hybridizing the abundances of perturbed proteins and metabolites with the topological structures of the patient-specific perturbation networks, CTDHSR-GCNMOL achieved the desired prediction performance across the three clinical phenotypes and outperformed the traditional block PLSDA. Together, these findings verified the effectiveness of CTDHSR-GCNMOL in gaining useful insights into the phenotypic complexity of cblC deficiency and in guiding targeted treatment strategies.
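To make the idea of graph convolution over a sample similarity network more concrete, the following is a minimal NumPy sketch of one generic GCN propagation step, H' = ReLU(D^{-1/2}(A + I)D^{-1/2} H W). It is not the authors' CTDHSR-GCNMOL implementation: the cosine-similarity k-nearest-neighbour graph, the function names (`knn_similarity_graph`, `normalize_adjacency`, `gcn_layer`), the random feature matrix and the random weights are all assumptions made purely for illustration.

```python
import numpy as np

def knn_similarity_graph(X, k=2):
    """Symmetric k-NN adjacency from cosine similarity between samples (assumed construction)."""
    Xn = X / np.linalg.norm(X, axis=1, keepdims=True)
    S = Xn @ Xn.T
    np.fill_diagonal(S, -np.inf)            # exclude each sample from its own neighbour list
    nbrs = np.argsort(-S, axis=1)[:, :k]    # indices of the k most similar samples per row
    A = np.zeros(S.shape)
    rows = np.repeat(np.arange(X.shape[0]), k)
    A[rows, nbrs.ravel()] = 1.0
    return np.maximum(A, A.T)               # symmetrize: keep an edge if either side chose it

def normalize_adjacency(A):
    """Return D^{-1/2} (A + I) D^{-1/2}, the normalization commonly used in GCNs."""
    A_hat = A + np.eye(A.shape[0])          # add self-loops
    d_inv_sqrt = 1.0 / np.sqrt(A_hat.sum(axis=1))
    return A_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

def gcn_layer(A_norm, H, W):
    """One graph-convolution step: aggregate neighbour features, project, apply ReLU."""
    return np.maximum(A_norm @ H @ W, 0.0)

# Placeholder data: 6 hypothetical patients with 5 hybrid features each.
rng = np.random.default_rng(0)
X = rng.normal(size=(6, 5))
A = knn_similarity_graph(X, k=2)
A_norm = normalize_adjacency(A)
W = rng.normal(size=(5, 3))                 # untrained weights, for shape illustration only
H1 = gcn_layer(A_norm, X, W)                # one 3-dimensional embedding per patient
```

In a trained model, `W` would be learned from labelled patients and several such layers would typically be stacked before a classifier; the sketch only shows how each patient's representation mixes with those of similar patients.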
Pages: 11