Systematic characterization and efficient prediction of cobalamin C deficiency clinical phenotypes using network analysis and deep learning on multi-omics data

被引:0
|
作者
Li, Ze-Yu [1 ]
Liu, Xiao-Ying [1 ]
Xiao, Wen [2 ]
Yang, Jiang-Tao [2 ]
Jiang, Pan-Pan [2 ]
Wu, Ben-Qing [3 ]
Liu, Xiang-Ju [4 ]
Xue, Ming [4 ]
Lv, Hui-Jing [5 ]
Zhou, Shi-Hao [6 ]
Yang, Qin [1 ]
Xu, Lu [7 ]
Yang, Yan-Ling [8 ]
机构
[1] Yangtze Univ, Sch Phys & Optoelect Engn, Jingzhou 434023, Peoples R China
[2] Shenzhen Aone Med Lab Co Ltd, Shenzhen Rare Dis Engn Res Ctr Metabol Precis Med, Shenzhen 518000, Peoples R China
[3] Shenzhen Guangming Dist Peoples Hosp, Dept Pediat, Shenzhen 518000, Peoples R China
[4] Taian Matern & Child Care Hosp, Genet Diagnost Lab, Tai An 271000, Peoples R China
[5] Xingtai Maternal & Child Hlth Hosp, Dept neonatal screening Ctr, Xingtai 054000, Peoples R China
[6] Hunan Normal Univ, Changsha Hosp Maternal & Child Hlth Care, Dept Genet Eugenics, Changsha 410007, Peoples R China
[7] Tongren Univ, Sch Sports & Hlth Sci, Tongren 554300, Peoples R China
[8] Peking Univ, Dept Pediat, Hosp 1, Beijing 100034, Peoples R China
基金
中国国家自然科学基金;
关键词
Cobalamin C deficiency; Systematic characterization and efficient; prediction of clinical phenotypes; CTD-based network analysis; Clinical phenotype-specific main disease mod-; ule; Hybrid data structural representation; GCN-based multi-omics learning; METABOLIC SYNDROME; INBORN-ERRORS; PROTEOMICS;
D O I
10.1016/j.microc.2024.112018
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
As a monogenic disease, cobalamin C (cblC) deficiency lacks a clear correlation between gene pathogenic mutations and its spectrum of disease phenotypes, necessitating the understanding of molecular mechanisms how diverse clinical phenotypes emerge. This work aimed to disentangle the phenotypic complexity of cblC deficiency via network analysis and deep learning on multi-omics (proteomics and metabolomics) data. For this purpose, a novel computational framework was developed to systematically characterize and efficiently predict clinical phenotypes of cblC deficiency utilizing a Connect the Dots (CTD)-based Hybrid data Structural Representation of each patient and graph convolutional network (GCN)-based Multi-Omics Learning (CTDHSR-GCNMOL). CTD algorithm enabled the identification of relevant perturbed proteins or metabolites and the construction of clinical phenotype-specific co-perturbation network. GCN allowed efficient learning of subtle change patterns across clinical phenotypes not only depending on the hybrid feature description (Euclidean structure hybridized with non-Euclidean structure) of each patient but on the interaction exploration between patients offered by sample similarity network. Investigated by three clinical phenotypes (epilepsy, developmental delay and metabolic syndrome), the results showed that CTDHSR-GCNMOL identified the subsets of perturbed proteins or metabolites highly specific to each clinical phenotype and established each main disease module (network) for systematic characterization. For proteomics, epilepsy was characterized by the dysregulation of reported TAGLN2, SH3BGRL3 and LTA4H, and developmental delay was characterized by the dysregulation of reported HSP90AB1, PRDX1, GDI2, VIM, PNP and BLVRA with high confidence (selection frequencies). For untargeted metabolomics in negative ion mode, the disease status of metabolic syndrome could be well interpreted by the disturbed pathways of the top-ranked 20 perturbed metabolites all of which have been reported to be closely related with its pathogenesis in previous studies. These disturbed pathways involved butanoate metabolism, purine metabolism, alanine, aspartate and glutamate metabolism, pyrimidine metabolism, fructose and mannose metabolism, galactose metabolism, amino sugar and nucleotide sugar metabolism, and steroid hormone biosynthesis. Based on the hybridization of the abundances of perturbed proteins and metabolites with the topological structures of patient-specific perturbation network, CTDHSR-GCNMOL yielded desired prediction performance across three clinical phenotypes and outperformed the traditional block PLSDA. All these findings verified the effectiveness of CTDHSR-GCNMOL in gaining useful insights into the phenotypic complexity of cblC deficiency and guiding its targeted treatment strategies.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Deep Learning for Integrated Analysis of Breast Cancer Subtype Specific Multi-omics Data
    Rakshit, Somnath
    Saha, Indrajit
    Chakraborty, Subha Shankar
    Plewczyski, Dariusz
    PROCEEDINGS OF TENCON 2018 - 2018 IEEE REGION 10 CONFERENCE, 2018, : 1917 - 1922
  • [22] Unsupervised classification of multi-omics data during cardiac remodeling using deep learning
    Chung, Neo Christopher
    Mirza, Bilal
    Choi, Howard
    Wang, Jie
    Wang, Ding
    Ping, Peipei
    Wang, Wei
    METHODS, 2019, 166 : 66 - 73
  • [23] The Three Pillars of Glioblastoma: A Systematic Review and Novel Analysis of Multi-Omics and Clinical Data
    De Luca, Ciro
    Virtuoso, Assunta
    Papa, Michele
    Cirillo, Giovanni
    La Rocca, Giuseppe
    Corvino, Sergio
    Barbarisi, Manlio
    Altieri, Roberto
    CELLS, 2024, 13 (21)
  • [24] Machine learning and integrative multi-omics network analysis for survival prediction in acute myeloid leukemia
    Kosvyra Α.
    Karadimitris Α.
    Papaioannou Μ.
    Chouvarda I.
    Computers in Biology and Medicine, 2024, 178
  • [25] A Contrastive-Learning-Based Deep Neural Network for Cancer Subtyping by Integrating Multi-Omics Data
    Chai, Hua
    Deng, Weizhen
    Wei, Junyu
    Guan, Ting
    He, Minfan
    Liang, Yong
    Li, Le
    INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2024, 16 (04) : 966 - 975
  • [26] Performance Comparison of Deep Learning Autoencoders for Cancer Subtype Detection Using Multi-Omics Data
    Franco, Edian F.
    Rana, Pratip
    Cruz, Aline
    Calderon, Victor V.
    Azevedo, Vasco
    Ramos, Rommel T. J.
    Ghosh, Preetam
    CANCERS, 2021, 13 (09)
  • [27] Functional Analysis of Molecular Subtypes with Deep Similarity Learning Model Based on Multi-omics Data
    Liu, Shuhui
    Zhang Yupei
    Shang, Xuequn
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2022, PT II, 2022, 13394 : 126 - 137
  • [28] Diagnostic Classification of Lung Cancer Using Deep Transfer Learning Technology and Multi-Omics Data
    Rong, Z. H. U.
    Lingyun, D. A., I
    Jinxing, L. I. U.
    Ying, G. U. O.
    CHINESE JOURNAL OF ELECTRONICS, 2021, 30 (05) : 843 - 852
  • [29] Deep learning-based ovarian cancer subtypes identification using multi-omics data
    Long-Yi Guo
    Ai-Hua Wu
    Yong-xia Wang
    Li-ping Zhang
    Hua Chai
    Xue-Fang Liang
    BioData Mining, 13
  • [30] Prediction of metabolic risk in childhood obesity using machine learning models with multi-omics data
    Torres-Martos, A.
    Anguita-Ruiz, A.
    Bustos-Aibar, M.
    Alcala, R.
    Alcala-Fdez, J.
    Aguilera, C. M.
    ANNALS OF NUTRITION AND METABOLISM, 2022, 78 (SUPPL 3) : 22 - 22