Exploring the correlation between DNA methylation and biological age using an interpretable machine learning framework

Cited by: 0
Authors
Zhou, Sheng [1]
Chen, Jing [2]
Wei, Shanshan [1]
Zhou, Chengxing [3]
Wang, Die [4]
Yan, Xiaofan [5]
He, Xun [5]
Yan, Pengcheng [6]
Affiliations
[1] Guizhou Med Univ, Dept Publ Hlth & Hlth, Guiyang, Guizhou, Peoples R China
[2] Guizhou Prov Drug Adm Inspect Ctr, Guiyang, Guizhou, Peoples R China
[3] Guizhou Med Univ, Sch Biology & Engineering, Sch Hlth Med Modern Ind, Guiyang, Guizhou, Peoples R China
[4] Guizhou Med Univ, Coll Anesthesia, Guiyang, Guizhou, Peoples R China
[5] Guizhou Med Univ, Sch Med & Hlth Management, Guiyang, Guizhou, Peoples R China
[6] Guizhou Med Univ, Sch Clin Med, Guiyang, Guizhou, Peoples R China
Source
SCIENTIFIC REPORTS | 2024 / Vol. 14 / Issue 01
Keywords
DNA methylation; Biological age; GO enrichment analysis; XGBoost; Interpretable machine learning; Shapley Additive exPlanations
DOI
10.1038/s41598-024-75586-9
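Chinese Library Classification (CLC) codes
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification codes
07; 0710; 09
Abstract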
DNA methylation plays a significant role in regulating transcription and changes systematically with age. These changes can be used to predict an individual's age. This study had two aims: first, to identify methylation sites associated with biological age; second, to construct a biological age prediction model and to preliminarily explore the biological significance of methylation-associated genes using machine learning. A biological age prediction model was constructed from human methylation data through data preprocessing, feature selection, statistical analysis, and machine learning techniques. Subsequently, 15 methylation data sets were analyzed in depth using SHAP, GO enrichment, and KEGG analysis. XGBoost, LightGBM, and CatBoost identified 15 groups of methylation sites associated with biological age. SHAP values identified the cg23995914 locus as the most significant contributor to the biological age prediction. Furthermore, GO enrichment and KEGG analyses were used for an initial exploration of the biological significance of the methylated loci.
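The abstract ranks methylation sites by their SHAP values. As a minimal sketch of the underlying idea, the Python below computes exact Shapley values for a single sample and ranks the features by absolute contribution. Everything in it is an assumption made for illustration: `toy_predict` is a hand-written stand-in for a trained age model (not XGBoost, LightGBM, or CatBoost), its coefficients are invented, the site names `site_A`, `site_B`, and `site_C` are hypothetical, and the methylation values are synthetic. It is not the authors' pipeline and does not use the `shap` library.

```python
from itertools import combinations
from math import factorial


def shapley_values(predict, x, baseline):
    """Exact Shapley values for one sample x relative to a baseline input.

    Features outside a coalition are set to their baseline value.
    The cost is exponential in the number of features, so this is only
    practical for a handful of features.
    """
    n = len(x)
    phi = [0.0] * n
    for i in range(n):
        others = [j for j in range(n) if j != i]
        for size in range(n):
            # Shapley weight for coalitions of this size
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for subset in combinations(others, size):
                with_i = [x[j] if (j in subset or j == i) else baseline[j]
                          for j in range(n)]
                without_i = [x[j] if j in subset else baseline[j]
                             for j in range(n)]
                phi[i] += weight * (predict(with_i) - predict(without_i))
    return phi


def toy_predict(m):
    """Hypothetical age model over three methylation beta values (invented)."""
    return 40 + 60 * m[0] - 20 * m[1] + 5 * m[2] + 10 * m[0] * m[1]


sites = ["site_A", "site_B", "site_C"]   # hypothetical site labels
sample = [0.8, 0.3, 0.6]                 # synthetic beta values for one person
baseline = [0.5, 0.5, 0.5]               # synthetic reference (e.g. cohort mean)

phi = shapley_values(toy_predict, sample, baseline)

# Rank sites by the magnitude of their contribution to this prediction
ranking = [s for s, _ in sorted(zip(sites, phi), key=lambda p: -abs(p[1]))]

for site, value in zip(sites, phi):
    print(f"{site}: {value:+.2f}")
print("ranking:", ranking)
```

The contributions sum to the difference between the prediction for the sample and the prediction for the baseline, which is the property that lets per-site values be read as shares of the predicted age. For tree ensembles with many sites, exact enumeration is infeasible and a dedicated tree-based estimator would normally be used instead.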
Pages: 13