Predicting lifespan-extending chemical compounds for C. elegans with machine learning and biologically interpretable features

被引:0
|
作者
Ribeiro, Caio [1 ]
Farmer, Christopher K. [2 ]
de Magalhaes, Joao Pedro [3 ]
Freitas, Alex A. [1 ]
机构
[1] Univ Kent, Sch Comp, Canterbury, England
[2] Univ Kent, Ctr Hlth Serv Studies, Canterbury, England
[3] Univ Birmingham, Inst Inflammat & Ageing, Genom Ageing & Rejuvenat Lab, Birmingham, England
来源
AGING-US | 2023年 / 15卷 / 13期
基金
英国生物技术与生命科学研究理事会;
关键词
lifespan-extension compounds; longevity drugs; machine learning; feature selection; GENE ONTOLOGY; METABOLISM; DATABASE;
D O I
暂无
中图分类号
Q2 [细胞生物学];
学科分类号
071009 ; 090102 ;
摘要
Recently, there has been a growing interest in the development of pharmacological interventions targeting ageing, as well as in the use of machine learning for analysing ageing-related data. In this work, we use machine learning methods to analyse data from DrugAge, a database of chemical compounds (including drugs) modulating lifespan in model organisms. To this end, we created four types of datasets for predicting whether or not a compound extends the lifespan of C. elegans (the most frequent model organism in DrugAge), using four different types of predictive biological features, based on: compound-protein interactions, interactions between compounds and proteins encoded by ageing-related genes, and two types of terms annotated for proteins targeted by the compounds, namely Gene Ontology (GO) terms and physiology terms from the WormBase's Phenotype Ontology. To analyse these datasets, we used a combination of feature selection methods in a data pre-processing phase and the well-established random forest algorithm for learning predictive models from the selected features. In addition, we interpreted the most important features in the two best models in light of the biology of ageing. One noteworthy feature was the GO term "Glutathione metabolic process", which plays an important role in cellular redox homeostasis and detoxification. We also predicted the most promising novel compounds for extending lifespan from a list of previously unlabelled compounds. These include nitroprusside, which is used as an antihypertensive medication. Overall, our work opens avenues for future work in employing machine learning to predict novel life-extending compounds.
引用
收藏
页码:6073 / 6099
页数:27
相关论文
共 44 条
  • [41] Using machine learning to speed up manual image annotation: application to a 3D imaging protocol for measuring single cell gene expression in the developing C. elegans embryo
    Zafer Aydin
    John I Murray
    Robert H Waterston
    William S Noble
    BMC Bioinformatics, 11
  • [42] Using machine learning to speed up manual image annotation: application to a 3D imaging protocol for measuring single cell gene expression in the developing C. elegans embryo
    Aydin, Zafer
    Murray, John I.
    Waterston, Robert H.
    Noble, William S.
    BMC BIOINFORMATICS, 2010, 11
  • [43] Chemical feature-based machine learning model for predicting photophysical properties of BODIPY compounds: density functional theory and quantitative structure-property relationship modeling
    Casanola-Martin, Gerardo M.
    Wang, Jing
    Zhou, Jian-ge
    Rasulev, Bakhtiyor
    Leszczynski, Jerzy
    JOURNAL OF MOLECULAR MODELING, 2025, 31 (01)
  • [44] Predicting Post-Operative C. difficile Infection (CDI) With Automated Machine Learning (AutoML) Algorithms Using the American College of Surgeons National Surgical Quality Improvement Program (ACS NSQIP) Database
    Thangirala, Avinash
    Li, Terry
    Abaza, Ehab
    Aphinyanaphongs, Yindalon
    Axelrad, Jordan
    Chen, Ji
    Kelleher, Andrew
    Oeding, Jacob
    Hu, Eric
    Martin, Jacob
    Katz, Gregory
    Brejt, Shelly
    Castillo, Gabriel
    Ostberg, Nicolai
    Kan, Karen
    AMERICAN JOURNAL OF GASTROENTEROLOGY, 2022, 117 (10): : S143 - S144