Machine Learning Approach to Metabolomic Data Predicts Type 2 Diabetes Mellitus Incidence

被引:0
|
作者
Leiherer, Andreas [1 ,2 ,3 ]
Muendlein, Axel [1 ]
Mink, Sylvia [2 ,3 ]
Mader, Arthur [1 ,4 ]
Saely, Christoph H. [1 ,3 ,4 ]
Festa, Andreas [1 ]
Fraunberger, Peter [2 ,3 ]
Drexel, Heinz [1 ,3 ,5 ,6 ]
机构
[1] Vorarlberg Inst Vasc Invest & Treatment VIVIT, A-6800 Feldkirch, Austria
[2] Cent Med Labs, A-6800 Feldkirch, Austria
[3] Private Univ Principal Liechtenstein, Fac Med Sci, FL-9495 Triesen, Liechtenstein
[4] Acad Teaching Hosp Feldkirch, Dept Internal Med 3, A-6800 Feldkirch, Austria
[5] Acad Teaching Hosp Feldkirch, Vorarlberger Landeskrankenhausbetriebsgesell, A-6800 Feldkirch, Austria
[6] Drexel Univ, Coll Med, Philadelphia, PA 19129 USA
关键词
ML; machine learning; artificial intelligence; diabetes; incidence; metabolomics; support vector machine; accuracy; CERAMIDES; MODEL;
D O I
10.3390/ijms25105331
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Metabolomics, with its wealth of data, offers a valuable avenue for enhancing predictions and decision-making in diabetes. This observational study aimed to leverage machine learning (ML) algorithms to predict the 4-year risk of developing type 2 diabetes mellitus (T2DM) using targeted quantitative metabolomics data. A cohort of 279 cardiovascular risk patients who underwent coronary angiography and who were initially free of T2DM according to American Diabetes Association (ADA) criteria was analyzed at baseline, including anthropometric data and targeted metabolomics, using liquid chromatography (LC)-mass spectroscopy (MS) and flow injection analysis (FIA)-MS, respectively. All patients were followed for four years. During this time, 11.5% of the patients developed T2DM. After data preprocessing, 362 variables were used for ML, employing the Caret package in R. The dataset was divided into training and test sets (75:25 ratio) and we used an oversampling approach to address the classifier imbalance of T2DM incidence. After an additional recursive feature elimination step, identifying a set of 77 variables that were the most valuable for model generation, a Support Vector Machine (SVM) model with a linear kernel demonstrated the most promising predictive capabilities, exhibiting an F1 score of 50%, a specificity of 93%, and balanced and unbalanced accuracies of 72% and 88%, respectively. The top-ranked features were bile acids, ceramides, amino acids, and hexoses, whereas anthropometric features such as age, sex, waist circumference, or body mass index had no contribution. In conclusion, ML analysis of metabolomics data is a promising tool for identifying individuals at risk of developing T2DM and opens avenues for personalized and early intervention strategies.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] A Machine-Learning Approach on Metabolomic Data to Predict Type 2 Diabetes Mellitus Incidence
    Leiherer, Andreas
    Muendlein, Axel
    Saely, Christoph H.
    Plattner, Thomas
    Larcher, Barbara
    Mader, Arthur
    Vonbank, Alexander
    Laaksonen, Reijo
    Fraunberger, Peter
    Drexel, Heinz
    DIABETES, 2024, 73
  • [2] Body fat predicts exercise capacity in persons with Type 2 Diabetes Mellitus: A machine learning approach
    Nath, Tanmay
    Ahima, Rexford S.
    Santhanam, Prasanna
    PLOS ONE, 2021, 16 (03):
  • [3] Metabolomic Selection in the Progression of Type 2 Diabetes Mellitus: A Genetic Algorithm Approach
    Morgan-Benita, Jorge
    Sanchez-Reyna, Ana G.
    Espino-Salinas, Carlos H.
    Jose Oropeza-Valdez, Juan
    Luna-Garcia, Huizilopoztli
    Galvan-Tejada, Carlos E.
    Galvan-Tejada, Jorge, I
    Gamboa-Rosales, Hamurabi
    Antonio Enciso-Moreno, Jose
    Celaya-Padilla, Jose
    DIAGNOSTICS, 2022, 12 (11)
  • [4] Integrating metabolomic data with machine learning approach for discovery of Q-markers from Jinqi Jiangtang preparation against type 2 diabetes
    Lele Yang
    Yan Xue
    Jinchao Wei
    Qi Dai
    Peng Li
    Chinese Medicine, 16
  • [5] Prediction of Diabetes Mellitus Type-2 Using Machine Learning
    Apoorva, S.
    Aditya, K. S.
    Snigdha, P.
    Darshini, P.
    Sanjay, H. A.
    COMPUTATIONAL VISION AND BIO-INSPIRED COMPUTING, 2020, 1108 : 364 - 370
  • [6] Integrating metabolomic data with machine learning approach for discovery of Q-markers from Jinqi Jiangtang preparation against type 2 diabetes
    Yang, Lele
    Xue, Yan
    Wei, Jinchao
    Dai, Qi
    Li, Peng
    CHINESE MEDICINE, 2021, 16 (01)
  • [7] Understanding Type 2 Diabetes Mellitus Risk Parameters through Intermittent Fasting: A Machine Learning Approach
    Shazman, Shula
    NUTRIENTS, 2023, 15 (18)
  • [8] The incidence of type 2 diabetes mellitus in Taiwan
    Tseng, CH
    Chong, CK
    Heng, LT
    Tseng, CP
    Tai, TY
    DIABETES RESEARCH AND CLINICAL PRACTICE, 2000, 50 : S61 - S64
  • [9] Prediction and Diagnosis of Diabetes Mellitus -A Machine Learning Approach
    Vijayan, Veena V.
    Anjali, C.
    PROCEEDINGS OF THE 2015 IEEE RECENT ADVANCES IN INTELLIGENT COMPUTATIONAL SYSTEMS (RAICS), 2015, : 122 - 127
  • [10] Prediction of complications of type 2 Diabetes: A Machine learning approach
    Nicolucci, Antonio
    Romeo, Luca
    Bernardini, Michele
    Vespasiani, Marco
    Rossi, Maria Chiara
    Petrelli, Massimiliano
    Ceriello, Antonio
    Di Bartolo, Paolo
    Frontoni, Emanuele
    Vespasiani, Giacomo
    DIABETES RESEARCH AND CLINICAL PRACTICE, 2022, 190