A cost-effective machine learning-based method for preeclampsia risk assessment and driver genes discovery

Cited by: 15
Authors
Wang, Hao [1, 2]
Zhang, Zhaoyue [3]
Li, Haicheng [1, 2]
Li, Jinzhao [1]
Li, Hanshuang [1]
Liu, Mingzhu [1, 2]
Liang, Pengfei [1]
Xi, Qilemuge [1]
Xing, Yongqiang [4]
Yang, Lei [5]
Zuo, Yongchun [1, 2]
Affiliations
[1] Inner Mongolia Univ, Coll Life Sci, State Key Lab Reprod Regulat & Breeding Grassland, Hohhot 010070, Peoples R China
[2] Inner Mongolia Wesure Date Technol Co Ltd, Inner Mongolia Intelligent Union Big Data Acad, Digital Coll, Hohhot 010010, Peoples R China
[3] Univ Elect Sci & Technol China, Ctr Informat Biol, Sch Life Sci & Technol, Chengdu 610054, Peoples R China
[4] Inner Mongolia Univ Sci & Technol, Sch Life Sci & Technol, Baotou 014010, Peoples R China
[5] Harbin Med Univ, Coll Bioinformat Sci & Technol, Harbin 150081, Peoples R China
Source
CELL AND BIOSCIENCE | 2023 / Volume 13 / Issue 01
Keywords
Preeclampsia risk; Machine learning; Feature selection; Marker genes; Web server; SINGLE-CELL; CANCER CLASSIFICATION; DIFFERENTIATION; EXPRESSION; IDENTIFICATION; PREDICTION;
DOI
10.1186/s13578-023-00991-y
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010; 081704;
Abstract
Background: The placenta, as a unique exchange organ between mother and fetus, is essential for successful human pregnancy and fetal health. Preeclampsia (PE) caused by placental dysfunction contributes to both maternal and infant morbidity and mortality. Accurate identification of PE patients plays a vital role in the formulation of treatment plans. However, traditional clinical methods for diagnosing PE have a high misdiagnosis rate.
Results: Here, we first designed a computational biology method that used single-cell transcriptome (scRNA-seq) data from healthy pregnancy (38 wk) and early-onset PE (28-32 wk) to identify pathological cell subpopulations and predict PE risk. Comparing machine learning methods and feature selection techniques, we observed that the Tuning ReliefF (TURF) score combined with XGBoost (TURF_XGB) achieved optimal performance, with 92.61% accuracy and 92.46% recall for classifying nine cell subpopulations of healthy placentas. The biological landscape of placental heterogeneity could be mapped by the 110 marker genes screened by TURF_XGB, which demonstrated the superiority of TURF feature mining. Moreover, we processed the PE dataset with LASSO to obtain 497 biomarkers. Integrated analysis of these two gene sets revealed that dendritic cells were closely associated with early-onset PE, and that C1QB and C1QC might drive preeclampsia by mediating inflammation. In addition, an ensemble model-based risk stratification card was developed to classify preeclampsia patients, and its area under the receiver operating characteristic curve (AUC) reached 0.99. For broader accessibility, we designed an online web server ().
Conclusion: Single-cell transcriptome-based preeclampsia risk assessment using an ensemble machine learning framework is a valuable asset for clinical decision-making. C1QB and C1QC may be involved in the development and progression of early-onset PE by affecting the complement and coagulation cascades pathway, which mediates inflammation. This has important implications for better understanding the pathogenesis of PE.
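The abstract reports three evaluation metrics: accuracy and recall for the nine-class cell-subpopulation classifier, and AUC for the binary risk model. As a minimal sketch of how such metrics are typically computed, the Python below uses only the standard library. The class labels, scores, and function names are hypothetical toy values, not data or code from the study. Macro-averaging of recall is an assumption, since the abstract does not state the averaging scheme.

```python
from collections import defaultdict


def accuracy(y_true, y_pred):
    """Fraction of samples whose predicted label equals the true label."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def macro_recall(y_true, y_pred):
    """Unweighted mean of per-class recall (assumed averaging scheme)."""
    hits = defaultdict(int)
    totals = defaultdict(int)
    for t, p in zip(y_true, y_pred):
        totals[t] += 1
        if t == p:
            hits[t] += 1
    return sum(hits[c] / totals[c] for c in totals) / len(totals)


def roc_auc(labels, scores):
    """Binary AUC as the probability that a random positive outscores a
    random negative, counting ties as one half (Mann-Whitney formulation)."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Toy data only: hypothetical labels and scores, not values from the study.
cell_true = ["type_a", "type_a", "type_b", "type_b", "type_c", "type_c"]
cell_pred = ["type_a", "type_b", "type_b", "type_b", "type_c", "type_c"]
risk_label = [1, 1, 1, 0, 0, 0]  # 1 = PE, 0 = healthy (hypothetical)
risk_score = [0.9, 0.8, 0.4, 0.5, 0.2, 0.1]

if __name__ == "__main__":
    print(f"accuracy     = {accuracy(cell_true, cell_pred):.4f}")
    print(f"macro recall = {macro_recall(cell_true, cell_pred):.4f}")
    print(f"AUC          = {roc_auc(risk_label, risk_score):.4f}")
```

The pairwise AUC loop is quadratic in the number of samples, which is fine for illustration; a rank-based implementation would be preferable for datasets at single-cell scale.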
Pages: 12
Related papers
50 in total
  • [1] A cost-effective machine learning-based method for preeclampsia risk assessment and driver genes discovery
    Hao Wang
    Zhaoyue Zhang
    Haicheng Li
    Jinzhao Li
    Hanshuang Li
    Mingzhu Liu
    Pengfei Liang
    Qilemuge Xi
    Yongqiang Xing
    Lei Yang
    Yongchun Zuo
    Cell & Bioscience, 13
  • [2] Cost-Effective Machine Learning-based Localization Algorithm for WSNs
    Singh, Omkar
    Vinoth, R.
    Singh, Navanendra
    Singh, Abhilasha
    INTERNATIONAL JOURNAL OF EARLY CHILDHOOD SPECIAL EDUCATION, 2022, 14 (02) : 7093 - 7105
  • [3] Machine Learning-Based Method for Personalized and Cost-Effective Detection of Alzheimer's Disease
    Escudero, Javier
    Ifeachor, Emmanuel
    Zajicek, John P.
    Green, Colin
    Shearer, James
    Pearson, Stephen
    IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2013, 60 (01) : 164 - 168
  • [4] A Machine Learning-based Method for Cyber Risk Assessment
    Rafaiani, Giulia
    Battaglioni, Massimo
    Compagnoni, Simone
    Senigagliesi, Linda
    Chiaraluce, Franco
    Baldi, Marco
    2023 IEEE 36TH INTERNATIONAL SYMPOSIUM ON COMPUTER-BASED MEDICAL SYSTEMS, CBMS, 2023, : 263 - 268
  • [5] Machine Learning-Based Cost-Effective Smart Home Data Analysis and Forecasting for Energy Saving
    Park, Sanguk
    BUILDINGS, 2023, 13 (09)
  • [6] A Machine Learning-Based Prediction Model for Cardiovascular Risk in Women With Preeclampsia
    Wang, Guan
    Zhang, Yanbo
    Li, Sijin
    Zhang, Jun
    Jiang, Dongkui
    Li, Xiuzhen
    Li, Yulin
    Du, Jie
    FRONTIERS IN CARDIOVASCULAR MEDICINE, 2021, 8
  • [7] Deep Learning-Based Cost-Effective and Responsive Robot for Autism Treatment
    Singh, Aditya
    Raj, Kislay
    Kumar, Teerath
    Verma, Swapnil
    Roy, Arunabha M.
    DRONES, 2023, 7 (02)
  • [8] A cost-effective, machine learning-based new unified risk-classification score (NU-CATS) for patients with endometrial cancer
    Zheng, Shuhua
    Wu, Yilin
    Donnelly, Eric D.
    Strauss, Jonathan B.
    GYNECOLOGIC ONCOLOGY, 2023, 175 : 97 - 106
  • [9] A Cost-Effective, Machine Learning-Based New Unified Risk-Classification Score (NU-CATS) for Patients with Endometrial Cancer
    Zheng, S.
    Donnelly, E. D.
    Strauss, J. B.
    INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2023, 117 (02) : S9 - S9
  • [10] IoT-Pi: A machine learning-based lightweight framework for cost-effective distributed computing using IoT
    Shao, Tianchen
    Chowdhury, Deepraj
    Gill, Sukhpal Singh
    Buyya, Rajkumar
    INTERNET TECHNOLOGY LETTERS, 2022, 5 (03)