Establishing machine learning models to predict the early risk of gastric cancer based on lifestyle factors

Cited by: 19
Authors
Afrash, Mohammad Reza [1]
Shafiee, Mohsen [2]
Kazemi-Arpanahi, Hadi [3]
Affiliations
[1] Smart Univ Med Sci, Dept Artificial Intelligence, Tehran, Iran
[2] Abadan Univ Med Sci, Dept Nursing, Abadan, Iran
[3] Abadan Univ Med Sci, Dept Hlth Informat Technol, Abadan, Iran
Keywords
Machine learning; Gastric cancer; Behavioral lifestyle; Prevention; Prognosis; Endoscopic submucosal dissection; Neural network; Decision tree; Surgery
DOI
10.1186/s12876-022-02626-x
Chinese Library Classification (CLC) number
R57 [Diseases of the digestive system and abdomen]
Abstract
Background: Gastric cancer is one of the leading causes of death worldwide. Screening for gastric cancer relies heavily on endoscopy and pathological biopsy, which are invasive and costly. Thus, preventing the disease by modifying lifestyle-related behaviors and dietary habits, or even preventing risk factors from forming, is of great importance. This study aimed to construct an inexpensive, non-invasive, fast, and high-precision diagnostic model using six machine learning (ML) algorithms to classify patients at high or low risk of developing gastric cancer by analyzing individual lifestyle factors.

Methods: This retrospective study used the data of 2029 individuals from the gastric cancer database of Ayatollah Taleghani Hospital in Abadan City, Iran. The data were randomly separated into training and test sets (ratio 0.7:0.3). Six ML methods, including multilayer perceptron (MLP), support vector machine (SVM) (linear kernel), SVM (RBF kernel), k-nearest neighbors (KNN) (K = 1, 3, 7, 9), random forest (RF), and eXtreme Gradient Boosting (XGBoost), were trained to construct prognostic models before and after applying the Relief feature selection method. Finally, to evaluate the models' performance, metrics derived from the confusion matrix were calculated via a test split and cross-validation.

Results: This study found 11 important factors influencing the risk of gastric cancer, including Helicobacter pylori infection, high salt intake, and chronic atrophic gastritis. Comparisons indicated that XGBoost had the best performance for predicting the risk of gastric cancer.

Conclusions: The results suggest that, based on simple baseline patient data, ML techniques have the potential to support prescreening for gastric cancer and to identify high-risk individuals who should proceed with invasive examinations. The model could also considerably reduce the number of cases that need endoscopic surveillance. Future studies are required to validate the efficacy of the models in a larger, multicenter population.
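
Three of the steps named in the Methods (the 0.7:0.3 random split, Relief feature weighting, and metrics derived from the confusion matrix) can be illustrated with a minimal, dependency-free Python sketch. This is not the authors' code. The function names, the 0/1 label encoding, the Manhattan distance, the assumption that features are already scaled to [0, 1], and the choice of the basic binary Relief variant are all assumptions made for illustration; the abstract does not specify them.

```python
import random


def split_train_test(rows, test_ratio=0.3, seed=0):
    """Shuffle rows and return (train, test) at the given ratio (assumed helper)."""
    rng = random.Random(seed)
    shuffled = list(rows)
    rng.shuffle(shuffled)
    n_test = round(len(shuffled) * test_ratio)
    return shuffled[n_test:], shuffled[:n_test]


def relief_weights(X, y, n_iter=None, seed=0):
    """Basic Relief for binary labels.

    Assumes features are scaled to [0, 1] and that each class has at
    least two samples. Higher weights mark features that separate the
    classes; weights near or below zero mark uninformative features.
    """
    rng = random.Random(seed)
    n, d = len(X), len(X[0])
    n_iter = n_iter or n
    weights = [0.0] * d

    def dist(a, b):
        # Manhattan distance (an assumption; other metrics are possible).
        return sum(abs(p - q) for p, q in zip(a, b))

    for _ in range(n_iter):
        i = rng.randrange(n)
        # Nearest neighbor of the same class (hit) and of the other class (miss).
        hit = min((j for j in range(n) if j != i and y[j] == y[i]),
                  key=lambda j: dist(X[i], X[j]))
        miss = min((j for j in range(n) if y[j] != y[i]),
                   key=lambda j: dist(X[i], X[j]))
        for f in range(d):
            diff_miss = abs(X[i][f] - X[miss][f])
            diff_hit = abs(X[i][f] - X[hit][f])
            weights[f] += (diff_miss - diff_hit) / n_iter
    return weights


def confusion_metrics(y_true, y_pred):
    """Metrics derived from a 2x2 confusion matrix (1 = high risk, 0 = low risk)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == 1 and p == 1 for t, p in pairs)
    tn = sum(t == 0 and p == 0 for t, p in pairs)
    fp = sum(t == 0 and p == 1 for t, p in pairs)
    fn = sum(t == 1 and p == 0 for t, p in pairs)

    def ratio(num, den):
        return num / den if den else 0.0

    sensitivity = ratio(tp, tp + fn)
    precision = ratio(tp, tp + fp)
    return {
        "accuracy": ratio(tp + tn, len(pairs)),
        "sensitivity": sensitivity,
        "specificity": ratio(tn, tn + fp),
        "precision": precision,
        "f1": ratio(2 * precision * sensitivity, precision + sensitivity),
    }
```

In such a sketch, the features with the highest Relief weights would be kept before training a classifier, and `confusion_metrics` would be applied to that classifier's predictions on the held-out 30% of records. How the study selected its 11 factors and which metrics it reported are not detailed in the abstract.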
Pages: 13