Establishing machine learning models to predict the early risk of gastric cancer based on lifestyle factors

被引:19
|
作者
Afrash, Mohammad Reza [1 ]
Shafiee, Mohsen [2 ]
Kazemi-Arpanahi, Hadi [3 ]
机构
[1] Smart Univ Med Sci, Dept Artificial Intelligence, Tehran, Iran
[2] Abadan Univ Med Sci, Dept Nursing, Abadan, Iran
[3] Abadan Univ Med Sci, Dept Hlth Informat Technol, Abadan, Iran
关键词
Machine learning; Gastric cancer; Behavioral lifestyle; Prevention; Prognosis; ENDOSCOPIC SUBMUCOSAL DISSECTION; NEURAL-NETWORK; DECISION TREE; PROGNOSIS; SURGERY;
D O I
10.1186/s12876-022-02626-x
中图分类号
R57 [消化系及腹部疾病];
学科分类号
摘要
Background Gastric cancer is one of the leading causes of death worldwide. Screening for gastric cancer greatly relies on endoscopy and pathology biopsy, which are invasive and pose financial burdens. Thus, the prevention of the disease by modifying lifestyle-related behaviors and dietary habits or even the prevention of risk factor formation is of great importance. This study aimed to construct an inexpensive, non-invasive, fast, and high-precision diagnostic model using six machine learning (ML) algorithms to classify patients at high or low risk of developing gastric cancer by analyzing individual lifestyle factors.Methods This retrospective study used the data of 2029 individuals from the gastric cancer database of Ayatollah Taleghani Hospital in Abadan City, Iran. The data were randomly separated into training and test sets (ratio 0.7:0.3). Six ML methods, including multilayer perceptron (MLP), support vector machine (SVM) (linear kernel), SVM (RBF kernel), k-nearest neighbors (KNN) (K = 1, 3, 7, 9), random forest (RF), and eXtreme Gradient Boosting (XGBoost), were trained to construct prognostic models before and after performing the relief feature selection method. Finally, to evaluate the models' performance, the metrics derived from the confusion matrix were calculated via a test split and cross-validation.Results This study found 11 important influence factors for the risk of gastric cancer, such as Helicobacter pylori infection, high salt intake, and chronic atrophic gastritis, among other factors. Comparisons indicated that the XGBoost had the best performance for the risk prediction of gastric cancer.Conclusions The results suggest that based on simple baseline patient data, the ML techniques have the potential to start the prescreening of gastric cancer and identify high-risk individuals who should proceed with invasive examinations. Our model could also considerably lessen the number of cases that need endoscopic surveillance. Future studies are required to validate the efficacy of the models in a larger and multicenter population.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Establishing machine learning models to predict the early risk of gastric cancer based on lifestyle factors
    Mohammad Reza Afrash
    Mohsen Shafiee
    Hadi Kazemi-Arpanahi
    BMC Gastroenterology, 23
  • [2] Establishing Machine Learning Models to Predict Curative Resection in Early Gastric Cancer with Undifferentiated Histology: Development and Usability Study
    Bang, Chang Seok
    Ahn, Ji Yong
    Kim, Jie-Hyun
    Kim, Young-Il
    Choi, Il Ju
    Shin, Woon Geon
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (04)
  • [3] A retrospective analysis based on multiple machine learning models to predict lymph node metastasis in early gastric cancer
    Yang, Tao
    Martinez-Useros, Javier
    Liu, JingWen
    Alarcon, Isaias
    Li, Chao
    Li, WeiYao
    Xiao, Yuanxun
    Ji, Xiang
    Zhao, YanDong
    Wang, Lei
    Morales-Conde, Salvador
    Yang, Zuli
    FRONTIERS IN ONCOLOGY, 2022, 12
  • [4] Critical Analysis of Risk Factors and Machine-Learning-Based Gastric Cancer Risk Prediction Models: A Systematic Review
    Fan, Zeyu
    He, Ziju
    Miao, Wenjun
    Huang, Rongrong
    PROCESSES, 2023, 11 (08)
  • [5] Machine learning models to predict submucosal invasion in early gastric cancer based on endoscopy features and standardized color metrics
    Chen, Keyan
    Wang, Ye
    Lang, Yanfei
    Yang, Linjian
    Guo, Zhijun
    Wu, Wei
    Zhang, Jing
    Ding, Shigang
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [6] Explainable machine learning models for early gastric cancer diagnosis
    Du, Hongyang
    Yang, Qingfen
    Ge, Aimin
    Zhao, Chenhao
    Ma, Yunhua
    Wang, Shuyu
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [7] Editorial: machine learning models for gastric cancer risk prediction
    Spaander, Manon C. W.
    Kuipers, Ernst J.
    ALIMENTARY PHARMACOLOGY & THERAPEUTICS, 2021, 53 (08) : 943 - 944
  • [8] Environmental and Lifestyle Risk Factors of Gastric Cancer
    Lee, Yeong Yeh
    Derakhshan, Mohammad H.
    ARCHIVES OF IRANIAN MEDICINE, 2013, 16 (06) : 358 - 365
  • [9] Machine learning based models for predicting presentation delay risk among gastric cancer patients
    Zhou, Huali
    Gu, Qiong
    Bao, Rong
    Qiu, Liping
    Zhang, Yuhan
    Wang, Fang
    Liu, Wenlian
    Wu, Lingling
    Li, Li
    Ren, Yihua
    Qiu, Lei
    Wang, Qian
    Zhang, Gaomin
    Qiao, Xiaoqing
    Yuan, Wenjie
    Ren, Juan
    Luo, Min
    Huang, Rong
    Yang, Qing
    FRONTIERS IN ONCOLOGY, 2025, 14
  • [10] Noninvasive predictive models based on lifestyle analysis and risk factors for early-onset colorectal cancer
    Deng, Jia-Wen
    Zhou, Yi-Lu
    Dai, Wei-Xing
    Chen, Hui-Min
    Zhou, Cheng-Bei
    Zhu, Chun-Qi
    Ma, Xin-Yue
    Pan, Si-Yuan
    Cui, Yun
    Xu, Jia
    Zhao, En-Hao
    Wang, Ming
    Chen, Jin-Xian
    Wang, Zheng
    Liu, Qiang
    Wang, Ji-Lin
    Cai, Guo-Xiang
    Chen, Ying-Xuan
    Fang, Jing-Yuan
    JOURNAL OF GASTROENTEROLOGY AND HEPATOLOGY, 2023, 38 (10) : 1768 - 1777