Establishing machine learning models to predict the early risk of gastric cancer based on lifestyle factors

被引:19
|
作者
Afrash, Mohammad Reza [1 ]
Shafiee, Mohsen [2 ]
Kazemi-Arpanahi, Hadi [3 ]
机构
[1] Smart Univ Med Sci, Dept Artificial Intelligence, Tehran, Iran
[2] Abadan Univ Med Sci, Dept Nursing, Abadan, Iran
[3] Abadan Univ Med Sci, Dept Hlth Informat Technol, Abadan, Iran
关键词
Machine learning; Gastric cancer; Behavioral lifestyle; Prevention; Prognosis; ENDOSCOPIC SUBMUCOSAL DISSECTION; NEURAL-NETWORK; DECISION TREE; PROGNOSIS; SURGERY;
D O I
10.1186/s12876-022-02626-x
中图分类号
R57 [消化系及腹部疾病];
学科分类号
摘要
Background Gastric cancer is one of the leading causes of death worldwide. Screening for gastric cancer greatly relies on endoscopy and pathology biopsy, which are invasive and pose financial burdens. Thus, the prevention of the disease by modifying lifestyle-related behaviors and dietary habits or even the prevention of risk factor formation is of great importance. This study aimed to construct an inexpensive, non-invasive, fast, and high-precision diagnostic model using six machine learning (ML) algorithms to classify patients at high or low risk of developing gastric cancer by analyzing individual lifestyle factors.Methods This retrospective study used the data of 2029 individuals from the gastric cancer database of Ayatollah Taleghani Hospital in Abadan City, Iran. The data were randomly separated into training and test sets (ratio 0.7:0.3). Six ML methods, including multilayer perceptron (MLP), support vector machine (SVM) (linear kernel), SVM (RBF kernel), k-nearest neighbors (KNN) (K = 1, 3, 7, 9), random forest (RF), and eXtreme Gradient Boosting (XGBoost), were trained to construct prognostic models before and after performing the relief feature selection method. Finally, to evaluate the models' performance, the metrics derived from the confusion matrix were calculated via a test split and cross-validation.Results This study found 11 important influence factors for the risk of gastric cancer, such as Helicobacter pylori infection, high salt intake, and chronic atrophic gastritis, among other factors. Comparisons indicated that the XGBoost had the best performance for the risk prediction of gastric cancer.Conclusions The results suggest that based on simple baseline patient data, the ML techniques have the potential to start the prescreening of gastric cancer and identify high-risk individuals who should proceed with invasive examinations. Our model could also considerably lessen the number of cases that need endoscopic surveillance. Future studies are required to validate the efficacy of the models in a larger and multicenter population.
引用
收藏
页数:13
相关论文
共 50 条
  • [41] Risk Factors of Microscopic Invasion in Early Gastric Cancer
    Choi, Jong-Ho
    Suh, Yun-Suhk
    Park, Shin-Hoo
    Kong, Seong-Ho
    Lee, Hyuk-Joon
    Kim, Woo Ho
    Yang, Han-Kwang
    JOURNAL OF GASTRIC CANCER, 2017, 17 (04) : 331 - 341
  • [42] Machine learning techniques in early screening for gastric and oesophageal cancer
    Liu, WZ
    White, AP
    Hallissey, MT
    Fielding, JWL
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 1996, 8 (04) : 327 - 341
  • [43] Machine learning models to predict 6-month mortality risk in home-based hospice patients with advanced cancer
    Cheng, Wan
    Zheng, Jianwei
    Lu, Yuanfeng
    Chen, Guojuan
    Zhu, Zheng
    Wu, Hong
    Wei, Yitao
    Xiao, Huimin
    ASIA-PACIFIC JOURNAL OF ONCOLOGY NURSING, 2025, 12
  • [44] Lifestyle and occupational risks assessment of bladder cancer using machine learning-based prediction models
    Shakhssalim, Naser
    Talebi, Atefeh
    Pahlevan-Fallahy, Mohammad-Taha
    Sotoodeh, Kasra
    Alavimajd, Hamid
    Borumandnia, Nasrin
    Taheri, Maryam
    CANCER REPORTS, 2023, 6 (09)
  • [45] Lung Cancer Risk Prediction with Machine Learning Models
    Dritsas, Elias
    Trigka, Maria
    BIG DATA AND COGNITIVE COMPUTING, 2022, 6 (04)
  • [46] High-risk population for gastric cancer development based on serum pepsinogen status and lifestyle factors
    Yamaji, Yutaka
    Watabe, Hirotsugu
    Kawabe, Takao
    Mitsuhima, Toru
    Omato, Masao
    GASTROENTEROLOGY, 2008, 134 (04) : A482 - A482
  • [47] High-risk Population for Gastric Cancer Development Based on Serum Pepsinogen Status and Lifestyle Factors
    Yamaji, Yutaka
    Watabe, Hirotsugu
    Yoshida, Haruhiko
    Kawabe, Takao
    Wada, Ryoichi
    Mitsushima, Toru
    Omata, Masao
    HELICOBACTER, 2009, 14 (02) : 81 - 86
  • [48] Prediction and analysis of risk factors for diabetic retinopathy based on machine learning and interpretable models
    Wang, Xu
    Wang, Weijie
    Ren, Huiling
    Li, Xiaoying
    Wen, Yili
    HELIYON, 2024, 10 (09)
  • [49] Lifestyle factors associated with gastritis in a restricted area at risk of gastric cancer.
    DeBoni, M
    Bellumat, A
    Bidoli, E
    DeBona, M
    Guido, E
    Russo, A
    Franceschi, S
    Naccarato, R
    GASTROENTEROLOGY, 1996, 110 (04) : A12 - A12
  • [50] EFFECT OF LIFESTYLE FACTORS ON RISK OF EARLY-ONSET COLORECTAL CANCER
    Win, Aung Ko
    Taunde, Sergio A.
    Jayasekara, Harindra
    Buchanan, Daniel D.
    Young, Joanne P.
    Potter, John D.
    Baron, John A.
    Le Marchand, Loic
    Casey, Graham
    Haile, Robert W.
    Lindor, Noralane M.
    Newcomb, Polly A.
    Cotterchio, Michelle
    Gallinger, Steven
    Hopper, John L.
    Jenkins, Mark A.
    ASIA-PACIFIC JOURNAL OF CLINICAL ONCOLOGY, 2014, 10 : 245 - 245