Early diagnosis of pancreatic cancer by machine learning methods using urine biomarker combinations

被引:4
|
作者
Acer, Irem [1 ,3 ]
Bulucu, Firat Orhan [2 ,3 ]
Icer, Semra [3 ]
Latifoglu, Fatma [3 ]
机构
[1] Kutahya Dumlupinar Univ, Dept Biomed Device Technol, Kutahya, Turkiye
[2] Inonu Univ, Dept Biomed Engn, Malatya, Turkiye
[3] Erciyes Univ, Dept Biomed Engn, Kayseri, Turkiye
关键词
Pancreatic cancer; urine biomarker; machine learning; ensemble learning; classification;
D O I
10.55730/1300-0632.3974
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The most common type of pancreatic cancer is pancreatic ductal adenocarcinoma (PDAC), which accounts for the vast majority of pancreatic cancers. The five-year survival rate for PDAC due to late diagnosis is 9%. Early diagnosed PDAC patients survive longer than patients diagnosed at a more advanced stage. Biomarkers can play an essential role in the early detection of PDAC to assist the health professional. Machine learning and deep learning methods are used with biomarkers obtained in recent studies for diagnostic purposes. In order to increase the survival rates of PDAC patients, early diagnosis of the disease with a noninvasive test is a critical need. Our study offers a promising approach for the early detection of PDAC with noninvasive urinary biomarkers and carbohydrate antigen 19-9 (CA19-9). The Kaggle Urinary Biomarkers for Pancreatic Cancer (2020) open-access dataset consisting of 590 participants was used in this study. Seven machine learning classifiers (support vector machine (SVM), naive Bayes (NB), k-nearest neighbors (kNN), random forest (RF), light gradient boosting machine (LightGBM), AdaBoost, and gradient boosting classifier (GBC)) to detect PDAC disease classifier were used. Binary and multiple classification processes were carried out. Data was validated in our study using 5-10-fold crossvalidation. This study aimed to determine the best machine learning model by analyzing the performance of machine learning models in determining the classes of healthy controls, pancreatic disorders, and patients with PDAC. It is a remarkable finding that ensemble learning models were more successful in all our groups. The most successful classification method in classifying healthy controls and patients with PDAC was CV-10, while the GBC (92.99%) model was (AUC = 0.9761). The most successful classification method in classifying patients with pancreatic disorders and PDAC was CV-10, while the LightGBM (86.37%) model was (AUC = 0.9348). In the classification of healthy controls, pancreatic disorders, and patients with PDAC, the most successful classification method was CV-5, while the GBC (72.91%) model was (AUC = 0.8733).
引用
收藏
页码:112 / 125
页数:16
相关论文
共 50 条
  • [21] Machine learning algorithms identify novel biomarker combinations for NAFLD
    Lee, Jenny
    Westphal, Max
    Vali, Yasaman
    Chen, Yu
    Alexander, Leigh
    Boursier, Jerome
    Anstee, Quentin
    Zwinderman, Aeilko
    Bossuyt, Patrick
    JOURNAL OF HEPATOLOGY, 2022, 77 : S74 - S74
  • [22] Biomarker combinations for diagnosis and prognosis in multicenter studies: Principles and methods
    Meisner, Allison
    Parikh, Chirag R.
    Kerr, Kathleen F.
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2019, 28 (04) : 969 - 985
  • [23] Using Machine Learning to Detect Cancer Early
    Schellenberger, Jan
    KDD '20: PROCEEDINGS OF THE 26TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2020, : 3604 - 3604
  • [24] Detecting High-Risk Factors and Early Diagnosis of Diabetes Using Machine Learning Methods
    Ullah, Zahid
    Saleem, Farrukh
    Jamjoom, Mona
    Fakieh, Bahjat
    Kateb, Faris
    Ali, Abdullah Marish
    Shah, Babar
    Computational Intelligence and Neuroscience, 2022, 2022
  • [25] Detecting High-Risk Factors and Early Diagnosis of Diabetes Using Machine Learning Methods
    Ullah, Zahid
    Saleem, Farrukh
    Jamjoom, Mona
    Fakieh, Bahjat
    Kateb, Faris
    Ali, Abdullah Marish
    Shah, Babar
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [26] IBCBML: interpreting breast cancer biomarker using machine learning
    Lamba, Monika
    Munjal, Geetika
    Gigras, Yogita
    HEALTH AND TECHNOLOGY, 2024, 14 (04) : 725 - 746
  • [27] Pancreatic ductal adenocarcinoma metastatic biomarker discovery using machine learning approach
    Mahawan, T.
    Mielgo, A.
    Pornputtapong, N.
    Caamano-Gutierrez, E.
    FEBS OPEN BIO, 2024, 14 : 338 - 338
  • [28] Aided diagnosis methods of breast cancer based on machine learning
    Zhao, Yue
    Wang, Nian
    Cui, Xiaoyu
    2ND ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION SYSTEM AND ARTIFICIAL INTELLIGENCE (ISAI2017), 2017, 887
  • [29] Explainable machine learning models for early gastric cancer diagnosis
    Du, Hongyang
    Yang, Qingfen
    Ge, Aimin
    Zhao, Chenhao
    Ma, Yunhua
    Wang, Shuyu
    SCIENTIFIC REPORTS, 2024, 14 (01):
  • [30] Machine learning and multiparametric MRI for early diagnosis of prostate cancer
    Bonekamp, D.
    Schlemmer, H. -P.
    UROLOGE, 2021, 60 (05): : 576 - 591