Early diagnosis of pancreatic cancer by machine learning methods using urine biomarker combinations

被引:4
|
作者
Acer, Irem [1 ,3 ]
Bulucu, Firat Orhan [2 ,3 ]
Icer, Semra [3 ]
Latifoglu, Fatma [3 ]
机构
[1] Kutahya Dumlupinar Univ, Dept Biomed Device Technol, Kutahya, Turkiye
[2] Inonu Univ, Dept Biomed Engn, Malatya, Turkiye
[3] Erciyes Univ, Dept Biomed Engn, Kayseri, Turkiye
关键词
Pancreatic cancer; urine biomarker; machine learning; ensemble learning; classification;
D O I
10.55730/1300-0632.3974
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The most common type of pancreatic cancer is pancreatic ductal adenocarcinoma (PDAC), which accounts for the vast majority of pancreatic cancers. The five-year survival rate for PDAC due to late diagnosis is 9%. Early diagnosed PDAC patients survive longer than patients diagnosed at a more advanced stage. Biomarkers can play an essential role in the early detection of PDAC to assist the health professional. Machine learning and deep learning methods are used with biomarkers obtained in recent studies for diagnostic purposes. In order to increase the survival rates of PDAC patients, early diagnosis of the disease with a noninvasive test is a critical need. Our study offers a promising approach for the early detection of PDAC with noninvasive urinary biomarkers and carbohydrate antigen 19-9 (CA19-9). The Kaggle Urinary Biomarkers for Pancreatic Cancer (2020) open-access dataset consisting of 590 participants was used in this study. Seven machine learning classifiers (support vector machine (SVM), naive Bayes (NB), k-nearest neighbors (kNN), random forest (RF), light gradient boosting machine (LightGBM), AdaBoost, and gradient boosting classifier (GBC)) to detect PDAC disease classifier were used. Binary and multiple classification processes were carried out. Data was validated in our study using 5-10-fold crossvalidation. This study aimed to determine the best machine learning model by analyzing the performance of machine learning models in determining the classes of healthy controls, pancreatic disorders, and patients with PDAC. It is a remarkable finding that ensemble learning models were more successful in all our groups. The most successful classification method in classifying healthy controls and patients with PDAC was CV-10, while the GBC (92.99%) model was (AUC = 0.9761). The most successful classification method in classifying patients with pancreatic disorders and PDAC was CV-10, while the LightGBM (86.37%) model was (AUC = 0.9348). In the classification of healthy controls, pancreatic disorders, and patients with PDAC, the most successful classification method was CV-5, while the GBC (72.91%) model was (AUC = 0.8733).
引用
收藏
页码:112 / 125
页数:16
相关论文
共 50 条
  • [31] High-performance Collective Biomarker from Liquid Biopsy for Diagnosis of Pancreatic Cancer Based on Mass Spectrometry and Machine Learning
    Iwano, Tomohiko
    Yoshimura, Kentaro
    Watanabe, Genki
    Saito, Ryo
    Kiritani, Sho
    Kawaida, Hiromichi
    Moriguchi, Takeshi
    Murata, Tasuku
    Ogata, Koretsugu
    Ichikawa, Daisuke
    Arita, Junichi
    Hasegawa, Kiyoshi
    Sen Takeda
    JOURNAL OF CANCER, 2021, 12 (24): : 7477 - 7487
  • [32] EARLY DIAGNOSIS OF PANCREATIC CANCER
    DREILING, DA
    SCANDINAVIAN JOURNAL OF GASTROENTEROLOGY, 1970, 5 : 115 - &
  • [33] Early diagnosis of pancreatic cancer
    Furukawa, H
    Okada, S
    Kakizoe, T
    HEPATO-GASTROENTEROLOGY, 1999, 46 (25) : 4 - 7
  • [34] Circulating miRNAs as biomarker for the diagnosis of pancreatic cancer
    Du, Yiqi
    Li, Zhaoshen
    JOURNAL OF GASTROENTEROLOGY AND HEPATOLOGY, 2013, 28 : 676 - 677
  • [35] Studying the impact of marital status on diagnosis and survival prediction in pancreatic ductal carcinoma using machine learning methods
    Qingquan Chen
    Yiming Hu
    Wen Lin
    Zhimin Huang
    Jiaxin Li
    Haibin Lu
    Rongrong Dai
    Liuxia You
    Scientific Reports, 14
  • [36] Fault diagnosis of ball bearings using machine learning methods
    Kankar, P. K.
    Sharma, Satish C.
    Harsha, S. P.
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (03) : 1876 - 1886
  • [37] DIAGNOSIS OF THE DISEASES USING RESAMPLING METHODS WITH MACHINE LEARNING ALGORITHMS
    Celik, Ahmet
    COMPTES RENDUS DE L ACADEMIE BULGARE DES SCIENCES, 2023, 76 (07): : 1065 - 1076
  • [38] Fault Diagnosis of Batch Reactor Using Machine Learning Methods
    Subramanian, Sujatha
    Ghouse, Fathima
    Natarajan, Pappa
    MODELLING AND SIMULATION IN ENGINEERING, 2014, 2014 (2014)
  • [39] Urine Analysed by FTIR, Chemometrics and Machine Learning Methods in Determination Spectroscopy Marker of Prostate Cancer in Urine
    Mitura, Przemyslaw
    Paja, Wieslaw
    Klebowski, Bartosz
    Plaza, Pawel
    Bar, Krzyszof
    Mlynarczyk, Grzegorz
    Depciuch, Joanna
    JOURNAL OF BIOPHOTONICS, 2025, 18 (01)
  • [40] Diagnosis of skin cancer using machine learning techniques
    Murugan, A.
    Nair, S. Anu H.
    Preethi, A. Angelin Peace
    Kumar, K. P. Sanal
    MICROPROCESSORS AND MICROSYSTEMS, 2021, 81