Early diagnosis of pancreatic cancer by machine learning methods using urine biomarker combinations

被引:4
|
作者
Acer, Irem [1 ,3 ]
Bulucu, Firat Orhan [2 ,3 ]
Icer, Semra [3 ]
Latifoglu, Fatma [3 ]
机构
[1] Kutahya Dumlupinar Univ, Dept Biomed Device Technol, Kutahya, Turkiye
[2] Inonu Univ, Dept Biomed Engn, Malatya, Turkiye
[3] Erciyes Univ, Dept Biomed Engn, Kayseri, Turkiye
关键词
Pancreatic cancer; urine biomarker; machine learning; ensemble learning; classification;
D O I
10.55730/1300-0632.3974
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The most common type of pancreatic cancer is pancreatic ductal adenocarcinoma (PDAC), which accounts for the vast majority of pancreatic cancers. The five-year survival rate for PDAC due to late diagnosis is 9%. Early diagnosed PDAC patients survive longer than patients diagnosed at a more advanced stage. Biomarkers can play an essential role in the early detection of PDAC to assist the health professional. Machine learning and deep learning methods are used with biomarkers obtained in recent studies for diagnostic purposes. In order to increase the survival rates of PDAC patients, early diagnosis of the disease with a noninvasive test is a critical need. Our study offers a promising approach for the early detection of PDAC with noninvasive urinary biomarkers and carbohydrate antigen 19-9 (CA19-9). The Kaggle Urinary Biomarkers for Pancreatic Cancer (2020) open-access dataset consisting of 590 participants was used in this study. Seven machine learning classifiers (support vector machine (SVM), naive Bayes (NB), k-nearest neighbors (kNN), random forest (RF), light gradient boosting machine (LightGBM), AdaBoost, and gradient boosting classifier (GBC)) to detect PDAC disease classifier were used. Binary and multiple classification processes were carried out. Data was validated in our study using 5-10-fold crossvalidation. This study aimed to determine the best machine learning model by analyzing the performance of machine learning models in determining the classes of healthy controls, pancreatic disorders, and patients with PDAC. It is a remarkable finding that ensemble learning models were more successful in all our groups. The most successful classification method in classifying healthy controls and patients with PDAC was CV-10, while the GBC (92.99%) model was (AUC = 0.9761). The most successful classification method in classifying patients with pancreatic disorders and PDAC was CV-10, while the LightGBM (86.37%) model was (AUC = 0.9348). In the classification of healthy controls, pancreatic disorders, and patients with PDAC, the most successful classification method was CV-5, while the GBC (72.91%) model was (AUC = 0.8733).
引用
收藏
页码:112 / 125
页数:16
相关论文
共 50 条
  • [1] Using Machine Learning Methods in Early Diagnosis of Breast Cancer
    Erkal, Begum
    Ayyildiz, Tulin Ercelebi
    TIP TEKNOLOJILERI KONGRESI (TIPTEKNO'21), 2021,
  • [2] Machine Learning for Diagnosis of Pancreatic Ductal Adenocarcinoma Using Urine Samples
    Bhalla, Harshit
    Kumar, Pravir
    ADVANCES IN DIGITAL HEALTH AND MEDICAL BIOENGINEERING, VOL 1, EHB-2023, 2024, 109 : 377 - 384
  • [3] Early lung cancer diagnostic biomarker discovery by machine learning methods
    Xie, Ying
    Meng, Wei-Yu
    Li, Run-Ze
    Wang, Yu-Wei
    Qian, Xin
    Chan, Chang
    Yu, Zhi-Fang
    Fan, Xing-Xing
    Pan, Hu-Dan
    Xie, Chun
    Wu, Qi-Biao
    Yan, Pei-Yu
    Liu, Liang
    Tang, Yi-Jun
    Yao, Xiao-Jun
    Wang, Mei-Fang
    Leung, Elaine Lai-Han
    TRANSLATIONAL ONCOLOGY, 2021, 14 (01):
  • [4] Modelling the benefits of early diagnosis of pancreatic cancer using a biomarker signature
    Ghatnekar, Ola
    Andersson, Roland
    Svensson, Marianne
    Persson, Ulf
    Ringdahl, Ulrika
    Zeilon, Paula
    Borrebaeck, Carl A. K.
    INTERNATIONAL JOURNAL OF CANCER, 2013, 133 (10) : 2392 - 2397
  • [5] Identification of a serum proteomic biomarker panel using diagnosis specific ensemble learning and symptoms for early pancreatic cancer detection
    Ney, Alexander
    Nene, Nuno R.
    Sedlak, Eva
    Acedo, Pilar
    Blyuss, Oleg
    Whitwell, Harry J.
    Costello, Eithne
    Gentry-Maharaj, Aleksandra
    Williams, Norman R.
    Menon, Usha
    Fusai, Giuseppe K.
    Zaikin, Alexey
    Pereira, Stephen P.
    PLOS COMPUTATIONAL BIOLOGY, 2024, 20 (08)
  • [6] Advance in MicroRNAs as an Effective Biomarker for Early Diagnosis of Pancreatic Cancer
    Huang, J.
    Xiao, G. G.
    PANCREAS, 2017, 46 (07) : 959 - 960
  • [7] Diagnosis of prostate cancer in a Chinese population by using machine learning methods
    Wang, Guanjin
    Teoh, Jeremy Yuen-Chun
    Choi, Kup-Sze
    2018 40TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2018, : 3971 - 3974
  • [8] Classification of Urine Odour Using Machine Learning Methods
    Xing, Yuxin
    Gardner, Julian W.
    2022 IEEE INTERNATIONAL SYMPOSIUM ON OLFACTION AND ELECTRONIC NOSE (ISOEN 2022), 2022,
  • [9] Early Diagnosis of Diabetes: A Comparison of Machine Learning Methods
    Alzboon, Mowafaq Salem
    Al-Batah, Mohammad Subhi
    Alqaraleh, Muhyeeddin
    Abuashour, Ahmad
    Bader, Ahmad Fuad Hamadah
    INTERNATIONAL JOURNAL OF ONLINE AND BIOMEDICAL ENGINEERING, 2023, 19 (15) : 144 - 165
  • [10] Advancements in Pancreatic Cancer Detection: Integrating Biomarkers, Imaging Technologies, and Machine Learning for Early Diagnosis
    Daher, Hisham
    Punchayil, Sneha A.
    Ismail, Amro Ahmed Elbeltagi
    Fernandes, Reuben Ryan
    Jacob, Joel
    Algazzar, Mohab H.
    Mansour, Mohammad
    CUREUS JOURNAL OF MEDICAL SCIENCE, 2024, 16 (03)