Machine learning-based colorectal cancer prediction using global dietary data

被引:7
|
作者
Abdul Rahman, Hanif [1 ,2 ]
Ottom, Mohammad Ashraf [1 ,3 ]
Dinov, Ivo D. [1 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48128 USA
[2] Univ Brunei Darussalam, PAPRSB Inst Hlth Sci, Bandar Seri Begawan, Brunei
[3] Yarmouk Univ, Irbid, Jordan
基金
美国国家科学基金会;
关键词
Colorectal cancer; Machine learning; Dietary information; RISK;
D O I
10.1186/s12885-023-10587-x
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
BackgroundColorectal cancer (CRC) is the third most commonly diagnosed cancer worldwide. Active health screening for CRC yielded detection of an increasingly younger adults. However, current machine learning algorithms that are trained using older adults and smaller datasets, may not perform well in practice for large populations.AimTo evaluate machine learning algorithms using large datasets accounting for both younger and older adults from multiple regions and diverse sociodemographics.MethodsA large dataset including 109,343 participants in a dietary-based colorectal cancer ase study from Canada, India, Italy, South Korea, Mexico, Sweden, and the United States was collected by the Center for Disease Control and Prevention. This global dietary database was augmented with other publicly accessible information from multiple sources. Nine supervised and unsupervised machine learning algorithms were evaluated on the aggregated dataset.ResultsBoth supervised and unsupervised models performed well in predicting CRC and non-CRC phenotypes. A prediction model based on an artificial neural network (ANN) was found to be the optimal algorithm with CRC misclassification of 1% and non-CRC misclassification of 3%.ConclusionsANN models trained on large heterogeneous datasets may be applicable for both younger and older adults. Such models provide a solid foundation for building effective clinical decision support systems assisting healthcare providers in dietary-related, non-invasive screening that can be applied in large studies. Using optimal algorithms coupled with high compliance to cancer screening is expected to significantly improve early diagnoses and boost the success rate of timely and appropriate cancer interventions.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Machine learning-based colorectal cancer prediction using global dietary data
    Hanif Abdul Rahman
    Mohammad Ashraf Ottom
    Ivo D. Dinov
    [J]. BMC Cancer, 23
  • [2] Machine learning-based approaches for cancer prediction using microbiome data
    Freitas, Pedro
    Silva, Francisco
    Sousa, Joana Vale
    Ferreira, Rui M.
    Figueiredo, Ceu
    Pereira, Tania
    Oliveira, Helder P.
    [J]. SCIENTIFIC REPORTS, 2023, 13 (01):
  • [3] Machine learning-based approaches for cancer prediction using microbiome data
    Pedro Freitas
    Francisco Silva
    Joana Vale Sousa
    Rui M. Ferreira
    Céu Figueiredo
    Tania Pereira
    Hélder P. Oliveira
    [J]. Scientific Reports, 13 (1)
  • [4] Machine Learning-Based Colorectal Cancer Detection
    Blanes-Vidal, Victoria
    Baatrup, Gunnar
    Nadimi, Esmaeil S.
    [J]. PROCEEDINGS OF THE 2018 CONFERENCE ON RESEARCH IN ADAPTIVE AND CONVERGENT SYSTEMS (RACS 2018), 2018, : 43 - 46
  • [5] Machine Learning-Based Prediction of Cattle Activity Using Sensor-Based Data
    Hernandez, Guillermo
    Gonzalez-Sanchez, Carlos
    Gonzalez-Arrieta, Angelica
    Sanchez-Brizuela, Guillermo
    Fraile, Juan-Carlos
    [J]. SENSORS, 2024, 24 (10)
  • [6] Machine Learning-Based Cellular Traffic Prediction Using Data Reduction Techniques
    Nashaat, Heba
    Mohammed, Nihal H.
    Abdel-Mageid, Salah M.
    Rizk, Rawya Y.
    [J]. IEEE ACCESS, 2024, 12 : 58927 - 58939
  • [7] Machine Learning-Based Prediction of Hemoglobinopathies Using Complete Blood Count Data
    Schipper, Anoeska
    Rutten, Matthieu
    van Gammeren, Adriaan
    Harteveld, Cornelis L.
    Urrechaga, Eloisa
    Weerkamp, Floor
    den Besten, Gijs
    Krabbe, Johannes
    Slomp, Jennichjen
    Schoonen, Lise
    Broeren, Maarten
    van Wijnen, Merel
    Huijskens, Mirelle J. A. J.
    Koopmann, Tamara
    van Ginneken, Bram
    Kusters, Ron
    Kurstjens, Steef
    [J]. CLINICAL CHEMISTRY, 2024, 70 (08) : 1064 - 1075
  • [8] Machine learning-based prediction of diabetic patients using blood routine data
    Li, Honghao
    Su, Dongqing
    Zhang, Xinpeng
    He, Yuanyuan
    Luo, Xu
    Xiong, Yuqiang
    Zou, Min
    Wei, Huiyan
    Wen, Shaoran
    Xi, Qilemuge
    Zuo, Yongchun
    Yang, Lei
    [J]. METHODS, 2024, 229 : 156 - 162
  • [9] Machine learning-based prediction of cancer immunotherapy response using circulating cytokines
    Wei, Feifei
    Azuma, Koichi
    Nakahara, Yoshiro
    Saito, Haruhiro
    Kouro, Taku
    Himuro, Hidetomo
    Horaguchi, Shun
    Tsuji, Kayoko
    Sasada, Tetsuro
    [J]. CANCER SCIENCE, 2023, 114 : 1013 - 1013
  • [10] A Global Machine Learning-Based Scoring Function For Protein Structure Prediction
    Kloczkowski, Andrzej
    Faraggi, Eshel
    [J]. PROTEIN SCIENCE, 2014, 23 : 244 - 244