A data-driven approach for prioritising microbial and chemical hazards associated with dairy products using open-source databases

Cited by: 2
Authors
Talari, Gopaiah [1, 2]
Nag, Rajat [2]
O'Brien, John [1]
McNamara, Cronan [1]
Cummins, Enda [2]
Affiliations
[1] Creme Global, Trinity Technol & Enterprise Campus, 4th Floor, Design Tower, Grand Canal Quay, Dublin 2, Ireland
[2] Univ Coll Dublin, Sch Biosyst & Food Engn, Belfield, Dublin, Ireland
Keywords
Food safety alerts; Chemical contaminants; Microbial contaminants; Risk ranking; RASFF and GEMS; Machine learning; FOOD SAFETY HAZARDS; TRACEABILITY
DOI
10.1016/j.scitotenv.2023.168456
Chinese Library Classification (CLC) number
X [Environmental science, safety science]
Discipline classification code
08; 0830
Abstract
This study presents a data-driven approach for classifying food safety alerts related to chemical and microbial contaminants in dairy products using the Rapid Alert System for Food and Feed (RASFF) and the World Health Organization (WHO)'s Global Environmental Monitoring System (GEMS) food contaminants databases. The research aimed to prioritise microbial and chemical hazards based on their presence and severity through exploratory data analysis, and to classify the severity of chemical hazards using machine learning (ML) approaches. It identified Listeria monocytogenes, Escherichia coli, Salmonella, Pseudomonas spp., Staphylococcus spp., Bacillus cereus, Clostridium spp., and Cronobacter sakazakii as the priority microbial hazards in dairy products. The study also prioritised the top ten chemical hazards based on their presence and severity: nitrate, nitrite, ergocornine, 3-MCPD ester, lead, arsenic, ochratoxin A, cadmium, mercury, and aflatoxin (G1, B1, G2, B2, G5 and M1). The ML models classified food safety alerts as either 'serious' or 'non-serious' with an accuracy of up to 98%. The study identified reference dose (RfD), substance amount, notification type, product, and substance as the most important features affecting the ML models' performance. The models (decision trees, random forests, k-nearest neighbors, linear discriminant analysis, and support vector machines) were also validated on an external dataset of RASFF alerts related to chemical contaminants in dairy products, where they achieved an accuracy of up to 95.1%. These findings demonstrate the models' robustness and their ability to classify food safety alerts related to chemical contaminants in dairy products, even on new data. The results can support the development of more effective ML models for classifying such alerts, and they highlight the importance of accurate and efficient classification models for timely intervention.
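As an illustration only, the idea of prioritising hazards by presence and severity can be sketched as a simple count-based ranking. The abstract does not give the study's actual scoring rule, so the ordering used here (number of 'serious' alerts first, then total alerts) is an assumption, and the records, the `ALERTS` name, and the `rank_hazards` function are hypothetical rather than taken from RASFF or GEMS.

```python
from collections import defaultdict

# Hypothetical alert records as (hazard, risk decision) pairs.
# These are made-up values for illustration, not data from the study.
ALERTS = [
    ("aflatoxin M1", "serious"),
    ("aflatoxin M1", "serious"),
    ("aflatoxin M1", "non-serious"),
    ("lead", "serious"),
    ("nitrate", "non-serious"),
    ("nitrate", "non-serious"),
]


def rank_hazards(alerts):
    """Return (hazard, total alerts, serious alerts) tuples in priority order.

    Assumed rule: more 'serious' alerts ranks higher (severity), ties are
    broken by total alerts (presence), then alphabetically by hazard name.
    """
    totals = defaultdict(int)
    serious = defaultdict(int)
    for hazard, decision in alerts:
        totals[hazard] += 1
        if decision == "serious":
            serious[hazard] += 1
    rows = [(hazard, totals[hazard], serious[hazard]) for hazard in totals]
    return sorted(rows, key=lambda row: (-row[2], -row[1], row[0]))


ranking = rank_hazards(ALERTS)
for hazard, n_alerts, n_serious in ranking:
    print(f"{hazard}: {n_alerts} alerts, {n_serious} serious")
```

A real analysis would likely weight severity differently, for example by comparing measured amounts against a reference dose, which the abstract names as an important feature.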
Pages: 14