High-Efficiency Machine Learning Method for Identifying Foodborne Disease Outbreaks and Confounding Factors

被引:12
|
作者
Zhang, Peng [1 ,2 ]
Cui, Wenjuan [1 ]
Wang, Hanxue [1 ,2 ]
Du, Yi [1 ,2 ]
Zhou, Yuanchun [1 ,2 ]
机构
[1] Chinese Acad Sci, Comp Network Informat Ctr, Bldg 2,Software Pk 4,South Fourth St, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing, Peoples R China
关键词
foodborne disease outbreaks; machine learning; foodborne disease; SURVEILLANCE;
D O I
10.1089/fpd.2020.2913
中图分类号
TS2 [食品工业];
学科分类号
0832 ;
摘要
The China National Center for Food Safety Risk Assessment (CFSA) uses the Foodborne Disease Monitoring and Reporting System (FDMRS) to monitor outbreaks of foodborne diseases across the country. However, there are problems of underreporting or erroneous reporting in FDMRS, which significantly increase the cost of related epidemic investigations. To solve this problem, we designed a model to identify suspected outbreaks from the data generated by the FDMRS of CFSA. In this study, machine learning models were used to fit the data. The recall rate and F1-score were used as evaluation metrics to compare the classification performance of each model. Feature importance and pathogenic factors were identified and analyzed using tree-based and gradient boosting models. Three real foodborne disease outbreaks were then used to evaluate the best performing model. Furthermore, the SHapley Additive exPlanation value was used to identify the effect of features. Among all machine learning classification models, the eXtreme Gradient Boosting (XGBoost) model achieved the best performance, with the highest recall rate and F1-score of 0.9699 and 0.9582, respectively. In terms of model validation, the model provides a correct judgment of real outbreaks. In the feature importance analysis with the XGBoost model, the health status of the other people with the same exposure has the highest weight, reaching 0.65. The machine learning model built in this study exhibits high accuracy in recognizing foodborne disease outbreaks, thus reducing the manual burden for medical staff. The model helped us identify the confounding factors of foodborne disease outbreaks. Attention should be paid not only to the health status of those with the same exposure but also to the similarity of the cases in time and space.
引用
收藏
页码:590 / 598
页数:9
相关论文
共 50 条
  • [31] Crowdsourcing and machine learning approaches for extracting entities indicating potential foodborne outbreaks from social media
    Dandan Tao
    Dongyu Zhang
    Ruofan Hu
    Elke Rundensteiner
    Hao Feng
    [J]. Scientific Reports, 11
  • [32] Machine-Learning Modeling for Ultra-Stable High-Efficiency Perovskite Solar Cells
    Hu, Yingjie
    Hu, Xiaobing
    Zhang, Lu
    Zheng, Tao
    You, Jiaxue
    Jia, Binxia
    Ma, Yabin
    Du, Xinyi
    Zhang, Lei
    Wang, Jincheng
    Che, Bo
    Chen, Tao
    Liu, Shengzhong
    [J]. ADVANCED ENERGY MATERIALS, 2022, 12 (41)
  • [33] Design and implementation for a high-efficiency hardware accelerator to realize the learning machine for predicting OLED degradation
    I.-Feng Chang
    Hao-Ren Chen
    Paul C.-P. Chao
    [J]. Microsystem Technologies, 2023, 29 : 1069 - 1081
  • [34] High-efficiency scattering field modeling in metallic components: a machine-learning-inspired approach
    Chiang, Po-Jui
    Tseng, Chih Lung
    Wang, Chien-Kun
    [J]. JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2024, 41 (06) : 1019 - 1026
  • [35] Design and implementation for a high-efficiency hardware accelerator to realize the learning machine for predicting OLED degradation
    Chang, I. -Feng
    Chen, Hao-Ren
    Chao, Paul C. -P.
    [J]. MICROSYSTEM TECHNOLOGIES-MICRO-AND NANOSYSTEMS-INFORMATION STORAGE AND PROCESSING SYSTEMS, 2023, 29 (08): : 1069 - 1081
  • [36] Machine learning techniques and research framework in foodborne disease surveillance system
    Du, Yi
    Guo, Yunchang
    [J]. FOOD CONTROL, 2022, 131
  • [37] Identifying factors via automatic debiased machine learning
    Maasoumi, Esfandiar
    Wang, Jianqiu
    Wang, Zhuo
    Wu, Ke
    [J]. JOURNAL OF APPLIED ECONOMETRICS, 2024, 39 (03) : 438 - 461
  • [38] Contributing Factors in Restaurant- Associated Foodborne Disease Outbreaks, FoodNet Sites, 2006 and 2007
    Gould, L. Hannah
    Rosenblum, Ida
    Nicholas, David
    Phan, Quyen
    Jones, Timothy F.
    [J]. JOURNAL OF FOOD PROTECTION, 2013, 76 (11) : 1824 - 1828
  • [39] High-efficiency plating method for Leishmania infantum
    Quijada, L
    Soto, M
    Alonso, C
    Requena, JM
    [J]. MOLECULAR AND BIOCHEMICAL PARASITOLOGY, 2003, 130 (02) : 139 - 141
  • [40] HIGH-EFFICIENCY OF THE INDUSTRY-COMPLEX METHOD
    STOLYAROV, EV
    RIZVANOV, NM
    KAGARMANOV, NF
    [J]. NEFTYANOE KHOZYAISTVO, 1982, (05): : 22 - 24