Data-driven methods for dengue prediction and surveillance using real-world and Big Data: A systematic review

被引:21
|
作者
Sylvestre, Emmanuelle [1 ,2 ]
Joachim, Clarisse [3 ,4 ]
Cecilia-Joseph, Elsa [2 ]
Bouzille, Guillaume [1 ]
Campillo-Gimenez, Boris [1 ,5 ]
Cuggia, Marc [1 ]
Cabie, Andre [6 ,7 ,8 ]
机构
[1] Univ Rennes, CHU Rennes, INSERM, LTSI,UMR 1099, Rennes, France
[2] CHU Martinique, Ctr Donnees Clin, Martinique, France
[3] CHU Martinique, Pole Cancerol Hematol Urol, Registre Gen Canc Martinique, Martinique, France
[4] CHU Martinique, Pole Cancerol Hematol Urol, Martinique Canc Data Hub, Martinique, France
[5] Ctr Lutte Canc Eugene Marquis, Rennes, France
[6] CHU Martinique, Infect & Trop Dis Unit, Martinique, France
[7] CHU Martinique, INSERM, CIC 1424, Martinique, France
[8] Univ Montpellier, INSERM, EFS, Univ Antilles,PCCEI, Montpellier, France
来源
PLOS NEGLECTED TROPICAL DISEASES | 2022年 / 16卷 / 01期
关键词
SOCIAL MEDIA SURVEILLANCE; FEVER SURVEILLANCE; DISEASE OUTBREAKS; MODEL; CLASSIFICATION; EPIDEMICS; PROGNOSIS; NETWORKS; CLIMATE; TRENDS;
D O I
10.1371/journal.pntd.0010056
中图分类号
R51 [传染病];
学科分类号
100401 ;
摘要
BackgroundTraditionally, dengue surveillance is based on case reporting to a central health agency. However, the delay between a case and its notification can limit the system responsiveness. Machine learning methods have been developed to reduce the reporting delays and to predict outbreaks, based on non-traditional and non-clinical data sources. The aim of this systematic review was to identify studies that used real-world data, Big Data and/or machine learning methods to monitor and predict dengue-related outcomes. Methodology/Principal findingsWe performed a search in PubMed, Scopus, Web of Science and grey literature between January 1, 2000 and August 31, 2020. The review (ID: CRD42020172472) focused on data-driven studies. Reviews, randomized control trials and descriptive studies were not included. Among the 119 studies included, 67% were published between 2016 and 2020, and 39% used at least one novel data stream. The aim of the included studies was to predict a dengue-related outcome (55%), assess the validity of data sources for dengue surveillance (23%), or both (22%). Most studies (60%) used a machine learning approach. Studies on dengue prediction compared different prediction models, or identified significant predictors among several covariates in a model. The most significant predictors were rainfall (43%), temperature (41%), and humidity (25%). The two models with the highest performances were Neural Networks and Decision Trees (52%), followed by Support Vector Machine (17%). We cannot rule out a selection bias in our study because of our two main limitations: we did not include preprints and could not obtain the opinion of other international experts. Conclusions/SignificanceCombining real-world data and Big Data with machine learning methods is a promising approach to improve dengue prediction and monitoring. Future studies should focus on how to better integrate all available data sources and methods to improve the response and dengue management by stakeholders. Author summaryDengue is one of the most important arbovirus infections in the world and its public health, societal and economic burden is increasing. Although the majority of dengue cases are asymptomatic or mild, severe disease forms can lead to death. For this reason, early diagnosis and monitoring of dengue are crucial to decrease mortality. However, most endemic regions still rely on traditional monitoring methods, despite the growing availability of novel data sources and data-driven methods based on real-world data, Big Data, and machine learning algorithms. In this systematic review, we identified and analyzed studies that used these novel approaches for dengue monitoring and/or prediction. We found that novel data streams, such as Internet search engines and social media platforms, and machine learning methods can be successfully used to improve dengue management, but are still vastly ignored in real life. These approaches should be combined with traditional methods to help stakeholders better prepare for each outbreak and improve early responsiveness.
引用
收藏
页数:22
相关论文
共 50 条
  • [1] Big Geospatial Data and Data-Driven Methods for Urban Dengue Risk Forecasting: A Review
    Li, Zhichao
    Dong, Jinwei
    REMOTE SENSING, 2022, 14 (19)
  • [2] Ready or not? A systematic review of case studies using data-driven approaches to detect real-world antitrust violations
    Amthauer, Jan
    Fleiss, Juergen
    Guggi, Franziska
    Robertson, Viktoria H. S. E.
    COMPUTER LAW & SECURITY REVIEW, 2023, 49
  • [3] Big Data as the Big Game Changer Big Data-driven world needs Big Data-driven ideology
    Smorodin, Gennady
    Kolesnichenko, Olga
    2015 9TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT), 2015, : 40 - 43
  • [4] REAL-WORLD DATA-DRIVEN RISK PREDICTION OF NON-DIABETIC CKD PROGRESSION
    Vaitsiakhovich, Tatsiana
    Schuchhardt, Johannes
    Bauer, Chris
    Kleinjung, Frank
    Wanner, Christoph
    AMERICAN JOURNAL OF KIDNEY DISEASES, 2022, 79 (04) : S52 - S52
  • [5] Synthesis of electric vehicle charging data: A real-world data-driven approach
    Li Z.
    Bian Z.
    Chen Z.
    Ozbay K.
    Zhong M.
    Communications in Transportation Research, 2024, 4
  • [6] The Effectiveness of Big Data-Driven Predictive Policing: Systematic Review
    Lee, Youngsub
    Bradford, Ben
    Posch, Krisztian
    JUSTICE EVALUATION JOURNAL, 2024,
  • [7] Role of real-world digital data for orthopedic implant automated surveillance: a systematic review
    Dhalluin, Thibault
    Fakhiri, Sara
    Bouzille, Guillaume
    Herbert, Julien
    Rosset, Philippe
    Cuggia, Marc
    Grammatico-Guillon, Leslie
    EXPERT REVIEW OF MEDICAL DEVICES, 2021, 18 (08) : 799 - 810
  • [8] Three-dimensional data-driven magnetostatic field computation using real-world measurement data
    Galetzka, Armin
    Loukrezis, Dimitrios
    De Gersem, Herbert
    COMPEL-THE INTERNATIONAL JOURNAL FOR COMPUTATION AND MATHEMATICS IN ELECTRICAL AND ELECTRONIC ENGINEERING, 2022, 41 (02) : 615 - 627
  • [9] Data-driven Semi-supervised Anomaly Detection using Real-World Call Data Record
    Jaffry, Shan
    Shah, Syed Tariq
    Hasan, Syed Faraz
    2020 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE WORKSHOPS (WCNCW), 2020,
  • [10] Data Science Methods for Real-World Evidence Generation in Real-World Data
    Liu, Fang
    ANNUAL REVIEW OF BIOMEDICAL DATA SCIENCE, 2024, 7 : 201 - 224