Forecasting tuberculosis using diabetes-related google trends data

被引:9
|
作者
Frauenfeld, Leonie [1 ]
Nann, Dominik [1 ]
Sulyok, Zita [2 ]
Feng, You-Shan [3 ]
Sulyok, Mihaly [1 ]
机构
[1] Eberhard Karls Univ Tubingen, Univ Hosp Tubingen, Inst Pathol & Neuropathol, Liebermeisterstr 8, D-72076 Tubingen, Germany
[2] Eberhard Karls Univ Tubingen, Univ Hosp Tubingen, Inst Trop Med, D-72074 Tubingen, Germany
[3] Univ Hosp Tubingen, Dept Clin Epidemiol & Appl Biometry, D-72076 Tubingen, Germany
关键词
Tuberculosis; Diabetes; Google Trends; Surveillance; Forecasting;
D O I
10.1080/20477724.2020.1767854
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Online activity-based data can be used to aid infectious disease forecasting. Our aim was to exploit the converging nature of the tuberculosis (TB) and diabetes epidemics to forecast TB case numbers. Thus, we extended TB prediction models based on traditional data with diabetes-related Google searches. We obtained data on the weekly case numbers of TB in Germany from June 8(th), 2014, to May 5(th), 2019. Internet search data were obtained from a Google Trends (GTD) search for 'diabetes' to the corresponding interval. A seasonal autoregressive moving average (SARIMA) model (0,1,1) (1,0,0) [52] was selected to describe the weekly TB case numbers with and without GTD as an external regressor. We cross-validated the SARIMA models to obtain the root mean squared errors (RMSE). We repeated this procedure with autoregressive feed-forward neural network (NNAR) models using 5-fold cross-validation. To simulate a data-poor surveillance setting, we also tested traditional and GTD-extended models against a hold-out dataset using a decreased 52-week-long period with missing values for training. Cross-validation resulted in an RMSE of 20.83 for the traditional model and 18.56 for the GTD-extended model. Cross-validation of the NNAR models showed a mean RMSE of 19.49 for the traditional model and 18.99 for the GTD-extended model. When we tested the models trained on a decreased dataset with missing values, the GTD-extended models achieved significantly better prediction than the traditional models (p < 0.001). The GTD-extended models outperformed the traditional models in all assessed model evaluation parameters. Using online activity-based data regarding diabetes can improve TB forecasting, but further validation is warranted.
引用
收藏
页码:236 / 241
页数:6
相关论文
共 50 条
  • [21] Using GIS and Secondary Data to Target Diabetes-Related Public Health Efforts
    Curtis, Amy B.
    Kothari, Catherine
    Paul, Rajib
    Connors, Elyse
    [J]. PUBLIC HEALTH REPORTS, 2013, 128 (03) : 212 - 220
  • [22] In Search of a Job: Forecasting Employment Growth Using Google Trends
    Borup, Daniel
    Schutte, Erik Christian Montes
    [J]. JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2022, 40 (01) : 186 - 200
  • [23] Forecasting railway ticket dynamic price with Google Trends open data
    Stavinova, Elizaveta
    Chunaev, Petr
    Bochenina, Klavdiya
    [J]. 10TH INTERNATIONAL YOUNG SCIENTISTS CONFERENCE IN COMPUTATIONAL SCIENCE (YSC2021), 2021, 193 : 333 - 342
  • [24] Can Google Trends data improve forecasting of Lyme disease incidence?
    Kapitany-Foveny, Mate
    Ferenci, Tamas
    Sulyok, Zita
    Kegele, Josua
    Richter, Hardy
    Valyi-Nagy, Istvan
    Sulyok, Mihaly
    [J]. ZOONOSES AND PUBLIC HEALTH, 2019, 66 (01) : 101 - 107
  • [25] Forecasting building permits with Google Trends
    David Coble
    Pablo Pincheira
    [J]. Empirical Economics, 2021, 61 : 3315 - 3345
  • [26] Influenza Forecasting with Google Flu Trends
    Dugas, Andrea Freyer
    Jalalpour, Mehdi
    Gel, Yulia
    Levin, Scott
    Torcaso, Fred
    Igusa, Takeru
    Rothman, Richard E.
    [J]. PLOS ONE, 2013, 8 (02):
  • [27] Forecasting building permits with Google Trends
    Coble, David
    Pincheira, Pablo
    [J]. EMPIRICAL ECONOMICS, 2021, 61 (06) : 3315 - 3345
  • [28] Diabetes-related tuberculosis in the Middle East: an urgent need for regional research
    Alkabab, Yosra M.
    Al-Abdely, Hail M.
    Heysell, Scott K.
    [J]. INTERNATIONAL JOURNAL OF INFECTIOUS DISEASES, 2015, 40 : 64 - 70
  • [29] Diabetes-Related Dementia
    Hanyu, Haruo
    [J]. DIABETES MELLITUS: A RISK FACTOR FOR ALZHEIMER'S DISEASE, 2019, 1128 : 147 - 160
  • [30] Trends in diabetes-related visits to US EDs from 1997 to 2007
    Menchine, Michael D.
    Wiechmann, Warren
    Peters, Anne L.
    Arora, Sanjay
    [J]. AMERICAN JOURNAL OF EMERGENCY MEDICINE, 2012, 30 (05): : 754 - 758