Forecasting tuberculosis using diabetes-related google trends data

被引:9
|
作者
Frauenfeld, Leonie [1 ]
Nann, Dominik [1 ]
Sulyok, Zita [2 ]
Feng, You-Shan [3 ]
Sulyok, Mihaly [1 ]
机构
[1] Eberhard Karls Univ Tubingen, Univ Hosp Tubingen, Inst Pathol & Neuropathol, Liebermeisterstr 8, D-72076 Tubingen, Germany
[2] Eberhard Karls Univ Tubingen, Univ Hosp Tubingen, Inst Trop Med, D-72074 Tubingen, Germany
[3] Univ Hosp Tubingen, Dept Clin Epidemiol & Appl Biometry, D-72076 Tubingen, Germany
关键词
Tuberculosis; Diabetes; Google Trends; Surveillance; Forecasting;
D O I
10.1080/20477724.2020.1767854
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Online activity-based data can be used to aid infectious disease forecasting. Our aim was to exploit the converging nature of the tuberculosis (TB) and diabetes epidemics to forecast TB case numbers. Thus, we extended TB prediction models based on traditional data with diabetes-related Google searches. We obtained data on the weekly case numbers of TB in Germany from June 8(th), 2014, to May 5(th), 2019. Internet search data were obtained from a Google Trends (GTD) search for 'diabetes' to the corresponding interval. A seasonal autoregressive moving average (SARIMA) model (0,1,1) (1,0,0) [52] was selected to describe the weekly TB case numbers with and without GTD as an external regressor. We cross-validated the SARIMA models to obtain the root mean squared errors (RMSE). We repeated this procedure with autoregressive feed-forward neural network (NNAR) models using 5-fold cross-validation. To simulate a data-poor surveillance setting, we also tested traditional and GTD-extended models against a hold-out dataset using a decreased 52-week-long period with missing values for training. Cross-validation resulted in an RMSE of 20.83 for the traditional model and 18.56 for the GTD-extended model. Cross-validation of the NNAR models showed a mean RMSE of 19.49 for the traditional model and 18.99 for the GTD-extended model. When we tested the models trained on a decreased dataset with missing values, the GTD-extended models achieved significantly better prediction than the traditional models (p < 0.001). The GTD-extended models outperformed the traditional models in all assessed model evaluation parameters. Using online activity-based data regarding diabetes can improve TB forecasting, but further validation is warranted.
引用
收藏
页码:236 / 241
页数:6
相关论文
共 50 条
  • [31] Short-term forecasting of Japanese tourist inflow to South Korea using Google trends data
    Park, Sangkon
    Lee, Jungmin
    Song, Wonho
    [J]. JOURNAL OF TRAVEL & TOURISM MARKETING, 2017, 34 (03) : 357 - 368
  • [32] Predicting the Number of GHB-Related Toxicologic Admissions Using Google Trends Data
    Kapitany-Foveny, Mate
    Ferenci, Tamas
    Demetrovics, Zsolt
    Sulyok, Mihaly
    [J]. INTERNATIONAL JOURNAL OF MENTAL HEALTH AND ADDICTION, 2023, 21 (03) : 2006 - 2016
  • [33] Predicting the Number of GHB-Related Toxicologic Admissions Using Google Trends Data
    Máté Kapitány-Fövény
    Tamás Ferenci
    Zsolt Demetrovics
    Mihály Sulyok
    [J]. International Journal of Mental Health and Addiction, 2023, 21 : 2006 - 2016
  • [34] Recent trends in diabetes-related consumer health information technology research
    Claiborne, John Preston
    Wellbeloved-Stone, Claire
    Valdez, Rupa S.
    [J]. 2019 41ST ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY (EMBC), 2019, : 4283 - 4288
  • [35] Trends by gender and predictors of diabetes-related amputations: an Italian regional analysis
    Scampoli, Piera
    Di Giovanni, P.
    Meo, F.
    Cedrone, F.
    Di Martino, G.
    Staniscia, T.
    [J]. EUROPEAN JOURNAL OF PUBLIC HEALTH, 2019, 29
  • [36] Trends in Diabetes-Related Preventable Hospitalizations in the US, 2005-2014
    Rubens, Muni
    Saxena, Anshul
    Ramamoorthy, Venkataraghavan
    Khera, Rohan
    Hong, Jonathan
    Veledar, Emir
    Nasir, Khurram
    [J]. DIABETES CARE, 2018, 41 (05) : E72 - E73
  • [37] Trends in the Costs of Diabetes-Related Preventable Hospitalizations, 2002-2013
    Shrestha, Sundar S.
    Zhang, Ping
    [J]. DIABETES, 2017, 66 : A74 - A74
  • [38] Trends in Racial and Ethnic Disparities in Diabetes-Related Complications, 1997–2017
    Tommy Chiou
    Yusuke Tsugawa
    Dana Goldman
    Rebecca Myerson
    Matthew Kahn
    John A. Romley
    [J]. Journal of General Internal Medicine, 2020, 35 : 950 - 951
  • [39] Forecasting National and Regional Youth Unemployment in Spain Using Google Trends
    Mihaela Simionescu
    Javier Cifuentes-Faura
    [J]. Social Indicators Research, 2022, 164 : 1187 - 1216
  • [40] Forecasting the Diffusion of Product and Technology Innovations: Using Google Trends as an Example
    Duwe, Daniel
    Herrmann, Florian
    Spath, Dieter
    [J]. 2018 PORTLAND INTERNATIONAL CONFERENCE ON MANAGEMENT OF ENGINEERING AND TECHNOLOGY (PICMET '18): MANAGING TECHNOLOGICAL ENTREPRENEURSHIP: THE ENGINE FOR ECONOMIC GROWTH, 2018,