Respiratory Diseases, Malaria and Leishmaniasis: Temporal and Spatial Association with Fire Occurrences from Knowledge Discovery and Data Mining

Cited by: 5
Authors
Schroeder, Lucas [1]
Veronez, Mauricio Roberto [1]
de Souza, Eniuce Menezes [2]
Brum, Diego [1]
Gonzaga, Luiz, Jr. [1]
Rofatto, Vinicius Francisco [3]
Affiliations
[1] Vale Rio Sinos Univ, X Real & Geoinformat Lab, BR-93022750 Sao Leopoldo, Brazil
[2] Univ Estadual Maringa, Dept Stat, BR-87020900 Maringa, Parana, Brazil
[3] Univ Fed Uberlandia, Dept Geog, BR-38408100 Uberlandia, MG, Brazil
Keywords
health; fire; big data; Data Mining; Knowledge Discovery from Databases; machine learning; PARTICULATE MATTER; HEALTH IMPACTS; PUBLIC-HEALTH; DEFORESTATION; FOREST; AGREEMENT; AREAS
DOI
10.3390/ijerph17103718
Chinese Library Classification number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
The relationship between fire occurrences and diseases is an essential issue for public health policy and environmental protection strategy. Thanks to the Internet, a huge amount of health data and fire occurrence reports is now at our disposal. The challenge, therefore, is how to deal with the 4 Vs (volume, variety, velocity and veracity) associated with these data. To overcome this problem, in this paper we propose a method that combines techniques based on Data Mining and Knowledge Discovery from Databases (KDD) to discover spatial and temporal associations between diseases and fire occurrences. The case study addresses malaria, leishmaniasis and respiratory diseases in Brazil. Instead of spending a lot of time verifying the consistency of the database, the proposed method uses a Decision Tree, a machine learning-based supervised classifier, to manage the data quickly and extract only relevant and strategic information, together with knowledge of how reliable the database is. Namely, the States, Biomes and periods of the year (months) with the highest rates of fires could be identified with high success rates and within a few seconds. Then K-means, an unsupervised learning algorithm that solves the well-known clustering problem, is employed to identify the groups of cities where fire occurrences are most pronounced. Finally, the steps associated with KDD are performed to extract useful information from the mined data. In that step, Spearman's rank correlation coefficient, a nonparametric measure of rank correlation, is computed to infer the statistical dependence between fire occurrences and those diseases. Moreover, maps are generated to represent the distribution of the mined data. The results show that each region was susceptible to some disease and showed some degree of correlation with fire outbreaks, mainly in the drought period.
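The correlation step named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`average_ranks`, `pearson`, `spearman`) are hypothetical, and the monthly figures are invented purely for illustration. The sketch computes Spearman's coefficient as the Pearson correlation of the ranks, with tied values sharing the mean of their rank positions.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next sorted value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1  # positions are 0-based, ranks 1-based
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / (sd_x * sd_y)


def spearman(x, y):
    """Spearman's rank correlation: Pearson correlation of the average ranks."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    return pearson(average_ranks(x), average_ranks(y))


if __name__ == "__main__":
    # Invented monthly values for one hypothetical region (NOT data from the paper).
    fires = [12, 8, 15, 30, 55, 90, 140, 210, 180, 95, 40, 20]
    admissions = [101, 99, 98, 110, 130, 125, 170, 205, 190, 160, 120, 108]
    print(f"Spearman rho = {spearman(fires, admissions):.3f}")
```

Ranking before correlating makes the measure sensitive to any monotonic relationship rather than only a linear one, which is why it suits count data with heavy seasonal peaks. A coefficient near +1 or -1 indicates a strong monotonic association but does not by itself establish causation.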
Pages: 23