Automatized spatio-temporal detection of drought impacts from newspaper articles using natural language processing and machine learning

被引:14
|
作者
Sodoge, Jan [1 ,2 ]
Kuhlicke, Christian [1 ,2 ]
de Brito, Mariana Madruga [1 ]
机构
[1] UFZ Helmholtz Ctr Environm Res, Dept Urban & Environm Sociol, D-04318 Leipzig, Germany
[2] Univ Potsdam, Inst Environm Sci & Geog, D-14476 Potsdam, Germany
来源
关键词
Germany; Drought; NLP; Text mining; Machine learning; Natural hazards; Socio-economic impacts; Longitudinal study; BIG DATA; TEXT; EVENTS; FOREST;
D O I
10.1016/j.wace.2023.100574
中图分类号
P4 [大气科学(气象学)];
学科分类号
0706 ; 070601 ;
摘要
Droughts are expected to increase both in terms of frequency and magnitude across Europe. Despite the multitude of adverse effects these disasters impose on social-ecological systems, most impact assessments are constrained to single event and/or single sector analyses. Furthermore, existing longitudinal multi-sectoral datasets are limited in spatiotemporal homogeneity and scope, resulting in fragmented datasets. To address this gap, we propose a novel method for the automatized detection of drought impacts based on newspaper articles. We employ natural language processing (NLP) and machine learning to identify different socio-economic impacts (e.g. agriculture, forestry, livestock, fires) and their geographic and temporal scope from 40,000 newspaper articles reporting about droughts in Germany between 2000 and 2021. Our method is able to track impacts over long time periods, allowing us to assess how drought impacts evolve. Accuracy levels of 92-96% per impact class were obtained for the automatic classification of the impacts when evaluated on a human-annotated dataset. Furthermore, our resulting impact dataset can replicate both temporal and spatial trends when validated against independent impact and hazard data. Overall, the proposed approach advances current research as it (1) requires a significantly lower workload than conventional impact assessment methods, (2) allows addressing large text datasets, (3) reduces subjectivity and human bias, (4) is generalizable to other hazard types as well as text corpora, and (5) achieves sufficient levels of accuracy. The findings highlight the applicability of NLP and machine learning to create comprehensive longitudinal impact datasets.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] Prediction of Landslides Using Machine Learning Techniques Based on Spatio-Temporal Factors and InSAR Data
    Lin Y.-T.
    Yen H.-Y.
    Chang N.-H.
    Lin H.-M.
    Han J.-Y.
    Yang K.-H.
    Chen C.-S.
    Zheng H.-K.
    Hsu J.-Y.
    Journal of the Chinese Institute of Civil and Hydraulic Engineering, 2021, 33 (02): : 93 - 104
  • [42] Assessment of Spatio-temporal Climatological trends of ozone over the Indian region using Machine Learning
    Pathakoti, Mahesh
    Santhoshi, T.
    Aarathi, M.
    Mahalakshmi, D. V.
    Kanchana, A. L.
    Srinivasulu, J.
    Shekhar, Raja S. S.
    Soni, Vijay Kumar
    Sai, Sesha M. V. R.
    Raja, P.
    SPATIAL STATISTICS, 2021, 43
  • [43] Spatio-temporal dynamics of urban sprawl in a rapidly urbanizing city using machine learning classification
    Krishnaveni, K. S.
    Kumar, P. P. Anil
    GEOCARTO INTERNATIONAL, 2022, 37 (27) : 17403 - 17434
  • [44] Predicting the emplacement of Cordilleran porphyry copper systems using a spatio-temporal machine learning model
    Diaz-Rodriguez, Julian
    Muller, R. Dietmar
    Chandra, Rohitash
    ORE GEOLOGY REVIEWS, 2021, 137
  • [45] Sound Source Separation Using Spatio-temporal Sound Pressure Distribution Images and Machine Learning
    Ozawa, Kenji
    Shiozawa, Koichiro
    Ise, Tomohiko
    PROCEEDINGS 2019 AMITY INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AICAI), 2019, : 54 - 60
  • [46] Analysis of Spatio-Temporal Characteristics of Drought in Serbia From 1961 to 2020 Using SPI and SPEI
    Filipovic, Lazar
    Putnikovic, Suzana
    Stosic, Borko
    Stosic, Tatijana
    Djurdjevic, Vladimir
    Tosic, Ivana
    INTERNATIONAL JOURNAL OF CLIMATOLOGY, 2025,
  • [47] Modelling the seasonal dynamics of Aedesalbopictus populations using a spatio-temporal stacked machine learning model
    Da Re, Daniele
    Marini, Giovanni
    Bonannella, Carmelo
    Laurini, Fabrizio
    Manica, Mattia
    Anicic, Nikoleta
    Albieri, Alessandro
    Angelini, Paola
    Arnoldi, Daniele
    Bertola, Federica
    Caputo, Beniamino
    De Liberato, Claudio
    della Torre, Alessandra
    Flacio, Eleonora
    Franceschini, Alessandra
    Gradoni, Francesco
    Kadriaj, Perparim
    Lencioni, Valeria
    Del Lesto, Irene
    Russa, Francesco La
    Lia, Riccardo Paolo
    Montarsi, Fabrizio
    Otranto, Domenico
    L'Ambert, Gregory
    Rizzoli, Annapaola
    Rombola, Pasquale
    Romiti, Federico
    Stancher, Gionata
    Torina, Alessandra
    Velo, Enkelejda
    Virgillito, Chiara
    Zandonai, Fabiana
    Rosa, Roberto
    SCIENTIFIC REPORTS, 2025, 15 (01):
  • [48] Machine Learning based Biospeckle Technique for Identification of Seed Viability using Spatio-temporal Analysis
    Thakur, Puneet Singh
    Kumar, Abhishek
    Tiwari, Bhavya
    Gedam, Bhavesh
    Bhatia, Vimal
    Rana, Santosh
    Prakash, Shashi
    2022 WORKSHOP ON RECENT ADVANCES IN PHOTONICS (WRAP), 2022, : 118 - 119
  • [49] Predicting Purchase Decisions Based on Spatio-Temporal Functional MRI Features Using Machine Learning
    Wang, Yunzhi
    Chattaraman, Veena
    Kim, Hyejeong
    Deshpande, Gopikrishna
    IEEE TRANSACTIONS ON AUTONOMOUS MENTAL DEVELOPMENT, 2015, 7 (03) : 248 - 255
  • [50] Study on spatio-temporal simulation and prediction of regional deep soil moisture using machine learning
    A, Yinglan
    Jiang, Xiaoman
    Wang, Yuntao
    Wang, Libo
    Zhang, Zihao
    Duan, Limin
    Fang, Qingqing
    JOURNAL OF CONTAMINANT HYDROLOGY, 2023, 258