Big-Data-Driven Machine Learning for Enhancing Spatiotemporal Air Pollution Pattern Analysis

被引:13
|
作者
Zareba, Mateusz [1 ]
Dlugosz, Hubert [1 ]
Danek, Tomasz [1 ]
Weglinska, Elzbieta [1 ]
机构
[1] AGH Univ Sci & Technol, Fac Geol Geophys & Environm Protect, Dept Geoinformat & Appl Comp Sci, PL-30059 Krakow, Poland
关键词
big data; machine learning; spatiotemporal; air pollution; pattern analysis; time series; SPATIAL ASSOCIATION;
D O I
10.3390/atmos14040760
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Air pollution is an important problem for public health. The spatiotemporal analysis is a crucial step for understanding the complex characteristics of air pollution. Using many sensors and high-resolution time-step observations makes this task a big data challenge. In this study, unsupervised machine learning algorithms were applied to analyze spatiotemporal patterns of air pollution. The analysis was conducted using PM10 big data collected from almost 100 sensors located in Krakow, over a period of one year, with data being recorded at 1-h intervals. The analysis results using K-means and SKATER clustering revealed distinct differences between average and maximum values of pollutant concentrations. The study found that the K-means algorithm with Dynamic Time Warping (DTW) was more accurate in identifying yearly patterns and clustering in rapidly and spatially varying data, compared to the SKATER algorithm. Moreover, the clustering analysis of data after kriging greatly facilitated the interpretation of the results. These findings highlight the potential of machine learning techniques and big data analysis for identifying hot-spots, coldspots, and patterns of air pollution and informing policy decisions related to urban planning, traffic management, and public health interventions.
引用
收藏
页数:25
相关论文
共 50 条
  • [1] A big-data-driven matching model based on deep reinforcement learning for cotton blending
    Xia, Huosong
    Wang, Yuan
    Jasimuddin, Sajjad
    Zhang, Justin Zuopeng
    Thomas, Andrew
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2023, 61 (22) : 7573 - 7591
  • [2] Big-Data-Driven Intelligent Wireless Network and Use Cases
    Li, Lanlan
    Tan, Xiaosi
    2021 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2021,
  • [3] Spatiotemporal Pattern of Air Turbulence Risks with QAR Flight Big Data
    Zhang L.
    Sun H.
    Wang C.
    Yu C.
    Lu B.
    Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2024, 49 (03): : 482 - 490
  • [4] Big-data-driven Model Construction and Empirical Analysis of SMEs Credit Assessment in China
    Liu Yadi
    Song Yuning
    Yu Jiayue
    Xie Yingfa
    Wang Yiyuan
    Zeng Xiaoping
    2018 INTERNATIONAL CONFERENCE ON IDENTIFICATION, INFORMATION AND KNOWLEDGE IN THE INTERNET OF THINGS, 2019, 147 : 613 - 619
  • [5] A Novel Big-data-driven Credit Reporting Framework for SMEs in China
    Sun, Yunchuan
    Li, Chunlei
    Cui, Xuegang
    Zeng, Xiaoping
    Chang, Xueying
    Zhang, Guangzhi
    Tu, Dengbiao
    Xiong, Yongping
    2016 INTERNATIONAL CONFERENCE ON IDENTIFICATION, INFORMATION AND KNOWLEDGE IN THE INTERNET OF THINGS (IIKI), 2016, : 463 - 469
  • [6] Big-data-driven pre-stack seismic intelligent inversion
    Yan, Xuesong
    Zhang, Mingzhao
    Wu, Qinghua
    INFORMATION SCIENCES, 2021, 549 : 34 - 52
  • [7] EVOLUTIONARY MACHINE LEARNING DRIVEN BIG DATA ANALYSIS AND PROCESSING FOR INDUSTRIAL INTERNET
    Chen, Wei
    Meng, Wei
    Zhang, Lingling
    FRACTALS-COMPLEX GEOMETRY PATTERNS AND SCALING IN NATURE AND SOCIETY, 2023, 31 (06)
  • [8] A Study of Big-Data-Driven Data Visualization and Visual Communication Design Patterns
    Zhu, Weiming
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [9] Deep learning spatiotemporal air pollution data in China using data fusion
    Zhou, Xiaolu
    Tong, Weitian
    Li, Lixin
    EARTH SCIENCE INFORMATICS, 2020, 13 (03) : 859 - 868
  • [10] Deep learning spatiotemporal air pollution data in China using data fusion
    Xiaolu Zhou
    Weitian Tong
    Lixin Li
    Earth Science Informatics, 2020, 13 : 859 - 868