Prediction of spatio-temporal AQI data

被引:0
|
作者
Kim, Kyeong Eun [1 ]
Ma, Mi Ru [2 ]
Lee, Kyeong Won [1 ,3 ]
机构
[1] Seoul Natl Univ, Dept Stat, Seoul, South Korea
[2] Sungkyunkwan Univ, Dept Stat, Seoul, South Korea
[3] Seoul Natl Univ, Dept Stat, 1 Gwanak Ro, Seoul 08826, South Korea
基金
新加坡国家研究基金会;
关键词
AQI data; descriptive model; neural networks model; random effect; random forest;
D O I
10.29220/CSAM.2023.30.2.119
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
With the rapid growth of the economy and fossil fuel consumption, the concentration of air pollutants has increased significantly and the air pollution problem is no longer limited to small areas. We conduct statistical analysis with the actual data related to air quality that covers the entire of South Korea using R and Python. Some factors such as SO2, CO, O3, NO2, PM10, precipitation, wind speed, wind direction, vapor pressure, local pressure, sea level pressure, temperature, humidity, and others are used as covariates. The main goal of this paper is to predict air quality index (AQI) spatio-temporal data. The observations of spatio-temporal big datasets like AQI data are correlated both spatially and temporally, and computation of the prediction or forecasting with dependence structure is often infeasible. As such, the likelihood function based on the spatio-temporal model may be complicated and some special modelings are useful for statistically reliable predictions. In this paper, we propose several methods for this big spatio-temporal AQI data. First, random effects with spatio-temporal basis functions model, a classical statistical analysis, is proposed. Next, neural networks model, a deep learning method based on artificial neural networks, is applied. Finally, random forest model, a machine learning method that is closer to computational science, will be introduced. Then we compare the forecasting performance of each other in terms of predictive diagnostics. As a result of the analysis, all three methods predicted the normal level of PM2.5 well, but the performance seems to be poor at the extreme value.
引用
收藏
页码:119 / 133
页数:16
相关论文
共 50 条
  • [1] Spatio-Temporal Prediction of Suspect Location by Spatio-Temporal Semantics
    Duan, Lian
    Hu, Tao
    Zhu, Xinyan
    Ye, Xinyue
    Wang, Shaohua
    [J]. Wuhan Daxue Xuebao (Xinxi Kexue Ban)/Geomatics and Information Science of Wuhan University, 2019, 44 (05): : 765 - 770
  • [2] Spatio-Temporal Data Mining for Aviation Delay Prediction
    Zhang, Kai
    Jiang, Yushan
    Liu, Dahai
    Song, Houbing
    [J]. 2020 IEEE 39TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2020,
  • [3] A Spatio-Temporal Linked Data Representation for Modeling Spatio-Temporal Dialect Data
    Scholz, Johannes
    Hrastnig, Emanual
    Wandl-Vogt, Eveline
    [J]. PROCEEDINGS OF WORKSHOPS AND POSTERS AT THE 13TH INTERNATIONAL CONFERENCE ON SPATIAL INFORMATION THEORY (COSIT 2017), 2018, : 275 - 282
  • [4] PREDICTION AND IMPUTATION OF SPATIO-TEMPORAL DATA: DENGUE SURVEILLANCE IN THAILAND
    Lessler, J.
    Reich, N. G.
    Iamsirithaworn, S.
    Cummings, D. A. T.
    [J]. AMERICAN JOURNAL OF EPIDEMIOLOGY, 2011, 173 : S183 - S183
  • [5] DNN-Based Prediction Model for Spatio-Temporal Data
    Zhang, Junbo
    Zheng, Yu
    Qi, Dekang
    Li, Ruiyuan
    Yi, Xiuwen
    [J]. 24TH ACM SIGSPATIAL INTERNATIONAL CONFERENCE ON ADVANCES IN GEOGRAPHIC INFORMATION SYSTEMS (ACM SIGSPATIAL GIS 2016), 2016,
  • [6] Approach of the spatio-temporal prediction using vectorial geographic data
    MezzadriCenteno, T
    SaintJoan, D
    Desachy, J
    Vidal, F
    [J]. REMOTE SENSING FOR GEOGRAPHY, GEOLOGY, LAND PLANNING, AND CULTURAL HERITAGE, 1996, 2960 : 96 - 103
  • [7] ASTIR: Spatio-Temporal Data Mining for Crowd Flow Prediction
    Mourad, Lablack
    Qi, Heng
    Shen, Yanming
    Yin, Baocai
    [J]. IEEE ACCESS, 2019, 7 : 175159 - 175165
  • [8] Spatio-Temporal Ensemble Prediction on Mobile Broadband Network Data
    Samulevicius, Saulius
    Pitarch, Yoann
    Pedersen, Torben Bach
    Sorensen, Troels Bundgaard
    [J]. 2013 IEEE 77TH VEHICULAR TECHNOLOGY CONFERENCE (VTC SPRING), 2013,
  • [9] Mining spatio-temporal data
    Gennady Andrienko
    Donato Malerba
    Michael May
    Maguelonne Teisseire
    [J]. Journal of Intelligent Information Systems, 2006, 27 : 187 - 190
  • [10] Statistics for Spatio-Temporal Data
    Mills, Jeff
    [J]. JOURNAL OF REGIONAL SCIENCE, 2012, 52 (03) : 512 - 513