Machine Learning Approach-based Big Data Imputation Methods for Outdoor Air Quality forecasting

被引:1
|
作者
Narasimhan, D. [1 ]
Vanitha, M. [2 ]
机构
[1] SASTRA Deemed Univ, Dept Math, Kumbakonam 612001, Tamil Nadu, India
[2] SASTRA Deemed Univ, Srinivasa Ramanujan Ctr, Dept Comp Sci & Engn, Kumbakonam 612001, Tamil Nadu, India
来源
关键词
Air quality; Big data analytics; Classification; Ensemble; Multiple imputation;
D O I
10.56042/jsir.v82i03.71764
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Missing data from ambient air databases is a typical issue, but it is much worse in small towns or cities. Missing data is a significant concern for environmental epidemiology. These settings have high pollution exposure levels worldwide, and dataset gaps obstruct health investigations that could later affect local and international policies. When a substantial number of observations contain missing values, the standard errors increase due to the smaller sample size, which may significantly affect the final result. Generally, the performance of various missing value imputation algorithms is proportional to the size of the database and the percentage of missing values within it. This paper proposes and demonstrates an ensemble - imputation - classification framework approach to rebuild air quality information using a dataset from Beijing, China, to forecast air quality. Various single and multiple imputation procedures are utilized to fill the missing records. Then ensemble of diverse classifiers is used on the imputed data to find the air pollution level. The recommended model aims to reduce the error rate and improve accuracy. Extensive testing of datasets with actual missing values has revealed that the suggested methodology significantly enhances the air quality forecasting model's accuracy with multiple imputation and ensemble techniques when compared to other conventional single imputation techniques.
引用
收藏
页码:338 / 347
页数:10
相关论文
共 50 条
  • [31] Survey of Machine Learning Methods for Big Data Applications
    Vinothini, A.
    Priya, S. Baghavathi
    2017 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE IN DATA SCIENCE (ICCIDS), 2017,
  • [32] A Research on Machine Learning Methods for Big Data Processing
    Qiu, Junfei
    Sun, Youming
    PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND MANAGEMENT INNOVATION, 2015, 28 : 920 - 928
  • [33] Air Quality Class Prediction Using Machine Learning Methods Based on Monitoring Data and Secondary Modeling
    Liu, Qian
    Cui, Bingyan
    Liu, Zhen
    ATMOSPHERE, 2024, 15 (05)
  • [34] Missing data imputation using machine learning based methods to improve HCC survival prediction
    Yumus, Mehmethan
    Apaydin, Merve
    Degirmenci, Ali
    Karal, Omer
    2020 28TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2020,
  • [35] A Novel Missing Data Imputation Approach for Time Series Air Quality Data Based on Logistic Regression
    Chen, Mei
    Zhu, Hongyu
    Chen, Yongxu
    Wang, Youshuai
    ATMOSPHERE, 2022, 13 (07)
  • [36] Big Data Analysis Methods Based on Machine Learning to Ensure Information Security
    Olga, Veselska
    Ruslana, Ziubina
    Yuriy, Finenko
    Joanna, Nikodem
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION & ENGINEERING SYSTEMS (KSE 2021), 2021, 192 : 2633 - 2640
  • [37] Machine Learning Approaches for Outdoor Air Quality Modelling: A Systematic Review
    Rybarczyk, Yves
    Zalakeviciute, Rasa
    APPLIED SCIENCES-BASEL, 2018, 8 (12):
  • [38] Deep learning models for air quality forecasting based on spatiotemporal characteristics of data
    Rehman, Khawar
    Abid, Irfan
    Hong, Seung Ho
    PHYSICS OF FLUIDS, 2024, 36 (05)
  • [39] Machine Learning Based Hybrid System for Imputation and Efficient Energy Demand Forecasting
    Khan, Prince Waqas
    Byun, Yung-Cheol
    Lee, Sang-Joon
    Park, Namje
    ENERGIES, 2020, 13 (11)
  • [40] Hot metal quality monitoring system based on big data and machine learning
    Liu, Ran
    Zhang, Zhi-feng
    Li, Xin
    Liu, Xiao-jie
    Li, Hong-yang
    Bu, Xiang-ping
    Zhao, Jun
    Lyu, Qing
    JOURNAL OF IRON AND STEEL RESEARCH INTERNATIONAL, 2023, 30 (05) : 915 - 925