Big-But-Biased Data Analytics for Air Quality

Cited by: 4
Authors
Borrajo, Laura [1, 3]
Cao, Ricardo [1, 2, 3]
Affiliations
[1] Univ A Coruna, Dept Math, CITIC, Res Grp MODES, La Coruna 15071, Spain
[2] Univ A Coruna, ITMATI, La Coruna 15071, Spain
[3] Univ A Coruna, Fac Comp Sci, Campus Elvina, La Coruna 15071, Spain
Keywords
air quality; automatic bandwidth selection; big data; bootstrap; kernel density estimation; large sample size; sampling bias; smart city; density function
DOI
10.3390/electronics9091551
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Air pollution is a major concern for smart cities. This paper studies how to apply big data analytics under sampling bias in the context of urban air quality. A nonparametric estimator based on kernel density estimation is used. When the biasing weight function is unknown, a small simple random sample of the real population is assumed to be additionally observed. The general parameter considered is the mean of a transformation of the random variable of interest. A new bootstrap algorithm is used to approximate the mean squared error of the estimator, and minimizing this approximation yields an automatic bandwidth selector. The method is applied to a real data set on the levels of different pollutants in the urban air of the city of A Coruña (Galicia, NW Spain). Estimates of the mean and the cumulative distribution function of ozone and nitrogen dioxide levels when the temperature is greater than or equal to 30 °C are obtained from 15 years of biased data.
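The idea in the abstract can be illustrated with a minimal sketch. It assumes one plausible form of the estimator: the unknown weight function is taken to be proportional to the ratio of the biased-sample density to the population density, both estimated with Gaussian kernels, and the mean of a transformation u is a weighted average over the big sample. The function names, the simulated Gamma data, and the rule-of-thumb bandwidths are illustrative assumptions, not taken from the paper; in particular, the paper selects the bandwidth with a bootstrap procedure that is not reproduced here.

```python
import numpy as np


def gaussian_kde(points, sample, bandwidth):
    """Gaussian kernel density estimate built from `sample`, evaluated at `points`."""
    z = (points[:, None] - sample[None, :]) / bandwidth
    norm = sample.size * bandwidth * np.sqrt(2.0 * np.pi)
    return np.exp(-0.5 * z ** 2).sum(axis=1) / norm


def rule_of_thumb(sample):
    """Normal-reference bandwidth (a placeholder, NOT the paper's bootstrap selector)."""
    return 1.06 * sample.std(ddof=1) * sample.size ** (-0.2)


def debiased_mean(biased, srs, u, h, b):
    """Estimate E[u(X)] from a big biased sample plus a small simple random sample.

    Assumed form: weight each biased observation by f_hat / g_hat, which is
    proportional to 1 / w when the biased density is g = w * f / const.
    """
    f_hat = gaussian_kde(biased, srs, h)     # population density, from the small sample
    g_hat = gaussian_kde(biased, biased, b)  # biased density, from the big sample
    inv_w = f_hat / g_hat
    return np.sum(u(biased) * inv_w) / np.sum(inv_w)


# Simulated illustration (hypothetical data): X ~ Gamma(2, 1) has mean 2, and
# length-biased sampling (w(y) = y) turns it into Gamma(3, 1), with mean 3.
rng = np.random.default_rng(0)
srs = rng.gamma(shape=2.0, scale=1.0, size=100)      # small unbiased sample
biased = rng.gamma(shape=3.0, scale=1.0, size=2000)  # big but biased sample

naive = biased.mean()  # ignores the sampling bias
estimate = debiased_mean(
    biased, srs, u=lambda y: y,
    h=rule_of_thumb(srs), b=rule_of_thumb(biased),
)
print(f"naive mean: {naive:.3f}, bias-corrected mean: {estimate:.3f}")
```

Passing an indicator such as `u=lambda y: (y <= t).astype(float)` for a hypothetical threshold `t` would give the corresponding estimate of the cumulative distribution function at `t`.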
Pages: 1 / 11
Page count: 11