Federated conditional generative adversarial nets imputation method for air quality missing data

被引:24
|
作者
Zhou, Xu [1 ,3 ]
Liu, Xiaofeng [2 ,3 ]
Lan, Gongjin [4 ,5 ]
Wu, Jian [6 ]
机构
[1] Hohai Univ, Coll Comp & Informat, Hohai, Peoples R China
[2] Hohai Univ, Coll Internet Things IOT Engn, Hohai, Peoples R China
[3] Jiangsu Key Lab Special Robot Technol, Changzhou, Jiangsu, Peoples R China
[4] Southern Univ Sci & Technol, Dept Comp Sci & Engn, Shenzhen, Guangdong, Peoples R China
[5] Vrije Univ Amsterdam, Dept Comp Sci, Amsterdam, Netherlands
[6] Jiangsu Acad Environm Ind & Technol Corp JSAEIT, Nanjing, Jiangsu, Peoples R China
关键词
Air pollutants; Conditional GAN imputation; Federated learning; Privacy-preserving machine learning; TIME-SERIES; VALUES; NETWORKS;
D O I
10.1016/j.knosys.2021.107261
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The air quality is a topic of extreme concern that attracts a lot of attention in the world. Many intel-ligent air quality monitoring networks have been deployed in various places, especially in big cities. These monitoring networks collect air quality data with some missing data for some reasons which pose an obstacle for air quality publishing and studies. Generative adversarial nets (GAN) methods have achieved state-of-the-art performance in missing data imputation. GAN-based imputation method needs enough training data while one monitoring network has just a few and poor quality monitoring data and these data sets do not meet the independent identical distribution (IID) condition. Therefore, one monitoring network side needs to utilize more monitoring data from other sides as far as possible. However, in the real world, these air quality monitoring networks are owned by different organizations - companies, the government even some secret units. Many of them cannot share detailed monitoring data due to security, privacy, and industrial competition. In this paper, it is the first time to propose a conditional GAN imputation method under a federated learning framework to solve the data sets that come from diverse data-owners without sharing. Furthermore, we improve the vanilla conditional GAN performance with Wasserstein distance and "Hint mask"trick. The experimental results show that our GAN-based imputation methods can achieve the best performance. And our federated GAN imputation method outperforms the GAN imputation method trained locally for each participant which means our imputation model can work. Our proposed federated GAN method can benefit model quality by increasing access to air quality data through private multi-institutional collaborations. We further investigate the effects of data geographical distribution across collaborating participants on model quality and, interestingly, we find that the GAN training process with a federated learning framework performs more stable. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] GAIN: Missing Data Imputation using Generative Adversarial Nets
    Yoon, Jinsung
    Jordon, James
    van der Schaar, Mihaela
    [J]. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [2] Missing Slice Imputation in Population CMR Imaging via Conditional Generative Adversarial Nets
    Zhang, Le
    Pereanez, Marco
    Bowles, Christopher
    Piechnik, Stefan
    Neubauer, Stefan
    Petersen, Steffen
    Frangi, Alejandro
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT II, 2019, 11765 : 651 - 659
  • [3] LFM-D2GAIN: An Improved Missing Data Imputation Method Based on Generative Adversarial Imputation Nets
    Shen, Yebai
    Zhang, Chao
    Zhang, Songyu
    Yan, Jinghua
    Bu, Fanliang
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, BIG DATA AND ALGORITHMS (EEBDA), 2022, : 447 - 453
  • [4] Imputation of missing data with class imbalance using conditional generative adversarial networks
    Awan, Saqib Ejaz
    Bennamoun, Mohammed
    Sohel, Ferdous
    Sanfilippo, Frank
    Dwivedi, Girish
    [J]. NEUROCOMPUTING, 2021, 453 : 164 - 171
  • [5] Recovering from missing data in population imaging - Cardiac MR image imputation via conditional generative adversarial nets
    Xia, Yan
    Zhang, Le
    Ravikumar, Nishant
    Attar, Rahman
    Piechnik, Stefan K.
    Neubauer, Stefan
    Petersen, Steffen E.
    Frangi, Alejandro F.
    [J]. MEDICAL IMAGE ANALYSIS, 2021, 67
  • [6] Missing Data Imputation Method Combining Random Forest and Generative Adversarial Imputation Network
    Ou, Hongsen
    Yao, Yunan
    He, Yi
    [J]. SENSORS, 2024, 24 (04)
  • [7] Detracking Autoencoding Conditional Generative Adversarial Network: Improved Generative Adversarial Network Method for Tabular Missing Value Imputation
    Liu, Jingrui
    Duan, Zixin
    Hu, Xinkai
    Zhong, Jingxuan
    Yin, Yunfei
    [J]. ENTROPY, 2024, 26 (05)
  • [8] Improved generative adversarial imputation networks for missing data
    Qin, Xiwen
    Shi, Hongyu
    Dong, Xiaogang
    Zhang, Siqi
    Yuan, Liping
    [J]. APPLIED INTELLIGENCE, 2024, 54 (21) : 11068 - 11082
  • [9] A systematic review of generative adversarial imputation network in missing data imputation
    Yuqing Zhang
    Runtong Zhang
    Butian Zhao
    [J]. Neural Computing and Applications, 2023, 35 : 19685 - 19705
  • [10] A systematic review of generative adversarial imputation network in missing data imputation
    Zhang, Yuqing
    Zhang, Runtong
    Zhao, Butian
    [J]. NEURAL COMPUTING & APPLICATIONS, 2023, 35 (27): : 19685 - 19705