A comprehensive investigation of variational auto-encoders for population synthesis

Cited by: 0
Authors
Sane, Abdoul Razac [1]
Vandanjon, Pierre-Olivier [1]
Belaroussi, Rachid [2]
Hankach, Pierre [3]
Affiliations
[1] Univ Gustave Eiffel, AME SPLOTT, All Ponts & Chaussees, F-44340 Bouguenais, France
[2] Univ Gustave Eiffel, COSYS GRETTIA, 5 Bd Descartes, F-77420 Champs Sur Marne, France
[3] Univ Gustave Eiffel, MAST LAMES, All Ponts & Chaussees, F-44340 Bouguenais, France
Source
Keywords
Synthetic population; Machine learning; Deep generative model; Variational autoencoders; Sampling zeros; Structural zeros; BAYESIAN NETWORK; IMPACT; AGENT; AREA;
DOI
10.1007/s42001-024-00332-0
Chinese Library Classification number
O1 [Mathematics]; C [Social Sciences, General];
Discipline classification code
03 ; 0303 ; 0701 ; 070101 ;
Abstract
The use of synthetic populations has grown considerably in recent years, transforming studies in fields such as social science research, urban planning, public health and transportation modeling. Synthetic populations are valuable substitutes for real data, which are often missing or sensitive, and they can preserve both privacy and representativeness. They are typically constructed from aggregate and/or sample data. Recently, new methods for generating synthetic populations based on deep learning, notably Variational Autoencoders (VAEs), have been developed. Such methods serve to overcome the limitations of traditional methods such as Iterative Proportional Fitting (IPF), which cannot generate agents with cross-modalities not found in the sample data. As a result, IPF requires large samples to generate a synthetic population closely resembling the actual one. Conversely, the advantage of VAEs lies in their ability to generate agents not found in the sample data, albeit with the risk of creating agents that do not exist in the actual population. However, practical documentation and detailed analyses of the architectures and implementation results of these deep learning approaches, VAEs in particular, are limited, which makes the methods difficult for practitioners to adopt. This paper focuses on generating synthetic populations using VAEs. First, an in-depth and accessible theoretical explanation of how VAEs function is provided. Next, these methods are studied in detail by testing the various architectures, parameters, sample sizes and evaluation indicators necessary to ensure high-quality results. The study highlights the ability of VAEs to generate large datasets from a small training sample, as well as their performance in generating new, realistic individuals not present in the training data. Certain limitations are identified, including the difficulty VAEs have in handling numerical attributes and the need for post-processing to eliminate unrealistic individuals. In conclusion, despite these limitations, VAEs constitute a very promising methodology for generating synthetic populations and offer practitioners numerous advantages. This paper is accompanied by a Python notebook to help interested readers implement this methodology.
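The IPF limitation mentioned in the abstract (no agents with attribute combinations absent from the sample) can be illustrated with a minimal sketch. This is not the code from the paper's notebook: the `ipf` function, the two-attribute table, and all numbers below are hypothetical and chosen only for illustration. Because IPF only rescales the cells of the seed table, a cell that is zero in the sample stays zero after fitting.

```python
def ipf(seed, row_targets, col_targets, iterations=100):
    """Fit a 2-D seed table to row and column totals by alternate rescaling."""
    table = [row[:] for row in seed]  # copy so the seed is left untouched
    for _ in range(iterations):
        # Rescale each row so that it sums to its target total.
        for i, target in enumerate(row_targets):
            total = sum(table[i])
            if total > 0:
                table[i] = [cell * target / total for cell in table[i]]
        # Rescale each column so that it sums to its target total.
        for j, target in enumerate(col_targets):
            total = sum(row[j] for row in table)
            if total > 0:
                for row in table:
                    row[j] *= target / total
    return table


# Hypothetical sample counts: rows = age group (young, old),
# columns = car ownership (no car, car). No young car owner was sampled.
seed = [[4.0, 0.0],
        [3.0, 3.0]]
row_targets = [50.0, 50.0]  # assumed population totals per age group
col_targets = [60.0, 40.0]  # assumed population totals per ownership status

fitted = ipf(seed, row_targets, col_targets)
for row in fitted:
    print([round(cell, 2) for cell in row])
```

In this sketch the "young, car" cell remains exactly zero however many iterations are run, so no synthetic agent of that type can be drawn from the fitted table. Such a combination may well exist in the real population (a sampling zero), which is the gap a generative model such as a VAE is meant to address.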
Pages: 34