Towards Generating Real-World Time Series Data

Cited by: 15
Authors
Pei, Hengzhi [1, 2]
Ren, Kan [2]
Yang, Yuqing [2]
Liu, Chang [3]
Qin, Tao [3]
Li, Dongsheng [2]
Affiliations
[1] University of Illinois, Urbana, IL, USA
[2] Microsoft Research Asia, Shanghai, People's Republic of China
[3] Microsoft Research Asia, Beijing, People's Republic of China
Keywords
Time series; data generation; missing values
DOI
10.1109/ICDM51629.2021.00058
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Time series data generation has drawn increasing attention in recent years. Several generative adversarial network (GAN) based methods have been proposed to tackle the problem, usually under the assumption that the target time series data are well-formatted and complete. However, real-world time series (RTS) data are far from this ideal: long sequences with variable lengths and informative missing data raise intractable challenges for designing powerful generation algorithms. In this paper, we propose RTSGAN, a novel generative framework for RTS data, to tackle these challenges. RTSGAN first learns an encoder-decoder module that provides a mapping between a time series instance and a fixed-dimension latent vector, and then learns a generation module to generate vectors in the same latent space. By combining the generator and the decoder, RTSGAN is able to generate RTS that respect the original feature distributions and temporal dynamics. To generate time series with missing values, we further equip RTSGAN with an observation embedding layer and a decide-and-generate decoder to better utilize the informative missing patterns. Experiments on four RTS datasets show that the proposed framework outperforms previous generation methods in terms of synthetic data utility for downstream classification and prediction tasks. Our code is available at https://seqml.github.io/rtsgan.
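The data flow described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: every function here (`encode`, `generate_latent`, `decode`), the constant `LATENT_DIM`, and the hand-picked summary statistics are hypothetical stand-ins for learned neural modules. The sketch only shows the contract between the parts: a variable-length series with missing values maps to a fixed-dimension latent vector, a generator produces vectors in that same space, and a decoder first decides whether each step is observed and only then generates a value.

```python
import random

LATENT_DIM = 4  # hypothetical size of the fixed-dimension latent vector


def encode(series):
    """Toy stand-in for a learned encoder (assumption, not the paper's model).

    Pools a variable-length series (list of floats, None = missing value)
    into exactly LATENT_DIM summary numbers.
    """
    observed = [x for x in series if x is not None]
    n = len(series)
    mean = sum(observed) / len(observed) if observed else 0.0
    spread = (max(observed) - min(observed)) if observed else 0.0
    observed_rate = len(observed) / n if n else 0.0
    return [mean, spread, observed_rate, float(n)]


def generate_latent(rng):
    """Toy stand-in for a learned generator: sample a latent vector from noise."""
    return [
        rng.gauss(0.0, 1.0),           # mean level
        abs(rng.gauss(0.0, 1.0)),      # spread
        rng.uniform(0.2, 1.0),         # fraction of observed steps
        float(rng.randint(3, 8)),      # sequence length
    ]


def decode(latent, rng):
    """Toy decide-and-generate decoder.

    At each step it first decides whether the value is observed, and only
    generates a value if it is; otherwise it emits None (missing).
    """
    mean, spread, observed_rate, length = latent
    out = []
    for _ in range(int(length)):
        if rng.random() < observed_rate:                        # decide
            out.append(mean + rng.uniform(-0.5, 0.5) * spread)  # generate
        else:
            out.append(None)
    return out


rng = random.Random(0)
real = [0.1, None, 0.4, 0.3, None]   # a short series with missing values
z_real = encode(real)                # real series -> latent vector
z_fake = generate_latent(rng)        # noise -> vector in the same latent space
synthetic = decode(z_fake, rng)      # latent vector -> synthetic series
print(len(z_real), len(z_fake), synthetic)
```

Because both `encode` and `generate_latent` return vectors of the same length, the decoder can be applied to either one, which is the property that lets a generator and a decoder be combined in a framework of this kind.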
Pages: 469-478
Page count: 10