Towards Generating Real-World Time Series Data

被引:15
|
作者
Pei, Hengzhi [1 ,2 ]
Ren, Kan [2 ]
Yang, Yuqing [2 ]
Liu, Chang [3 ]
Qin, Tao [3 ]
Li, Dongsheng [2 ]
机构
[1] Univ Illinois, Urbana, IL USA
[2] Microsoft Res Asia, Shanghai, Peoples R China
[3] Microsoft Res Asia, Beijing, Peoples R China
关键词
Time series; data generation; missing values;
D O I
10.1109/ICDM51629.2021.00058
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Time series data generation has drawn increasing attention in recent years. Several generative adversarial network (GAN) based methods have been proposed to tackle the problem usually with the assumption that the targeted time series data are well-formatted and complete. However, real-world time series (RTS) data are far away from this utopia, e.g., long sequences with variable lengths and informative missing data raise intractable challenges for designing powerful generation algorithms. In this paper, we propose a novel generative framework for RTS data - RTSGAN to tackle the aforementioned challenges. RTSGAN first learns an encoder-decoder module which provides a mapping between a time series instance and a fixed-dimension latent vector and then learns a generation module to generate vectors in the same latent space. By combining the generator and the decoder, RTSGAN is able to generate RTS which respect the original feature distributions and the temporal dynamics. To generate time series with missing values, we further equip RTSGAN with an observation embedding layer and a decide-and-generate decoder to better utilize the informative missing patterns. Experiments on the four RTS datasets show that the proposed framework outperforms the previous generation methods in terms of synthetic data utility for downstream classification and prediction tasks. Our code is available at https://seqml.github.io/rtsgan.
引用
收藏
页码:469 / 478
页数:10
相关论文
共 50 条
  • [1] Towards Theory for Real-World Data
    Martens, Wim
    PROCEEDINGS OF THE 41ST ACM SIGMOD-SIGACT-SIGAI SYMPOSIUM ON PRINCIPLES OF DATABASE SYSTEMS (PODS '22), 2022, : 261 - 276
  • [2] Time Series Prediction Methodology and Ensemble Model Using Real-World Data
    Kim, Mintai
    Lee, Sungju
    Jeong, Taikyeong
    ELECTRONICS, 2023, 12 (13)
  • [3] TOWARDS MEASURING REAL-WORLD FORGETTING IN REAL-TIME
    CROVITZ, HF
    BULLETIN OF THE PSYCHONOMIC SOCIETY, 1982, 20 (03) : 133 - 133
  • [4] Towards the integration of real-time real-world data in urban search and rescue simulation
    Kenn, Holger
    Kleiner, Alexander
    MOBILE RESPONSE, 2007, 4458 : 106 - 115
  • [5] Benefits of Mandated Registries for Generating Real-World Outcome Data
    Salminen, Paulina
    Stenberg, Erik
    Batterham, Rachel
    JAMA SURGERY, 2023, 158 (08) : 824 - 824
  • [6] Generating and using real-world data: A worthwhile uphill battle
    Verkerk, K.
    Voest, E. E.
    CELL, 2024, 187 (07) : 1636 - 1650
  • [7] Real-World Battles with Real-World Data
    Brown, Jeffrey
    Bate, Andrew
    Platt, Robert
    Raebel, Marsha
    Sauer, Brian
    Trifiro, Gianluca
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2017, 26 : 254 - 255
  • [8] Towards Machine Learning with Zero Real-World Data
    Kang, Cholmin
    Jung, Hyunwoo
    Lee, Youngki
    WEARSYS'19: PROCEEDINGS OF THE 5TH ACM WORKSHOP ON WEARABLE SYSTEMS AND APPLICATIONS, 2019, : 41 - 46
  • [9] Real-world study: from real-world data to real-world evidence
    Wen, Yi
    TRANSLATIONAL BREAST CANCER RESEARCH, 2020, 1
  • [10] The City Brain: Towards Real-Time Search for the Real-World
    Hua, Xian-Sheng
    ACM/SIGIR PROCEEDINGS 2018, 2018, : 1343 - 1344