Multiscale-attention masked autoencoder for missing data imputation of wind turbines

被引:0
|
作者
Fan, Yuwei [1 ]
Feng, Chenlong [1 ]
Wu, Rui [1 ]
Liu, Chao [1 ]
Jiang, Dongxiang [1 ,2 ]
机构
[1] Tsinghua Univ, Dept Energy & Power Engn, Beijing 100084, Peoples R China
[2] Tsinghua Univ, State Key Lab Control & Simulat Power Syst & Gener, Beijing 100084, Peoples R China
关键词
Renewable energy systems; Missing data imputation; Masked autoencoder; Multiscale attention; Feature combination;
D O I
10.1016/j.knosys.2024.112114
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High-quality data is essential for effective operation and maintenance of wind farms. However, data missing is a persistent issue in the supervisory control and data acquisition (SCADA) system, which seriously affects the data quality. To tackle the two limitations of current missing data imputation methods: the gap between training tasks and imputation tasks, and the inadequate extraction of correlations within SCADA data, this work proposes a data-driven framework named multiscale-attention masked autoencoder (MAMAE) for missing data imputation of wind turbines. The MAMAE employs masked autoencoding as a self-supervised training method, bridging the gap between the training and imputing task. Additionally, considering the importance of correlations in imputation for the SCADA data, a multiscale attention architecture built upon transformer is employed. Comprising four transformer stages, each applying attention mechanisms at distinct scales, the multiscale attention efficiently extracts feature, turbine, and temporal correlations. To ameliorate the problem of large computation cost caused by increased sequence length in different scales, localized attention is implemented in shifted windows, reducing the computational complexity from quadratic to a linear relationship with the sequence length. Furthermore, a turbine correlation-based feature combination method is proposed to coordinate with the multiscale attention and introduce turbine correlations into the imputation process. Experiments were conducted on a SCADA dataset collected in a real-world wind farm. The results show that the proposed method achieves higher accuracy than existing methods in most cases (especially in the cases with band missing and feature missing) and the ablation experiments verify the effectiveness of each proposed modification in improving accuracy or efficiency.
引用
收藏
页数:28
相关论文
共 47 条
  • [1] Masked Autoencoder Transformer for Missing Data Imputation of PISA
    Freire, Guilherme Mendonca
    Curi, Mariana
    [J]. ARTIFICIAL INTELLIGENCE IN EDUCATION: POSTERS AND LATE BREAKING RESULTS, WORKSHOPS AND TUTORIALS, INDUSTRY AND INNOVATION TRACKS, PRACTITIONERS, DOCTORAL CONSORTIUM AND BLUE SKY, AIED 2024, PT I, 2024, 2150 : 364 - 372
  • [2] Denoising Masked Autoencoder-Based Missing Imputation within Constrained Environments for Electric Load Data
    Jeong, Jaeik
    Ku, Tai-Yeon
    Park, Wan-Ki
    [J]. ENERGIES, 2023, 16 (24)
  • [3] Spatiotemporal Generative Adversarial Imputation Networks: An Approach to Address Missing Data for Wind Turbines
    Hu, Xuguang
    Zhan, Zhaokang
    Ma, Dazhong
    Zhang, Siqi
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [4] Siamese Autoencoder Architecture for the Imputation of Data Missing Not at Random
    Pereira, Ricardo Cardoso
    Abreu, Pedro Henriques
    Rodrigues, Pedro Pereira
    [J]. JOURNAL OF COMPUTATIONAL SCIENCE, 2024, 78
  • [5] Imputation of Missing Values in Training Data using Variational Autoencoder
    Hong, Xuerui
    Hao, Shuang
    [J]. 2023 IEEE 39TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS, ICDEW, 2023, : 49 - 54
  • [6] MISSING DATA IN TRAFFIC ESTIMATION: A VARIATIONAL AUTOENCODER IMPUTATION METHOD
    Boquet, Guillem
    Lopez Vicario, Jose
    Morell, Antoni
    Serrano, Javier
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 2882 - 2886
  • [7] A Two-Stage Deep Autoencoder-Based Missing Data Imputation Method for Wind Farm SCADA Data
    Liu, Xin
    Zhang, Zijun
    [J]. IEEE SENSORS JOURNAL, 2021, 21 (09) : 10933 - 10945
  • [8] Missing Data Imputation With OLS-Based Autoencoder for Intelligent Manufacturing
    Wang, Yanxia
    Li, Kang
    Gan, Shaojun
    Cameron, Che
    [J]. IEEE TRANSACTIONS ON INDUSTRY APPLICATIONS, 2019, 55 (06) : 7219 - 7229
  • [9] Autoencoder imputation of missing heterogeneous data for Alzheimer's disease classification
    Haridas, Namitha Thalekkara
    Sanchez-Bornot, Jose M.
    McClean, Paula L.
    Wong-Lin, KongFatt
    [J]. HEALTHCARE TECHNOLOGY LETTERS, 2024, : 452 - 460
  • [10] Multivariate Time Series Missing Data Imputation Using Recurrent Denoising Autoencoder
    Zhang, Jianye
    Yin, Peng
    [J]. 2019 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2019, : 760 - 764