Multiscale-attention masked autoencoder for missing data imputation of wind turbines

Cited by: 0
Authors
Fan, Yuwei [1]
Feng, Chenlong [1]
Wu, Rui [1]
Liu, Chao [1]
Jiang, Dongxiang [1, 2]
Affiliations
[1] Tsinghua Univ, Dept Energy & Power Engn, Beijing 100084, Peoples R China
[2] Tsinghua Univ, State Key Lab Control & Simulat Power Syst & Gener, Beijing 100084, Peoples R China
Keywords
Renewable energy systems; Missing data imputation; Masked autoencoder; Multiscale attention; Feature combination
DOI
10.1016/j.knosys.2024.112114
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
High-quality data is essential for the effective operation and maintenance of wind farms. However, missing data is a persistent issue in supervisory control and data acquisition (SCADA) systems and seriously degrades data quality. Current missing data imputation methods have two limitations: a gap between the training task and the imputation task, and inadequate extraction of correlations within SCADA data. To address them, this work proposes a data-driven framework named multiscale-attention masked autoencoder (MAMAE) for missing data imputation of wind turbines. MAMAE employs masked autoencoding as a self-supervised training method, bridging the gap between the training and imputation tasks. Additionally, because correlations are important when imputing SCADA data, a multiscale attention architecture built upon the transformer is employed. It comprises four transformer stages, each applying attention at a distinct scale, and efficiently extracts feature, turbine, and temporal correlations. To reduce the high computational cost caused by the increased sequence length at different scales, localized attention is implemented in shifted windows, reducing the computational complexity from quadratic to linear in the sequence length. Furthermore, a turbine correlation-based feature combination method is proposed to work with the multiscale attention and introduce turbine correlations into the imputation process. Experiments were conducted on a SCADA dataset collected from a real-world wind farm. The results show that the proposed method achieves higher accuracy than existing methods in most cases (especially with band missing and feature missing), and the ablation experiments verify the effectiveness of each proposed modification in improving accuracy or efficiency.
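The two mechanisms named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names (`random_mask`, `masked_mse`, `attention_pairs`), the uniform random masking strategy, and the plain non-shifted window partition are all assumptions made for illustration. The first two functions show the general idea of masked-autoencoding training (hide some observed values, score the reconstruction only on the hidden ones); the third counts query-key pairs to show why attention restricted to fixed-size windows scales linearly with sequence length.

```python
import numpy as np


def random_mask(observed, mask_ratio, rng):
    """Artificially hide a random fraction of the observed entries.

    observed: boolean array, True where a value was actually recorded.
    Returns a boolean array that is True where an observed value is hidden.
    Uniform random masking is an assumption, not the paper's strategy.
    """
    hidden = np.zeros_like(observed, dtype=bool)
    idx = np.flatnonzero(observed)
    n_hide = int(round(mask_ratio * idx.size))
    chosen = rng.choice(idx, size=n_hide, replace=False)
    hidden.flat[chosen] = True
    return hidden


def masked_mse(prediction, target, hidden):
    """Mean squared error computed only on the artificially hidden entries."""
    diff = (prediction - target)[hidden]
    return float(np.mean(diff ** 2))


def attention_pairs(seq_len, window=None):
    """Number of query-key pairs scored by attention.

    window=None means global attention (quadratic in seq_len).
    Otherwise the sequence is split into non-overlapping windows of the
    given size (assumes seq_len is divisible by window), which gives
    seq_len * window pairs, i.e. linear in seq_len for a fixed window.
    """
    if window is None:
        return seq_len * seq_len
    n_windows = seq_len // window
    return n_windows * window * window
```

For a fixed window size, doubling the sequence length doubles `attention_pairs(seq_len, window)` but quadruples `attention_pairs(seq_len)`, which is the quadratic-to-linear reduction the abstract describes.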
Pages: 28
Related papers
47 in total
  • [31] Data management for structural integrity assessment of offshore wind turbine support structures: data cleansing and missing data imputation
    Martinez-Luengo, Maria
    Shafiee, Mahmood
    Kolios, Athanasios
    OCEAN ENGINEERING, 2019, 173 : 867 - 883
  • [32] Study of Missing Value Imputation in Wind Turbine Data Based on Multivariate Spatiotemporal Integration Network
    Zhan Z.-K.
    Hu X.-G.
    Zhao H.-R.
    Zhang S.-Q.
    Zhang J.-K.
    Ma D.-Z.
    Zidonghua Xuebao/Acta Automatica Sinica, 2024, 50 (06) : 1171 - 1184
  • [33] Anomaly Prediction for Wind Turbines Using an Autoencoder with Vibration Data Supported by Power-Curve Filtering
    Takanashi, Masaki
    Sato, Shu-ichi
    Indo, Kentaro
    Nishihara, Nozomu
    Hayashi, Hiroki
    Suzuki, Toru
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (03) : 732 - 735
  • [34] A variational autoencoder solution for road traffic forecasting systems: Missing data imputation, dimension reduction, model selection and anomaly detection
    Boquet, Guillem
    Morell, Antoni
    Serrano, Javier
    Lopez Vicario, Jose
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 115
  • [35] Multi-scale variational autoencoder for imputation of missing values in untargeted metabolomics using whole-genome sequencing data
    Zhao C.
    Su K.-J.
    Wu C.
    Cao X.
    Sha Q.
    Li W.
    Luo Z.
    Qing T.
    Qiu C.
    Zhao L.J.
    Liu A.
    Jiang L.
    Zhang X.
    Shen H.
    Zhou W.
    Deng H.-W.
    Computers in Biology and Medicine, 2024, 179
  • [36] A Deep Learning Approach for Missing Data Imputation of Rating Scales Assessing Attention-Deficit Hyperactivity Disorder
    Cheng, Chung-Yuan
    Tseng, Wan-Ling
    Chang, Ching-Fen
    Chang, Chuan-Hsiung
    Gau, Susan Shur-Fen
    FRONTIERS IN PSYCHIATRY, 2020, 11
  • [37] A hybrid model for missing traffic flow data imputation based on clustering and attention mechanism optimizing LSTM and AdaBoost
    Qiang Shang
    Yingping Tang
    Longjiao Yin
    Scientific Reports, 14 (1)
  • [38] Missing Data Imputation for Online Monitoring of Power Equipment Based on Self-attention Generative Adversarial Networks
    Zhou Y.
    Lin M.
    Chen J.
    Bai Z.
    Chen M.
    Gaodianya Jishu/High Voltage Engineering, 2023, 49 (05) : 1795 - 1809
  • [39] Imputation of missing data from offshore wind farms using spatio-temporal correlation and feature correlation
    Sun, Chuan
    Chen, Yueyi
    Cheng, Cheng
    ENERGY, 2021, 229
  • [40] STA-GAN: A Spatio-Temporal Attention Generative Adversarial Network for Missing Value Imputation in Satellite Data
    Wang, Shuyu
    Li, Wengen
    Hou, Siyun
    Guan, Jihong
    Yao, Jiamin
    REMOTE SENSING, 2023, 15 (01)