Learning-Based Adaptive Imputation Method with kNN Algorithm for Missing Power Data

被引:39
|
作者
Kim, Minkyung [1 ]
Park, Sangdon [1 ]
Lee, Joohyung [2 ]
Joo, Yongjae [3 ]
Choi, Jun Kyun [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Dept Elect Engn, Daejeon 34141, South Korea
[2] Gachon Univ, Dept Software, Seongnam 13120, South Korea
[3] Korea Elect Power Res Inst, Daejeon 305760, South Korea
基金
新加坡国家研究基金会;
关键词
missing data; power data; imputation; kNN algorithm; learning; smart meter; energy system;
D O I
10.3390/en10101668
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
This paper proposes a learning-based adaptive imputation method (LAI) for imputing missing power data in an energy system. This method estimates the missing power data by using the pattern that appears in the collected data. Here, in order to capture the patterns from past power data, we newly model a feature vector by using past data and its variations. The proposed LAI then learns the optimal length of the feature vector and the optimal historical length, which are significant hyper parameters of the proposed method, by utilizing intentional missing data. Based on a weighted distance between feature vectors representing a missing situation and past situation, missing power data are estimated by referring to the k most similar past situations in the optimal historical length. We further extend the proposed LAI to alleviate the effect of unexpected variation in power data and refer to this new approach as the extended LAI method (eLAI). The eLAI selects a method between linear interpolation (LI) and the proposed LAI to improve accuracy under unexpected variations. Finally, from a simulation under various energy consumption profiles, we verify that the proposed eLAI achieves about a 74% reduction of the average imputation error in an energy system, compared to the existing imputation methods.
引用
收藏
页数:20
相关论文
共 50 条
  • [21] Missing Data Imputation for Supervised Learning
    Poulos, Jason
    Valle, Rafael
    [J]. APPLIED ARTIFICIAL INTELLIGENCE, 2018, 32 (02) : 186 - 196
  • [22] Iterative KNN Imputation Based on GRA for Missing Values in TPLMS
    Zhu, Ming
    Cheng, Xingbing
    [J]. PROCEEDINGS OF 2015 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2015), 2015, : 94 - 99
  • [23] Imputation of missing precipitation data using KNN, SOM, RF, and FNN
    Abinash Sahoo
    Dillip Kumar Ghose
    [J]. Soft Computing, 2022, 26 : 5919 - 5936
  • [24] Missing Data Imputation based on Unsupervised Simple Competitive Learning
    Lee, Byoung Jik
    [J]. PROCEEDINGS OF THE 9TH WSEAS INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, KNOWLEDGE ENGINEERING AND DATA BASES, 2010, : 292 - +
  • [25] Imputation of missing precipitation data using KNN, SOM, RF, and FNN
    Sahoo, Abinash
    Ghose, Dillip Kumar
    [J]. SOFT COMPUTING, 2022, 26 (12) : 5919 - 5936
  • [26] Machine Learning Based Missing Data Imputation in Categorical Datasets
    Ishaq, Muhammad
    Zahir, Sana
    Iftikhar, Laila
    Bulbul, Mohammad Farhad
    Rho, Seungmin
    Lee, Mi Young
    [J]. IEEE ACCESS, 2024, 12 : 88332 - 88344
  • [27] Missing data completion method based on KNN and Random Forest
    Zhang, Songyu
    Zhou, Yuchen
    Yan, Jinghua
    Bu, Fanliang
    [J]. SECOND IYSF ACADEMIC SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND COMPUTER ENGINEERING, 2021, 12079
  • [28] Dimensional Data KNN-Based Imputation
    Yang, Yuzhao
    Darmont, Jerome
    Ravat, Franck
    Teste, Olivier
    [J]. ADVANCES IN DATABASES AND INFORMATION SYSTEMS, ADBIS 2022, 2022, 13389 : 315 - 329
  • [29] Imputation method for missing data based on clustering and measure of property
    Kim, Sunghyun
    Kim, Dongjae
    [J]. KOREAN JOURNAL OF APPLIED STATISTICS, 2018, 31 (01) : 29 - 40
  • [30] A genetic algorithm for multivariate missing data imputation
    Carlos Figueroa-Garcia, Juan
    Neruda, Roman
    Hernandez-Perez, German
    [J]. INFORMATION SCIENCES, 2023, 619 : 947 - 967