Learning-Based Adaptive Imputation Method with kNN Algorithm for Missing Power Data

被引:39
|
作者
Kim, Minkyung [1 ]
Park, Sangdon [1 ]
Lee, Joohyung [2 ]
Joo, Yongjae [3 ]
Choi, Jun Kyun [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Dept Elect Engn, Daejeon 34141, South Korea
[2] Gachon Univ, Dept Software, Seongnam 13120, South Korea
[3] Korea Elect Power Res Inst, Daejeon 305760, South Korea
基金
新加坡国家研究基金会;
关键词
missing data; power data; imputation; kNN algorithm; learning; smart meter; energy system;
D O I
10.3390/en10101668
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
This paper proposes a learning-based adaptive imputation method (LAI) for imputing missing power data in an energy system. This method estimates the missing power data by using the pattern that appears in the collected data. Here, in order to capture the patterns from past power data, we newly model a feature vector by using past data and its variations. The proposed LAI then learns the optimal length of the feature vector and the optimal historical length, which are significant hyper parameters of the proposed method, by utilizing intentional missing data. Based on a weighted distance between feature vectors representing a missing situation and past situation, missing power data are estimated by referring to the k most similar past situations in the optimal historical length. We further extend the proposed LAI to alleviate the effect of unexpected variation in power data and refer to this new approach as the extended LAI method (eLAI). The eLAI selects a method between linear interpolation (LI) and the proposed LAI to improve accuracy under unexpected variations. Finally, from a simulation under various energy consumption profiles, we verify that the proposed eLAI achieves about a 74% reduction of the average imputation error in an energy system, compared to the existing imputation methods.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] Missing Data Imputation for Geolocation-based Price Prediction Using KNN MCF Method
    Sanjar, Karshiev
    Bekhzod, Olimov
    Kim, Jaesoo
    Paul, Anand
    Kim, Jeonghong
    [J]. ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2020, 9 (04)
  • [2] Imputation Method of Missing Values for Dissolved Gas Analysis Data Based on Iterative KNN and XGBoost
    Qiao, Lin
    Ran, Ran
    Wu, He
    Zhou, Qiaoni
    Liu, Sai
    Liu, Yunfei
    [J]. 2018 INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND ARTIFICIAL INTELLIGENCE (ACAI 2018), 2018,
  • [3] A deep learning-based imputation method for missing gaps in satellite aerosol products by fusing numerical model data
    Liu, Ning
    Li, Yi
    Zang, Zengliang
    Hu, Yiwen
    Fang, Xin
    Lolli, Simone
    [J]. ATMOSPHERIC ENVIRONMENT, 2024, 325
  • [4] The Optimal Machine Learning-Based Missing Data Imputation for the Cox Proportional Hazard Model
    Guo, Chao-Yu
    Yang, Ying-Chen
    Chen, Yi-Hau
    [J]. FRONTIERS IN PUBLIC HEALTH, 2021, 9
  • [5] A Machine Learning-Based Missing Data Imputation with FHIR Interoperability Approach in Sepsis Prediction
    Toro Beltran, Cristian Fernando
    Villarreal Ibanez, Erick Daniel
    Milen Orejuela, Vivian
    Garcia Henao, John Anderson
    [J]. HIGH PERFORMANCE COMPUTING, CARLA 2022, 2022, 1660 : 116 - 130
  • [6] Cluster-based KNN Missing Value Imputation for DNA Microarray Data
    Keerin, Phimmarin
    Kurutach, Werasak
    Boongoen, Tossapon
    [J]. PROCEEDINGS 2012 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2012, : 445 - 450
  • [7] Missing Data Imputation using Machine Learning Algorithm for Supervised Learning
    Cenitta, D.
    Arjunan, R. Vijaya
    Prema, K., V
    [J]. 2021 INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND INFORMATICS (ICCCI), 2021,
  • [8] Machine learning-based imputation soft computing approach for large missing scale and non-reference data imputation
    Alamoodi, A. H.
    Zaidan, B. B.
    Zaidan, A. . A. .
    Albahri, O. S.
    Chen, Juliana
    Chyad, M. A.
    Garfan, Salem
    Aleesa, A. M.
    [J]. CHAOS SOLITONS & FRACTALS, 2021, 151
  • [9] Pavement Missing Condition Data Imputation through Collective Learning-Based Graph Neural Networks
    Yu, Ke
    Gao, Lu
    [J]. INTERNATIONAL CONFERENCE ON TRANSPORTATION AND DEVELOPMENT 2023: TRANSPORTATION PLANNING, OPERATIONS, AND TRANSIT, 2023, : 416 - 423
  • [10] MIDA: a Web Tool for MIssing DAta Imputation based on a Boosted and Incremental Learning Algorithm
    Acampora, Giovanni
    Vitiello, Autilia
    Siciliano, Roberta
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2020,