Outlier detection and data filling based on KNN and LOF for power transformer operation data classification

被引:12
|
作者
Zou, Dexu [1 ]
Xiang, Yongjian [2 ]
Zhou, Tao [2 ]
Peng, Qingjun [1 ]
Dai, Weiju [1 ]
Hong, Zhihu [1 ]
Shi, Yong [3 ]
Wang, Shan [1 ]
Yin, Jianhua [4 ]
Quan, Hao [2 ]
机构
[1] China Southern Power Grid Yunnan Power Grid Co Lt, Elect Power Res Inst, Kunming 650217, Yunnan, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Automat, Nanjing 210094, Peoples R China
[3] China Southern Power Grid Yunnan Power Grid Co Lt, Kunming 650217, Yunnan, Peoples R China
[4] Nanjing Univ Informat Sci & Technol, Sch Artificial Intelligence, Nanjing 210044, Peoples R China
关键词
Power transformer; Outlier detection; Data sufficiency; LOF; KNN;
D O I
10.1016/j.egyr.2023.04.094
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
The missing and abnormal data in power transformer operation and monitoring greatly affect the accuracy of fault diagnosis and thus threaten the stable operation of power systems. To conduct outlier detection and improve data quality for safety warning, this paper proposes a transformer operation data preprocessing method based on KNN (K-nearest neighbor) and LOF (local outlier factor) for power transformer operation data classification. Firstly, this paper analyzes the characteristics of transformer operation data. Secondly, the local reachable density of the input data is calculated by LOF algorithm. The local outlier factor score of the data is derived according to the local reachable density, and the abnormal data is output according to the abnormal score. Then, KNN algorithm is utilized to classify the relevant data around the abnormal value and missing value of the transformer. The data are filled or corrected according to the classification results. Thirdly, the elbow method is used to determine the optimal K value and cluster operation data by K-Means algorithm. Finally, the proposed method is applied and verified with real transformer operation data in case study. The results show the method can effectively detect and correct the abnormal and missing data, conduct transformer data cleaning and preprocessing and provide accurate and effective data samples for transformer fault diagnosis. (c) 2023 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:698 / 711
页数:14
相关论文
共 50 条
  • [21] Approximate kNN Classification for Biomedical Data
    Anagnostou, Panagiotis
    Barbas, Petros
    Vrahatis, Aristidis G.
    Tasoulis, Sotiris K.
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 3602 - 3607
  • [22] How does dimensionality influence outlier detection effectiveness in multivariate geochemical data? insights from LOF and IF methods
    Shahed Shahrestani
    Ioan Sanislav
    Earth Science Informatics, 2025, 18 (1)
  • [23] Improved Outlier Detection Method of Power Consumer Data Based on Gaussian Kernel Function
    Sun Y.
    Li S.
    Cui C.
    Li B.
    Chen S.
    Cui G.
    Dianwang Jishu/Power System Technology, 2018, 42 (05): : 1595 - 1604
  • [24] Health index for power transformer condition assessment based on operation history and test data
    Guo, Hong
    Guo, Lei
    ENERGY REPORTS, 2022, 8 : 9038 - 9045
  • [25] Outlier detection in interval data
    A. Pedro Duarte Silva
    Peter Filzmoser
    Paula Brito
    Advances in Data Analysis and Classification, 2018, 12 : 785 - 822
  • [26] Outlier detection in astronomical data
    Zhang, YX
    Luo, A
    Zhao, YH
    OPTIMIZING SCIENTIFIC RETURN FOR ASTRONOMY THROUGH INFORMATION TECHNOLOGIES, 2004, 5493 : 521 - 529
  • [27] Outlier detection for skewed data
    Hubert, Mia
    Van der Veeken, Stephan
    JOURNAL OF CHEMOMETRICS, 2008, 22 (3-4) : 235 - 246
  • [28] Outlier detection in skewed data
    Meropi, Pavlidou
    Bikos, Christoforos
    George, Zioutas
    SIMULATION MODELLING PRACTICE AND THEORY, 2018, 87 : 191 - 209
  • [29] Outlier detection in interval data
    Duarte Silva, A. Pedro
    Filzmoser, Peter
    Brito, Paula
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2018, 12 (03) : 785 - 822
  • [30] Outlier detection in transactional data
    Dash, Manoranjan
    Lie, Ng Wil
    INTELLIGENT DATA ANALYSIS, 2010, 14 (03) : 283 - 298