Outlier detection and data filling based on KNN and LOF for power transformer operation data classification

被引:12
|
作者
Zou, Dexu [1 ]
Xiang, Yongjian [2 ]
Zhou, Tao [2 ]
Peng, Qingjun [1 ]
Dai, Weiju [1 ]
Hong, Zhihu [1 ]
Shi, Yong [3 ]
Wang, Shan [1 ]
Yin, Jianhua [4 ]
Quan, Hao [2 ]
机构
[1] China Southern Power Grid Yunnan Power Grid Co Lt, Elect Power Res Inst, Kunming 650217, Yunnan, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Automat, Nanjing 210094, Peoples R China
[3] China Southern Power Grid Yunnan Power Grid Co Lt, Kunming 650217, Yunnan, Peoples R China
[4] Nanjing Univ Informat Sci & Technol, Sch Artificial Intelligence, Nanjing 210044, Peoples R China
关键词
Power transformer; Outlier detection; Data sufficiency; LOF; KNN;
D O I
10.1016/j.egyr.2023.04.094
中图分类号
TE [石油、天然气工业]; TK [能源与动力工程];
学科分类号
0807 ; 0820 ;
摘要
The missing and abnormal data in power transformer operation and monitoring greatly affect the accuracy of fault diagnosis and thus threaten the stable operation of power systems. To conduct outlier detection and improve data quality for safety warning, this paper proposes a transformer operation data preprocessing method based on KNN (K-nearest neighbor) and LOF (local outlier factor) for power transformer operation data classification. Firstly, this paper analyzes the characteristics of transformer operation data. Secondly, the local reachable density of the input data is calculated by LOF algorithm. The local outlier factor score of the data is derived according to the local reachable density, and the abnormal data is output according to the abnormal score. Then, KNN algorithm is utilized to classify the relevant data around the abnormal value and missing value of the transformer. The data are filled or corrected according to the classification results. Thirdly, the elbow method is used to determine the optimal K value and cluster operation data by K-Means algorithm. Finally, the proposed method is applied and verified with real transformer operation data in case study. The results show the method can effectively detect and correct the abnormal and missing data, conduct transformer data cleaning and preprocessing and provide accurate and effective data samples for transformer fault diagnosis. (c) 2023 The Authors. Published by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:698 / 711
页数:14
相关论文
共 50 条
  • [1] Outlier detection for on-line monitoring data of transformer based on wavelet transform and weighted LOF
    Qin, J. F.
    Yang, Y.
    Du, H. Y.
    Hong, Z. J.
    2019 INTERNATIONAL CONFERENCE ON NEW ENERGY AND FUTURE ENERGY SYSTEM, 2019, 354
  • [2] Outlier detection method for thermal process data based on EWT-LOF
    Dong Z.
    Jia H.
    Yi Qi Yi Biao Xue Bao/Chinese Journal of Scientific Instrument, 2020, 41 (02): : 126 - 134
  • [3] Robust outlier detection in geo-spatial data based on LOLIMOT and KNN search
    Mohammadreza Tabatabaei
    Roohollah Kimiaefar
    Alireza Hajian
    Alireza Akbari
    Earth Science Informatics, 2021, 14 : 1065 - 1072
  • [4] Robust outlier detection in geo-spatial data based on LOLIMOT and KNN search
    Tabatabaei, Mohammadreza
    Kimiaefar, Roohollah
    Hajian, Alireza
    Akbari, Alireza
    EARTH SCIENCE INFORMATICS, 2021, 14 (02) : 1065 - 1072
  • [5] KNN-Based Approximate Outlier Detection Algorithm Over IoT Streaming Data
    Zhu, Rui
    Ji, Xiaoling
    Yu, Danyang
    Tan, Zhiyuan
    Zhao, Liang
    Li, Jiajia
    Xia, Xiufeng
    IEEE ACCESS, 2020, 8 : 42749 - 42759
  • [6] Enhanced-KNN (M-KNN) based outlier detection and sensor data aggregation for large data streams in the IoT-cloud
    Kumar, Y. R. Sampath
    Champa, H. N.
    INTERNATIONAL JOURNAL OF INTERNET PROTOCOL TECHNOLOGY, 2022, 15 (3-4) : 236 - 249
  • [7] Outlier detection and missing data filling methods for coastal water temperature data
    Cho, Hong Yeon
    Oh, Ji Hee
    Kim, Kyeong Ok
    Shim, Jae Seol
    JOURNAL OF COASTAL RESEARCH, 2013, : 1898 - 1903
  • [8] Outlier Detection for Load Monitoring Data of Transformer Based on Kernel-Density-Fluctuation Outlier Factor
    Wang, Hongbin
    Mo, Wenxiong
    Luan, Le
    Cui, Yiping
    2020 35TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2020, : 650 - 655
  • [9] Grid-Based and Outlier Detection-Based Data Clustering and Classification
    Cho, Kyu Cheol
    Lee, Jong Sik
    UBIQUITOUS COMPUTING AND MULTIMEDIA APPLICATIONS, PT I, 2011, 150 : 129 - 138
  • [10] Grid-based & Outlier Detection-based Data Clustering & Classification
    Cho, Kyu Cheol
    Lee, Jong Sik
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2012, 15 (03): : 1253 - 1266