An Incorrect Data Detection Method for Big Data Cleaning of Machinery Condition Monitoring

被引:99
|
作者
Xu, Xuefang [1 ]
Lei, Yaguo [1 ]
Li, Zeda [1 ]
机构
[1] Xi An Jiao Tong Univ, Key Lab Educ, Minist Modern Design & Rotor Bearing Syst, Xian 710049, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Condition-monitoring big data; data cleaning; data quality; incorrect data; local outlier factor (LOF); OUTLIER DETECTION; NETWORK;
D O I
10.1109/TIE.2019.2903774
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The presence of incorrect data leads to the decrease of condition-monitoring big data quality. As a result, unreliable or misleading results are probably obtained by analyzing these poor-quality data. In this paper, to improve the data quality, an incorrect data detection method based on an improved local outlier factor (LOF) is proposed for data cleaning. First, a sliding window technique is used to divide data into different segments. These segments are considered as different objects and their attributes consist of time-domain statistical features extracted from each segment, such as mean, maximum and peak-to-peak value. Second, a kernel-based LOF (KLOF) is calculated using these attributes to evaluate the degree of each segment being incorrect data. Third, according to these KLOF values and a threshold value, incorrect data are detected. Finally, a simulation of vibration data generated by a defective rolling element bearing and three real cases concerning a fixed-axle gearbox, a wind turbine, and a planetary gearbox are used to verify the effectiveness of the proposed method, respectively. The results demonstrate that the proposed method is able to detect both missing segments and abnormal segments, which are two typical incorrect data, effectively, and thus is helpful for big data cleaning of machinery condition monitoring.
引用
收藏
页码:2326 / 2336
页数:11
相关论文
共 50 条
  • [1] A Dirty Data Recognition Method for Machinery Condition Monitoring in Big Data Era
    Lei, Yaguo
    Zhou, Xin
    Xu, Xuefang
    Jia, Feng
    IECON 2017 - 43RD ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, 2017, : 7061 - 7066
  • [2] Experimental Optimization of Big Data Cleaning Method for Agricultural Machinery
    Yuan Y.
    Xu L.
    Ji F.
    Guo D.
    An S.
    Niu K.
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2021, 52 (06): : 35 - 42
  • [3] Big Data Cleaning
    Tang, Nan
    WEB TECHNOLOGIES AND APPLICATIONS, APWEB 2014, 2014, 8709 : 13 - 24
  • [4] A Data Cleaning Method for Big Trace Data Using Movement Consistency
    Yang, Xue
    Tang, Luliang
    Zhang, Xia
    Li, Qingquan
    SENSORS, 2018, 18 (03):
  • [5] A Big Data Cleaning Method for Drinking-Water Streaming Data
    Gai, Rong-Li
    Zhang, Hao
    Thanh, Dang Ngoc Hoang
    BRAZILIAN ARCHIVES OF BIOLOGY AND TECHNOLOGY, 2023, 66
  • [6] Data cleaning and restoring method for vehicle battery big data platform
    Li, Shuangqi
    He, Hongwen
    Zhao, Pengfei
    Cheng, Shuang
    APPLIED ENERGY, 2022, 320
  • [7] Research on Data Quality Assurance for Health Condition Monitoring of Machinery
    Lei Y.
    Xu X.
    Cai X.
    Li N.
    Kong D.
    Zhang Y.
    Jixie Gongcheng Xuebao/Journal of Mechanical Engineering, 2021, 57 (04): : 1 - 9
  • [8] An anomaly detection method for rotating machinery monitoring based on the most representative data
    Lis, Antoni
    Dworakowski, Ziemowit
    Czubak, Piotr
    JOURNAL OF VIBROENGINEERING, 2021, 23 (04) : 861 - 876
  • [9] Plant machinery working life prediction method utilizing reliability and condition-monitoring data
    Goode, KB
    Moore, J
    Roylance, BJ
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART E-JOURNAL OF PROCESS MECHANICAL ENGINEERING, 2000, 214 (E2) : 109 - 122
  • [10] Research on the Technology of Data Cleaning in Big Data
    Feng, Fu-jun
    Yao, Jun-ping
    Li, Xiao-jun
    2018 2ND INTERNATIONAL CONFERENCE ON APPLIED MATHEMATICS, MODELING AND SIMULATION (AMMS 2018), 2018, 305 : 176 - 181