An Incorrect Data Detection Method for Big Data Cleaning of Machinery Condition Monitoring

Cited by: 99
Authors
Xu, Xuefang [1]
Lei, Yaguo [1]
Li, Zeda [1]
Affiliations
[1] Xi An Jiao Tong Univ, Key Lab Educ, Minist Modern Design & Rotor Bearing Syst, Xian 710049, Shaanxi, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Condition-monitoring big data; data cleaning; data quality; incorrect data; local outlier factor (LOF); outlier detection; network
DOI
10.1109/TIE.2019.2903774
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Incorrect data degrade the quality of condition-monitoring big data, so analyses of such data are likely to yield unreliable or misleading results. To improve data quality, this paper proposes an incorrect data detection method for data cleaning based on an improved local outlier factor (LOF). First, a sliding window divides the data into segments. Each segment is treated as an object whose attributes are time-domain statistical features extracted from it, such as the mean, maximum, and peak-to-peak value. Second, a kernel-based LOF (KLOF) is calculated from these attributes to evaluate the degree to which each segment is incorrect. Third, incorrect data are detected by comparing the KLOF values against a threshold. Finally, the method is verified on simulated vibration data from a defective rolling element bearing and on three real cases: a fixed-axle gearbox, a wind turbine, and a planetary gearbox. The results demonstrate that the proposed method effectively detects both missing segments and abnormal segments, two typical kinds of incorrect data, and is therefore helpful for big data cleaning in machinery condition monitoring.
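The three steps described in the abstract (segment, score, threshold) can be illustrated with a minimal Python sketch. It is not the authors' implementation: it computes the standard, non-kernel LOF rather than the paper's KLOF, whose exact formulation is not given here. The function names `segment_features` and `lof_scores`, the median/MAD scaling, the window length, the neighbour count `k`, the threshold of 5.0, and the synthetic signal (with a flat zero stretch standing in for a missing segment) are all assumptions chosen for illustration.

```python
import numpy as np

def segment_features(signal, window, step):
    """Return one row of (mean, maximum, peak-to-peak) per sliding-window segment."""
    starts = range(0, len(signal) - window + 1, step)
    segs = np.array([signal[s:s + window] for s in starts])
    return np.column_stack([segs.mean(axis=1), segs.max(axis=1), np.ptp(segs, axis=1)])

def lof_scores(features, k):
    """Standard (non-kernel) LOF per row; values near 1 mean 'like its neighbours'."""
    # Assumption: robust median/MAD scaling so no single feature dominates distances.
    med = np.median(features, axis=0)
    mad = np.median(np.abs(features - med), axis=0) + 1e-12
    x = (features - med) / mad
    # Pairwise Euclidean distances between segments in feature space.
    d = np.linalg.norm(x[:, None, :] - x[None, :, :], axis=2)
    np.fill_diagonal(d, np.inf)                # a segment is not its own neighbour
    nn = np.argsort(d, axis=1)[:, :k]          # indices of the k nearest neighbours
    d_nn = np.take_along_axis(d, nn, axis=1)   # distances to those neighbours
    k_dist = d_nn[:, -1]                       # k-distance of every segment
    # Reachability distance: max(k-distance of the neighbour, actual distance).
    reach = np.maximum(d_nn, k_dist[nn])
    lrd = 1.0 / (reach.mean(axis=1) + 1e-12)   # local reachability density
    return lrd[nn].mean(axis=1) / lrd          # mean neighbour density / own density

# Synthetic example (hypothetical data): a noisy sine with two corrupted stretches.
rng = np.random.default_rng(0)
t = np.arange(2000)
signal = np.sin(2 * np.pi * t / 50) + 0.1 * rng.standard_normal(t.size)
signal[500:600] = 0.0       # stand-in for a missing segment (segment index 5)
signal[1200:1300] *= 10.0   # stand-in for an abnormal segment (segment index 12)

features = segment_features(signal, window=100, step=100)
scores = lof_scores(features, k=5)
threshold = 5.0             # hypothetical threshold, not taken from the paper
flagged = np.flatnonzero(scores > threshold)
print(flagged)
```

Segments whose score exceeds the threshold are treated as incorrect data. In practice the window length, feature set, and threshold would need tuning for the signal at hand, and a kernel-based distance would replace the Euclidean one to follow the paper more closely.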
Pages: 2326-2336
Number of pages: 11
Related papers
50 in total
  • [41] The cleaning method of duplicate big data based on association rule mining algorithm
    Wu, Ming
    INTERNATIONAL JOURNAL OF AUTONOMOUS AND ADAPTIVE COMMUNICATIONS SYSTEMS, 2023, 16 (02) : 220 - 231
  • [42] Research on the Electrical Equipment Condition Monitoring System Architecture Based on Big Data
    Zhang Bowen
    Wang Feng
    Han Shuai
    Bi Jiangang
    Yan Chunyu
    2017 2ND INTERNATIONAL CONFERENCE ON CONTROL AND ROBOTICS ENGINEERING (ICCRE2017), 2017
  • [43] A hardware device for data fusion and novelty detection in condition monitoring
    Taylor, O
    MacIntyre, J
    SENSOR FUSION: ARCHITECTURES, ALGORITHMS, AND APPLICATIONS IV, 2000, 4051 : 160 - 171
  • [44] Big Data Processing and Analysis Platform for Condition Monitoring of Electric Power System
    Guo, Yuanjun
    Feng, Shengzhong
    Li, Kang
    Mo, Wenxiong
    Liu, Yuquan
    Wang, Yong
    2016 UKACC 11TH INTERNATIONAL CONFERENCE ON CONTROL (CONTROL), 2016
  • [45] A SYSTEMATIC MAPPING REVIEW ON DATA CLEANING METHODS IN BIG DATA ENVIRONMENTS
    Iwata, Claudio Keiji
    Galegale, Napoleao Verardi
    Ito, Marcia
    de Azevedo, Marilia Macorin
    Feitosa, Marcelo Duduchi
    Arima, Carlos Hideo
    IADIS-INTERNATIONAL JOURNAL ON COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2024, 19 (02) : 19 - 36
  • [46] Data Cleaning for Power Quality Monitoring
    Yang, Zijing
    Cao, Junwei
    Xu, Yanxiang
    Zhang, Huaying
    Yu, Peng
    Yao, Senjing
    2013 FOURTH INTERNATIONAL CONFERENCE ON NETWORKING AND DISTRIBUTED COMPUTING (ICNDC), 2013, : 111 - 115
  • [47] Study on the Transfer Model of Data Resource in the Condition of Big Data
    Liu Ququ
    Luo Ling
    2015 INTERNATIONAL CONFERENCE ON INDUSTRIAL INFORMATICS - COMPUTING TECHNOLOGY, INTELLIGENT TECHNOLOGY, INDUSTRIAL INFORMATION INTEGRATION (ICIICII), 2015, : 299 - 302
  • [48] Condition monitoring for helicopter data
    Wen, F
    Willett, P
    Deb, S
    SMC 2000 CONFERENCE PROCEEDINGS: 2000 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOL 1-5, 2000, : 224 - 229
  • [49] Review of Bridge Structural Health Monitoring Aided by Big Data and Artificial Intelligence: From Condition Assessment to Damage Detection
    Sun, Limin
    Shang, Zhiqiang
    Xia, Ye
    Bhowmick, Sutanu
    Nagarajaiah, Satish
    JOURNAL OF STRUCTURAL ENGINEERING, 2020, 146 (05)
  • [50] Monitoring Data Integrity in Big Data Analytics Services
    Mantzoukas, Konstantinos
    Kloukinas, Christos
    Spanoudakis, George
    PROCEEDINGS 2018 IEEE 11TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2018, : 904 - 907