A Novel Imbalanced Data Classification Method Based on Weakly Supervised Learning for Fault Diagnosis

被引:36
|
作者
Liu, Hui [1 ]
Liu, Zhenyu [1 ]
Jia, Weiqiang [1 ,2 ]
Zhang, Donghao [1 ]
Tan, Jianrong [1 ]
机构
[1] Zhejiang Univ, State Key Lab Comp Aided Design & Comp Graph, Hangzhou 310027, Peoples R China
[2] Zhejiang Lab, Hangzhou 311121, Peoples R China
基金
中国国家自然科学基金;
关键词
Fault diagnosis; Supervised learning; Support vector machines; Classification algorithms; Informatics; Prognostics and health management; Prediction algorithms; Bidirectional gated recurrent units (BGRU); class imbalance; support vector machine (SVM); weakly supervised learning; SMOTE;
D O I
10.1109/TII.2021.3084132
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The class imbalance problem has a huge impact on the performance of diagnostic models. When it occurs, the minority samples are easily ignored by classification models. Besides, the distribution of class imbalanced data differs from the actual data distribution, which makes it difficult for classifiers to learn an accurate decision boundary. To tackle the above issues, this article proposes a novel imbalanced data classification method based on weakly supervised learning. First, Bagging algorithm is employed to sample majority data randomly to generate several relatively balanced subsets, which are then used to train several support vector machine (SVM) classifiers. Next, these trained SVM classifiers are adopted to predict the labels of those unlabeled data, and samples that are predicted as minority class are added to the original dataset to reduce the imbalance ratio. The critical idea of this article is to introduce real-world samples into the imbalanced dataset by virtue of weakly supervised learning. In addition, bidirectional gated recurrent units are used to construct a diagnostic model for fault diagnosis, and a new weighted cross-entropy function is proposed as the loss function to reduce the impact of noise. Besides, it also increases the model's attention to the original minority samples. Furthermore, experimental evaluations of the proposed method are conducted on two datasets, i.e., Prognostics and Health Management challenge 2008 and 2010 datasets, and the experimental results demonstrate the effectiveness and superiority of the proposed method.
引用
收藏
页码:1583 / 1593
页数:11
相关论文
共 50 条
  • [21] Diesel Engine Fault Diagnosis Method for Imbalanced Data
    Fengrong, Bi
    Mingzhi, Guo
    Xiaoyang, Bi
    Daijie, Tang
    Pengfei, Shen
    Meng, Huang
    Tianjin Daxue Xuebao (Ziran Kexue yu Gongcheng Jishu Ban)/Journal of Tianjin University Science and Technology, 2024, 57 (08): : 810 - 820
  • [22] A Novel Data-Driven Fault Diagnosis Method Based on Deep Learning
    Zhang, Yuyan
    Gao, Liang
    Li, Xinyu
    Li, Peigen
    DATA MINING AND BIG DATA, DMBD 2017, 2017, 10387 : 442 - 452
  • [23] A class-aware supervised contrastive learning framework for imbalanced fault diagnosis
    Zhang, Jiyang
    Zou, Jianxiao
    Su, Zhiheng
    Tang, Jianxiong
    Kang, Yuhao
    Xu, Hongbing
    Liu, Zhiliang
    Fan, Shicai
    KNOWLEDGE-BASED SYSTEMS, 2022, 252
  • [24] Fault diagnosis method based on online semi-supervised learning
    Yin, G. (gang.gang88@163.com), 1600, Nanjing University of Aeronautics an Astronautics (25):
  • [25] Fault diagnosis method based on triple generative adversarial nets for imbalanced data
    Su, Changwei
    Wang, Xueren
    Liu, Ruijie
    Guo, Ziyi
    Sang, Shengtian
    Yu, Shuang
    Zhang, Haifeng
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2023, 34 (03)
  • [26] Fault diagnosis method based on supervised particle swarm optimization classification algorithm
    Zheng, Bo
    Huang, Hong-Zhong
    Guo, Wei
    Li, Yan-Feng
    Mi, Jinhua
    INTELLIGENT DATA ANALYSIS, 2018, 22 (01) : 191 - 210
  • [27] Lightweight bearing fault diagnosis method based on cross-scale learning transformer under imbalanced data
    Zhao, Huimin
    Li, Peixi
    Guo, Aibin
    Deng, Wu
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (10)
  • [28] Supervised Density-Based Metric Learning Based on Bhattacharya Distance for Imbalanced Data Classification Problems
    Jalali Mojahed, Atena
    Moattar, Mohammad Hossein
    Ghaffari, Hamidreza
    Big Data and Cognitive Computing, 2024, 8 (09)
  • [29] A Novel Fault Diagnosis Method Based on Semi-supervised Max-margin Dictionary Learning
    Wang W.
    Tao J.
    Liu Z.
    Zhendong Ceshi Yu Zhenduan/Journal of Vibration, Measurement and Diagnosis, 2019, 39 (05): : 1068 - 1074
  • [30] INTELLIGENT BEARING FAULT DIAGNOSIS METHOD BASED ON HNR ENVELOPE AND CLASSIFICATION USING SUPERVISED MACHINE LEARNING ALGORITHMS
    Ouachtouk, Ilias
    El Hani, Soumia
    Dahi, Khalid
    ADVANCES IN ELECTRICAL AND ELECTRONIC ENGINEERING, 2021, 19 (04) : 282 - 294