A Novel Imbalanced Data Classification Method Based on Weakly Supervised Learning for Fault Diagnosis

被引:36
|
作者
Liu, Hui [1 ]
Liu, Zhenyu [1 ]
Jia, Weiqiang [1 ,2 ]
Zhang, Donghao [1 ]
Tan, Jianrong [1 ]
机构
[1] Zhejiang Univ, State Key Lab Comp Aided Design & Comp Graph, Hangzhou 310027, Peoples R China
[2] Zhejiang Lab, Hangzhou 311121, Peoples R China
基金
中国国家自然科学基金;
关键词
Fault diagnosis; Supervised learning; Support vector machines; Classification algorithms; Informatics; Prognostics and health management; Prediction algorithms; Bidirectional gated recurrent units (BGRU); class imbalance; support vector machine (SVM); weakly supervised learning; SMOTE;
D O I
10.1109/TII.2021.3084132
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The class imbalance problem has a huge impact on the performance of diagnostic models. When it occurs, the minority samples are easily ignored by classification models. Besides, the distribution of class imbalanced data differs from the actual data distribution, which makes it difficult for classifiers to learn an accurate decision boundary. To tackle the above issues, this article proposes a novel imbalanced data classification method based on weakly supervised learning. First, Bagging algorithm is employed to sample majority data randomly to generate several relatively balanced subsets, which are then used to train several support vector machine (SVM) classifiers. Next, these trained SVM classifiers are adopted to predict the labels of those unlabeled data, and samples that are predicted as minority class are added to the original dataset to reduce the imbalance ratio. The critical idea of this article is to introduce real-world samples into the imbalanced dataset by virtue of weakly supervised learning. In addition, bidirectional gated recurrent units are used to construct a diagnostic model for fault diagnosis, and a new weighted cross-entropy function is proposed as the loss function to reduce the impact of noise. Besides, it also increases the model's attention to the original minority samples. Furthermore, experimental evaluations of the proposed method are conducted on two datasets, i.e., Prognostics and Health Management challenge 2008 and 2010 datasets, and the experimental results demonstrate the effectiveness and superiority of the proposed method.
引用
下载
收藏
页码:1583 / 1593
页数:11
相关论文
共 50 条
  • [1] A Weakly Supervised Learning-Based Oversampling Framework for Class-Imbalanced Fault Diagnosis
    Qian, Min
    Li, Yan-Fu
    IEEE TRANSACTIONS ON RELIABILITY, 2022, 71 (01) : 429 - 442
  • [2] Semi-Supervised Transfer Learning Method for Bearing Fault Diagnosis with Imbalanced Data
    Zong, Xia
    Yang, Rui
    Wang, Hongshu
    Du, Minghao
    You, Pengfei
    Wang, Su
    Su, Hao
    MACHINES, 2022, 10 (07)
  • [3] Imbalanced fault diagnosis based on semi-supervised ensemble learning
    Chuanxia Jian
    Yinhui Ao
    Journal of Intelligent Manufacturing, 2023, 34 : 3143 - 3158
  • [4] Imbalanced fault diagnosis based on semi-supervised ensemble learning
    Jian, Chuanxia
    Ao, Yinhui
    JOURNAL OF INTELLIGENT MANUFACTURING, 2023, 34 (07) : 3143 - 3158
  • [5] Research on bearing fault diagnosis method based on cjbm with semi-supervised and imbalanced data
    Li, Sai
    Peng, Yanfeng
    Bin, Guangfu
    Shen, Yiping
    Guo, Yong
    Li, Baoqing
    Jiang, Yongzheng
    Fan, Chao
    NONLINEAR DYNAMICS, 2024, : 19759 - 19781
  • [6] A Novel Fault Diagnosis method for Rotating Machinery of Imbalanced Data
    Han, Qi
    Wang, Xianghua
    Yang, Rui
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 2072 - 2077
  • [7] Fault diagnosis method for imbalanced and unlabeled data based on bayesian graph balanced learning
    Zhou, Ziyou
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2024, 35 (09)
  • [8] Imbalanced Data Classification Method Based on Ensemble Learning
    Xiang, Yu
    Xie, Yongping
    COMMUNICATIONS, SIGNAL PROCESSING, AND SYSTEMS, CSPS 2018, VOL III: SYSTEMS, 2020, 517 : 18 - 24
  • [9] Electrical Fault Diagnosis From Text Data: A Supervised Sentence Embedding Combined With Imbalanced Classification
    Jing, Xiao
    Wu, Zhiang
    Zhang, Lu
    Li, Zhe
    Mu, Dejun
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2024, 71 (03) : 3064 - 3073
  • [10] Research of System Fault Diagnosis Method Based on Imbalanced Data
    Zhu, QingYu
    Liu, Hengyu
    Wang, Junling
    Chen, Shaowei
    Wen, Pengfei
    Wang, Shengyue
    2019 PROGNOSTICS AND SYSTEM HEALTH MANAGEMENT CONFERENCE (PHM-QINGDAO), 2019,