Data Augmentation Classifier for Imbalanced Fault Classification

被引:83
|
作者
Jiang, Xiaoyu [1 ]
Ge, Zhiqiang [1 ]
机构
[1] Zhejiang Univ, Coll Control Sci & Engn, Inst Ind Proc Control, State Key Lab Ind Control Technol, Hangzhou 310027, Peoples R China
基金
中国国家自然科学基金;
关键词
Gallium nitride; Training; Generators; Generative adversarial networks; Data models; Games; Germanium; Data augmentation; data selection; fault classification; generative adversarial networks; imbalanced data; ANALYTICS; SMOTE;
D O I
10.1109/TASE.2020.2998467
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The problem of fault classification in industry has been studied extensively. Most classification algorithms are modeled on the premise of data balance. However, the difficulty of collecting industrial data in different modes is quite different. This inevitably leads to data imbalance, which will adversely affect the fault classification performance. This article proposes a novel data augmentation classifier (DAC) for imbalanced fault classification. Data augmentation based on generative adversarial networks (GANs) is an effective way to solve the problem of unbalanced classification. However, the randomness of the GAN generation process restricts the effect of data enhancement. DAC proposes a data selection strategy based on data filtering and data purification in model training to solve this problem. In addition, DAC combines supervised learning and data generation processes to obtain an end-to-end model. Meanwhile, multigenerator structure of DAC (MDAC) is proposed to solve the problem of incomplete learning of a single generator when data imbalances get complicated. The proposed DAC and MDAC are applied in two fault classification cases of the Tennessee Eastman (TE) benchmark process, results of which show superiority of DAC and MDAC compared to existing methods. Note to Practitioners-Data imbalances are common in fault classification and affect the effectiveness of modeling in industry. As a generative model, generative adversarial networks (GANs) provide new ideas for small-class data augmentation. However, the instability of its training process and the randomness of data generation affect the results of data augmentation. In this article, the GAN generation process is analyzed in detail. The results of the visualization indicate that no data generation was perfect at any one time. Based on the rules of GAN data generation, we propose a data selection strategy during training. High-quality data are selected for data augmentation through data filtering and data purification. Apart from this, we combine the training process of GAN and classification model for imbalanced data to reduce modeling time. Through industrial examples, we have evaluated the effectiveness of this method.
引用
收藏
页码:1206 / 1217
页数:12
相关论文
共 50 条
  • [1] Time series data augmentation classifier for industrial process imbalanced fault diagnosis
    Shen, Bingbing
    Yao, Le
    Jiang, Xiaoyu
    Yang, Zeyu
    Zeng, Jiusun
    [J]. 2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 1392 - 1397
  • [2] Ensemble Data Augmentation for Imbalanced Fault Diagnosis
    Jiang, Xiaoyu
    Zheng, Junhua
    Zhuang, Xinzhen
    Ge, Zhiqiang
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [3] Data Augmentation Meta-Classifier Scheme for imbalanced data sets
    Moreno-Barea, Francisco J.
    Jerez, Jose M.
    Franco, Leonardo
    [J]. 2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 1392 - 1399
  • [4] Gaussian Discriminative Analysis aided GAN for imbalanced big data augmentation and fault classification
    Zhuo, Yue
    Ge, Zhiqiang
    [J]. JOURNAL OF PROCESS CONTROL, 2020, 92 : 271 - 287
  • [5] An Ensemble Tree Classifier for Highly Imbalanced Data Classification
    Shi, Peibei
    Wang, Zhong
    [J]. JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2021, 34 (06) : 2250 - 2266
  • [6] An Ensemble Tree Classifier for Highly Imbalanced Data Classification
    SHI Peibei
    WANG Zhong
    [J]. Journal of Systems Science & Complexity, 2021, 34 (06) : 2250 - 2266
  • [7] An Imbalanced Data Augmentation and Assessment Method for Industrial Process Fault Classification With Application in Air Compressors
    Shi, Yilin
    Li, Jince
    Li, Hongguang
    Yang, Bo
    [J]. IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [8] An Ensemble Tree Classifier for Highly Imbalanced Data Classification
    Peibei Shi
    Zhong Wang
    [J]. Journal of Systems Science and Complexity, 2021, 34 : 2250 - 2266
  • [9] GraphSR: A Data Augmentation Algorithm for Imbalanced Node Classification
    Zhou, Mengting
    Gong, Zhiguo
    [J]. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 4, 2023, : 4954 - 4962
  • [10] Classifier Ensembles for Imbalanced Classification
    Schaefer, Gerald
    [J]. 2017 SEVENTH INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING TECHNOLOGY (INTECH 2017), 2017, : 1 - 2