Strategies to Face Imbalanced and Unlabelled Data in PHM Applications

被引:12
|
作者
Gouriveau, Rafael [1 ]
Ramasso, Emmanuel [1 ]
Zerhouni, Noureddine [1 ]
机构
[1] UTBM, ENSMM, UMR CNRS UFC 6174, FEMTO ST Inst,Automat Control & Micromechatron Sy, F-25000 Besancon, France
关键词
PROGNOSTICS;
D O I
10.3303/CET1333020
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Accuracy and usefulness of learned data-driven PHM models are closely related to availability and representativeness of data. Notably, two particular problems can be pointed out. First, how to improve the performances of learning algorithms in presence of underrepresented data and severe class distribution skews? This is often the case in PHM applications where faulty data can be hard (even dangerous) to gather, and can be sparsely distributed accordingly to the solicitations and failure modes. Secondly, how to cope with unlabelled data? Indeed, in many PHM problems, health states and transitions between states are not well defined, which leads to imprecision and uncertainty challenges. According to all this, the purpose of this paper is to address the problem of "learning PHM models when data are imbalanced and/or unlabelled" by proposing two types of learning schemes to face it. Imbalanced and unlabelled data are first defined and illustrated, and a taxonomy of PHM problems is proposed. The aim of this classification is to rank the difficulty of developing PHM models with respect to representativeness of data. Following that, two strategies are proposed as pieces of solution to cope with imbalanced and unlabeled data. The first one aims at going through very fast and/or evolving algorithms. This kind of training scheme enables repeating the learning phase in order to manage state discovery (as new data are available), notably when data are imbalanced. The second strategy aims at dealing with incompleteness and uncertainty of labels by taking advantage of partially-supervised training approaches. This enables taking into account some a priori knowledge and managing noise on labels. Both strategies are proposed as to improve robustness and reliability of estimates.
引用
收藏
页码:115 / 120
页数:6
相关论文
共 50 条
  • [1] Different Strategies of Fitting Logistic Regression for Positive and Unlabelled Data
    Teisseyre, Pawel
    Mielniczuk, Jan
    Lazecka, Malgorzata
    [J]. COMPUTATIONAL SCIENCE - ICCS 2020, PT IV, 2020, 12140 : 3 - 17
  • [3] Robust Thresholding Strategies for Highly Imbalanced and Noisy Data
    Johnson, Justin M.
    Khoshgoftaar, Taghi M.
    [J]. 20TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2021), 2021, : 1182 - 1188
  • [4] Data modeling strategies for imbalanced learning in visual search
    Tesic, Jelena
    Natsev, Apostol
    Xie, Lexing
    Smith, John R.
    [J]. 2007 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-5, 2007, : 1990 - 1993
  • [5] Using unlabelled data to update classification rules with applications in food authenticity studies
    Dean, N
    Murphy, TB
    Downey, G
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2006, 55 : 1 - 14
  • [6] Soft computing applications in PHM
    Bonissone, Piero P.
    [J]. COMPUTATIONAL INTELLIGENCE IN DECISION AND CONTROL, 2008, 1 : 751 - 756
  • [7] Explicitly Semantic Guidance for Face Sketch Attribute Recognition With Imbalanced Data
    Shahed, Shahadat
    Lin, Yuhao
    Hong, Jiangnan
    Zhou, Jinglin
    Gao, Fei
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 1502 - 1506
  • [8] Learning with constrained and unlabelled data
    Lange, T
    Law, MHC
    Jain, AK
    Buhmann, JM
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 1, PROCEEDINGS, 2005, : 731 - 738
  • [9] BAYESIAN ANALYSIS FOR IMBALANCED POSITIVE-UNLABELLED DIAGNOSIS CODES IN ELECTRONIC HEALTH RECORDS
    Wang, By Ru
    Liang, Ye
    Miao, Zhuqi
    Liu, Tieming
    [J]. ANNALS OF APPLIED STATISTICS, 2023, 17 (02): : 1220 - 1238
  • [10] Comparing Sampling Strategies for Tackling Imbalanced Data in Human Activity Recognition
    Alharbi, Fayez
    Ouarbya, Lahcen
    Ward, Jamie A.
    [J]. SENSORS, 2022, 22 (04)