Online-Dynamic-Clustering-Based Soft Sensor for Industrial Semi-Supervised Data Streams

被引:4
|
作者
Wang, Yuechen [1 ,2 ]
Jin, Huaiping [1 ,2 ]
Chen, Xiangguang [3 ]
Wang, Bin [1 ]
Yang, Biao [1 ]
Qian, Bin [1 ]
机构
[1] Kunming Univ Sci & Technol, Fac Informat Engn & Automation, Dept Automation, Kunming 650500, Peoples R China
[2] Yunnan Key Lab Green Energy Elect Power Measuremen, Kunming 650500, Peoples R China
[3] Beijing Inst Technol, Sch Chem & Chem Engn, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
soft sensor; semi-supervised data streams; online clustering; adaptive switching prediction; sample augmentation; Gaussian process regression; EVOLVING DATA STREAMS; ALGORITHM; NETWORKS;
D O I
10.3390/s23031520
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
In the era of big data, industrial process data are often generated rapidly in the form of streams. Thus, how to process such sequential and high-speed stream data in real time and provide critical quality variable predictions has become a critical issue for facilitating efficient process control and monitoring in the process industry. Traditionally, soft sensor models are usually built through offline batch learning, which remain unchanged during the online implementation phase. Once the process state changes, soft sensors built from historical data cannot provide accurate predictions. In practice, industrial process data streams often exhibit characteristics such as nonlinearity, time-varying behavior, and label scarcity, which pose great challenges for building high-performance soft sensor models. To address this issue, an online-dynamic-clustering-based soft sensor (ODCSS) is proposed for industrial semi-supervised data streams. The method achieves automatic generation and update of clusters and samples deletion through online dynamic clustering, thus enabling online dynamic identification of process states. Meanwhile, selective ensemble learning and just-in-time learning (JITL) are employed through an adaptive switching prediction strategy, which enables dealing with gradual and abrupt changes in process characteristics and thus alleviates model performance degradation caused by concept drift. In addition, semi-supervised learning is introduced to exploit the information of unlabeled samples and obtain high-confidence pseudo-labeled samples to expand the labeled training set. The proposed method can effectively deal with nonlinearity, time-variability, and label scarcity issues in the process data stream environment and thus enable reliable target variable predictions. The application results from two case studies show that the proposed ODCSS soft sensor approach is superior to conventional soft sensors in a semi-supervised data stream environment.
引用
收藏
页数:29
相关论文
共 50 条
  • [31] Online semi-supervised active learning ensemble classification for evolving imbalanced data streams
    Guo, Yinan
    Pu, Jiayang
    Jiao, Botao
    Peng, Yanyan
    Wang, Dini
    Yang, Shengxiang
    [J]. APPLIED SOFT COMPUTING, 2024, 155
  • [32] A Semi-supervised Ensemble Approach for Mining Data Streams
    Liu, Jing
    Xu, Guo-Sheng
    Xiao, Da
    Gu, Li-Ze
    Niu, Xin-Xin
    [J]. JOURNAL OF COMPUTERS, 2013, 8 (11) : 2873 - 2879
  • [33] Semi-supervised federated learning on evolving data streams
    Mawuli, Cobbinah B.
    Kumar, Jay
    Nanor, Ebenezer
    Fu, Shangxuan
    Pan, Liangxu
    Yang, Qinli
    Zhang, Wei
    Shao, Junming
    [J]. INFORMATION SCIENCES, 2023, 643
  • [34] Adversarial smoothing tri-regression for robust semi-supervised industrial soft sensor
    Feng, Liangjun
    Zhao, Chunhui
    Huang, Biao
    [J]. JOURNAL OF PROCESS CONTROL, 2021, 108 : 86 - 97
  • [35] Laplacian regularization of linear regression model for semi-supervised industrial soft sensor development
    Zheng, Junhua
    Ye, Lingjian
    Ge, Zhiqiang
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2024, 254
  • [36] Semi-Supervised Semantic Dynamic Text Clustering Algorithm
    Qian, Zhi-Sen
    Huang, Rui-Zhang
    Wei, Qin
    Qin, Yong-Bin
    Chen, Yan-Ping
    [J]. Dianzi Keji Daxue Xuebao/Journal of the University of Electronic Science and Technology of China, 2019, 48 (06): : 803 - 808
  • [37] Semi-supervised clustering algorithm based on small size of labeled data
    Leng, Mingwei
    Chen, Xiaoyun
    Cheng, Jianjun
    Li, Longjie
    [J]. FRONTIERS OF MANUFACTURING AND DESIGN SCIENCE II, PTS 1-6, 2012, 121-126 : 4675 - 4679
  • [38] Pattern recognition of UAV flight data based on semi-supervised clustering
    Wang, N.
    Xu, Z. Sh
    Sun, Sh W.
    Liu, Y.
    [J]. 2018 11TH INTERNATIONAL CONFERENCE ON COMPUTER AND ELECTRICAL ENGINEERING, 2019, 1195
  • [39] Semi-supervised Clustering Framework Based on Active Learning for Real Data
    Odate, Ryosuke
    Shinjo, Hiroshi
    Suzuki, Yasufumi
    Motobayashi, Masahiro
    [J]. STRUCTURAL, SYNTACTIC, AND STATISTICAL PATTERN RECOGNITION, S+SSPR 2018, 2018, 11004 : 184 - 193
  • [40] MVS-based Semi-Supervised Clustering
    Yan, Yang
    Chen, Lihui
    Chan, Chee Keong
    [J]. 2013 9TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING (ICICS), 2013,