A reliable adaptive prototype-based learning for evolving data streams with limited labels

被引:3
|
作者
Din, Salah Ud [1 ,2 ,3 ]
Ullah, Aman [1 ,2 ]
Mawuli, Cobbinah B. [1 ,2 ]
Yang, Qinli [1 ,2 ]
Shao, Junming [1 ,2 ]
机构
[1] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst Huzhou, Huzhou 313001, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
[3] COMSATS Univ Islamabad, Dept Comp Sci, Abbottabad Campus, Abbottabad 22020, Pakistan
基金
中国国家自然科学基金;
关键词
Data streams; Data-driven prototypes; Concept drift; Concept evolution; Semi-supervised classification; NONSTATIONARY DATA; CONCEPT DRIFT; CLASSIFICATION; ENSEMBLE; MODEL;
D O I
10.1016/j.ipm.2023.103532
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data stream mining presents notable challenges in the form of concept drift and evolution. Existing learning algorithms, typically designed within a supervised learning framework, require class labels for all data points. However, this is an impractical requirement given the rapid pace of data streams, which often results in label scarcity. Recognizing the realistic necessity of learning from data streams with limited labels, we propose an adaptive, data-driven, prototype-based semi-supervised learning framework specifically tailored to handle evolving data streams. Our method employs a prototype-based data representation, summarizing the continuous flow of streaming data using dynamic prototypes at varying levels of granularity. This technique enables improved data abstraction, capturing the underlying local data distributions more accurately. The model also incorporates reliability modeling and efficient emerging class discovery, dynamically updating the significance of prototypes over time and swiftly adapting to local concept drift. We further leverage these adaptive prototypes to intuitively detect concept evolution, i.e., identifying novel classes from a local density perspective. To minimize the need for manual labeling while optimizing performance, we incorporate active learning into our method. This method employs a dual-criteria approach for data point selection, considering both uncertainty and local density. These manually labeled data points, together with unlabeled data, serve to update the model efficiently and robustly. Empirical validation using several bench-mark datasets demonstrates promising performance in comparison to existing state-of-the-art techniques.
引用
下载
收藏
页数:22
相关论文
共 50 条
  • [31] Evidential prototype-based clustering based on transfer learning
    Zhou, Kuang
    Guo, Mei
    Martin, Arnaud
    International Journal of Approximate Reasoning, 2022, 151 : 322 - 343
  • [32] A Prototype-Based Adaptive Concept Drift Classification Method
    Su J.
    Qiu X.-F.
    Li S.-F.
    Liu D.-W.
    Zhang C.-H.
    1600, Beijing University of Posts and Telecommunications (40): : 43 - 50
  • [33] ContraCluster: Learning to Classify without Labels by Contrastive Self-Supervision and Prototype-Based Semi-Supervision
    Joe, Seongho
    Kim, Byoungjip
    Kang, Hoyoung
    Park, Kyoungwon
    Kim, Bogun
    Park, Jaeseon
    Lee, Joonseok
    Gwon, Youngjune
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 4685 - 4692
  • [34] Prototype-Based Discriminative Feature Learning for Kinship Verification
    Yan, Haibin
    Lu, Jiwen
    Zhou, Xiuzhuang
    IEEE TRANSACTIONS ON CYBERNETICS, 2015, 45 (11) : 2535 - 2545
  • [35] Risk Sensitivity and Entropy Regularization in Prototype-based Learning
    Mavridis, Christos
    Noorani, Erfaun
    Baras, John S.
    2022 30TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2022, : 194 - 199
  • [36] Prototype-based Models for the Supervised Learning of Classification Schemes
    Biehl, Michael
    Hammer, Barbara
    Villmann, Thomas
    ASTROINFORMATICS, 2017, 12 (S325): : 129 - 138
  • [37] Sentiment analysis on big sparse data streams with limited labels
    Vasileios Iosifidis
    Eirini Ntoutsi
    Knowledge and Information Systems, 2020, 62 : 1393 - 1432
  • [38] Sentiment analysis on big sparse data streams with limited labels
    Iosifidis, Vasileios
    Ntoutsi, Eirini
    KNOWLEDGE AND INFORMATION SYSTEMS, 2020, 62 (04) : 1393 - 1432
  • [39] Distant Supervision via Prototype-Based Global Representation Learning
    Han, Xianpei
    Sun, Le
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3443 - 3449
  • [40] A Prototype-Based Knowledge Distillation Framework for Heterogeneous Federated Learning
    Lyu, Feng
    Tang, Cheng
    Deng, Yongheng
    Liu, Tong
    Zhang, Yongmin
    Zhang, Yaoxue
    2023 IEEE 43RD INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, ICDCS, 2023, : 37 - 47