A reliable adaptive prototype-based learning for evolving data streams with limited labels

被引:3
|
作者
Din, Salah Ud [1 ,2 ,3 ]
Ullah, Aman [1 ,2 ]
Mawuli, Cobbinah B. [1 ,2 ]
Yang, Qinli [1 ,2 ]
Shao, Junming [1 ,2 ]
机构
[1] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst Huzhou, Huzhou 313001, Peoples R China
[2] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, Chengdu 611731, Peoples R China
[3] COMSATS Univ Islamabad, Dept Comp Sci, Abbottabad Campus, Abbottabad 22020, Pakistan
基金
中国国家自然科学基金;
关键词
Data streams; Data-driven prototypes; Concept drift; Concept evolution; Semi-supervised classification; NONSTATIONARY DATA; CONCEPT DRIFT; CLASSIFICATION; ENSEMBLE; MODEL;
D O I
10.1016/j.ipm.2023.103532
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data stream mining presents notable challenges in the form of concept drift and evolution. Existing learning algorithms, typically designed within a supervised learning framework, require class labels for all data points. However, this is an impractical requirement given the rapid pace of data streams, which often results in label scarcity. Recognizing the realistic necessity of learning from data streams with limited labels, we propose an adaptive, data-driven, prototype-based semi-supervised learning framework specifically tailored to handle evolving data streams. Our method employs a prototype-based data representation, summarizing the continuous flow of streaming data using dynamic prototypes at varying levels of granularity. This technique enables improved data abstraction, capturing the underlying local data distributions more accurately. The model also incorporates reliability modeling and efficient emerging class discovery, dynamically updating the significance of prototypes over time and swiftly adapting to local concept drift. We further leverage these adaptive prototypes to intuitively detect concept evolution, i.e., identifying novel classes from a local density perspective. To minimize the need for manual labeling while optimizing performance, we incorporate active learning into our method. This method employs a dual-criteria approach for data point selection, considering both uncertainty and local density. These manually labeled data points, together with unlabeled data, serve to update the model efficiently and robustly. Empirical validation using several bench-mark datasets demonstrates promising performance in comparison to existing state-of-the-art techniques.
引用
下载
收藏
页数:22
相关论文
共 50 条
  • [1] Robust Prototype-Based Learning on Data Streams
    Shao, Junming
    Huang, Feng
    Yang, Qinli
    Luo, Guangchun
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (05) : 978 - 991
  • [2] Prototype-based Learning on Concept-drifting Data Streams
    Shao, Junming
    Ahmadi, Zahra
    Kramer, Stefan
    PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, : 412 - 421
  • [3] Learning High-Dimensional Evolving Data Streams With Limited Labels
    Din, Salah Ud
    Kumar, Jay
    Shao, Junming
    Mawuli, Cobbinah Bernard
    Ndiaye, Waldiodio David
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (11) : 11373 - 11384
  • [4] EINCKM: An Enhanced Prototype-based Method for Clustering Evolving Data Streams in Big Data
    Al Abd Alazeez, Ammar
    Jassim, Sabah
    Du, Hongbo
    ICPRAM: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION APPLICATIONS AND METHODS, 2017, : 173 - 183
  • [5] Selective prototype-based learning on concept-drifting data streams
    Chen, Dongzi
    Yang, Qinli
    Liu, Jiaming
    Zeng, Zhu
    INFORMATION SCIENCES, 2020, 516 : 20 - 32
  • [6] FedStream: Prototype-Based Federated Learning on Distributed Concept-Drifting Data Streams
    Mawuli, Cobbinah B.
    Che, Liwei
    Kumar, Jay
    Din, Salah Ud
    Qin, Zhili
    Yang, Qinli
    Shao, Junming
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (11): : 7112 - 7124
  • [7] Adaptive Learning from Evolving Data Streams
    Bifet, Albert
    Gavalda, Ricard
    ADVANCES IN INTELLIGENT DATA ANALYSIS VIII, PROCEEDINGS, 2009, 5772 : 249 - 260
  • [8] Adaptive Basis Functions for Prototype-based Classification of Functional Data
    Bani, Gabriele
    Seiffert, Udo
    Biehl, Michael
    Melchert, Friedrich
    2017 12TH INTERNATIONAL WORKSHOP ON SELF-ORGANIZING MAPS AND LEARNING VECTOR QUANTIZATION, CLUSTERING AND DATA VISUALIZATION (WSOM), 2017, : 145 - 152
  • [9] Adaptive basis functions for prototype-based classification of functional data
    Melchert, Friedrich
    Bani, Gabriele
    Seiffert, Udo
    Biehl, Michael
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (24): : 18213 - 18223
  • [10] Adaptive basis functions for prototype-based classification of functional data
    Friedrich Melchert
    Gabriele Bani
    Udo Seiffert
    Michael Biehl
    Neural Computing and Applications, 2020, 32 : 18213 - 18223