Active Learning over Evolving Data Streams using Paired Ensemble Framework

被引:0
|
作者
Xu, Wenhua [1 ]
Zhao, Fengfei [2 ]
Lu, Zhengcai [2 ]
机构
[1] Ocean Univ China, Dept Comp Sci & Technol, Qingdao, Peoples R China
[2] Tsinghua Univ, Dept Comp Sci & Technol, Beijing, Peoples R China
关键词
Active learning; classifier ensemble; concept drift; data streams;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Stream data is considered as one of the main sources of big data. The inherent scarcity of labeled instances and the underlying concept drift have posed significant challenges on stream data classification in practice. A paired ensemble active learning framework is proposed to tackle the challenges. First, an ensemble model consists of two base classifiers is exploited to detect the changes over time, as well as to make prediction on new instances. Second, two active learning strategies work alternatively to find out the most informative instances without missing the potential changes happened anywhere in the instance space. Third, the informativeness of an instance is measured by a margin based metric, and it can effectively capture uncertain instances. Experimental results on real-world datasets demonstrate that the proposed approach can achieve good predictive accuracy on data streams.
引用
收藏
页码:180 / 185
页数:6
相关论文
共 50 条
  • [1] Online Active Learning with Drifted Data Streams Using Paired Ensemble Framework
    Shan, Ji-Cheng
    Liu, Wei-Ke
    Chu, Chen-Xi
    Dai, Chao-Fan
    Liu, Qing-Bao
    [J]. 4TH ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND APPLICATIONS (ITA 2017), 2017, 12
  • [2] Online Active Learning Ensemble Framework for Drifted Data Streams
    Shan, Jicheng
    Zhang, Hang
    Liu, Weike
    Liu, Qingbao
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (02) : 486 - 498
  • [3] Online semi-supervised active learning ensemble classification for evolving imbalanced data streams
    Guo, Yinan
    Pu, Jiayang
    Jiao, Botao
    Peng, Yanyan
    Wang, Dini
    Yang, Shengxiang
    [J]. APPLIED SOFT COMPUTING, 2024, 155
  • [4] Clustering Based Active Learning for Evolving Data Streams
    Ienco, Dino
    Bifet, Albert
    Zliobaite, Indre
    Pfahringer, Bernhard
    [J]. DISCOVERY SCIENCE, 2013, 8140 : 79 - 93
  • [5] Ensemble Diversity in Evolving Data Streams
    Brzezinski, Dariusz
    Stefanowski, Jerzy
    [J]. DISCOVERY SCIENCE, (DS 2016), 2016, 9956 : 229 - 244
  • [6] An Effective Ensemble-based Framework for Outlier Detection in Evolving Data Streams
    Hassan, Asmaa F.
    Barakat, Sherif
    Rezk, Amira
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (11) : 315 - 329
  • [7] Incremental density-based ensemble clustering over evolving data streams
    Khan, Imran
    Huang, Joshua Z.
    Ivanov, Kamen
    [J]. NEUROCOMPUTING, 2016, 191 : 34 - 43
  • [8] Heterogeneous ensemble selection for evolving data streams
    Luong, Anh Vu
    Nguyen, Tien Thanh
    Liew, Alan Wee-Chung
    Wang, Shilin
    [J]. PATTERN RECOGNITION, 2021, 112
  • [9] New Ensemble Methods For Evolving Data Streams
    Bifet, Albert
    Holmes, Geoff
    Pfahringer, Bernhard
    Kirkby, Richard
    Gavalda, Ricard
    [J]. KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2009, : 139 - 147
  • [10] Improved Ensemble Classification for Evolving Data Streams
    Tian, Hui
    Wang, Lulu
    Shen, Hong
    Liew, Alan Wee-Chung
    [J]. IEEE INTELLIGENT SYSTEMS, 2022, 37 (01) : 38 - 50