Tracking Recurring Concepts from Evolving Data Streams using Ensemble Method

被引:0
|
作者
Sun, Yange [1 ,2 ]
Wang, Zhihai [2 ]
Yuan, Jidong [2 ]
Zhang, Wei [2 ]
机构
[1] Xinyang Normal Univ, Sch Comp & Informat Technol, Xinyang, Peoples R China
[2] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Data streams; ensemble classification; change detection; recurring concept; Jensen-Shannon divergence; CONCEPT DRIFT; SYSTEM;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ensemble models are the most widely used methods for classifying evolving data stream. However, most of the existing data stream ensemble classification algorithms do not consider the issue of recurring concepts, which commonly exist in real-world applications. Motivated by this challenge, an Ensemble with internal Change Detection (ECD) was proposed to enhance performance by exploring the recurring concepts. It is done by maintaining a pool of classifiers, which dynamically adds and removes classifiers in response to the change detector. The algorithm adopts a two window change detection model, which adopts the Jensen-Shannon divergence to measure the distance of the distributions between old and recent data. When a change is detected, the repository of stored historical concepts is checked for reuse. Experimental results on both synthetic and real-world data streams demonstrate that the proposed algorithm not only outperforms the state-of-art methods on standard evaluation metrics, but also adapts well in different types of concept drift scenarios especially when concept s reappear.
引用
收藏
页码:1044 / 1052
页数:9
相关论文
共 50 条
  • [1] Rapidly Labeling and Tracking Dynamically Evolving Concepts In Data Streams
    Parker, Brandon S.
    Khan, Latifur
    [J]. 2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2013, : 1161 - 1164
  • [2] Ensemble Diversity in Evolving Data Streams
    Brzezinski, Dariusz
    Stefanowski, Jerzy
    [J]. DISCOVERY SCIENCE, (DS 2016), 2016, 9956 : 229 - 244
  • [3] AN IMPROVING ONLINE ACCURACY UPDATED ENSEMBLE METHOD IN LEARNING FROM EVOLVING DATA STREAMS
    Gu, Xiao-Feng
    Xu, Jia-Wen
    Huang, Shi-Jing
    Wang, Liao-Ming
    [J]. 2014 11TH INTERNATIONAL COMPUTER CONFERENCE ON WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING (ICCWAMTIP), 2014, : 430 - 433
  • [4] An Ensemble of Classifiers for coping with Recurring Contexts in Data Streams
    Katakis, Ioannis
    Tsoumakas, Grigorios
    Vlahavas, Ioannis
    [J]. ECAI 2008, PROCEEDINGS, 2008, 178 : 763 - +
  • [5] Heterogeneous ensemble selection for evolving data streams
    Luong, Anh Vu
    Nguyen, Tien Thanh
    Liew, Alan Wee-Chung
    Wang, Shilin
    [J]. PATTERN RECOGNITION, 2021, 112
  • [6] New Ensemble Methods For Evolving Data Streams
    Bifet, Albert
    Holmes, Geoff
    Pfahringer, Bernhard
    Kirkby, Richard
    Gavalda, Ricard
    [J]. KDD-09: 15TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2009, : 139 - 147
  • [7] Improved Ensemble Classification for Evolving Data Streams
    Tian, Hui
    Wang, Lulu
    Shen, Hong
    Liew, Alan Wee-Chung
    [J]. IEEE INTELLIGENT SYSTEMS, 2022, 37 (01) : 38 - 50
  • [8] Recurring concept meta-learning for evolving data streams
    Anderson, Robert
    Koh, Yun Sing
    Dobbie, Gillian
    Bifet, Albert
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2019, 138
  • [9] A Method For Evolving Data Streams
    Wankhade, Kapil
    Hasan, Tasneem
    Thool, Ravindra
    [J]. 2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 1619 - 1622
  • [10] A Probabilistic Framework for Adapting to Changing and Recurring Concepts in Data Streams
    Halstead, Ben
    Koh, Yun Sing
    Riddle, Patricia
    Pechenizkiy, Mykola
    Bifet, Albert
    [J]. 2022 IEEE 9TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2022, : 407 - 416