Learning Patterns from Imbalanced Evolving Data Streams

被引:0
|
作者
Almuammar, Manal [1 ]
Fasli, Maria [2 ]
机构
[1] Univ Essex, Sch Comp Sci & Elect Engn, Colchester, Essex, England
[2] Univ Essex, Sch Comp Sci & Elect Engn, Inst Analyt & Data Sci, Colchester, Essex, England
基金
英国经济与社会研究理事会;
关键词
data stream; pattern discovery; imbalanced classes; evolving stream; rare pattern; DRIFT;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Learning patterns from evolving data streams is challenging due to the characteristics of such streams: being continuous, unbounded and high speed data of non-stationary nature, which must be processed on the fly, using minimal computational resources. An additional challenge is imposed by the imbalanced data streams in many real-world applications, this difficulty becomes more prominent in multi-class learning tasks. This paper investigates the multi-class imbalance problem in non-stationary streams and develops a method to exploit real-time stream data and capture the dynamic of patterns from heterogeneous streams. In particular, we seek to extend concept drift adaptation techniques into imbalanced classes' scenarios, and accordingly, we use an adaptive learner to classify multiple streams over a sequence of titled time windows. We include examples of the falsely classified instances in the training set, then we propose using a dynamic support threshold to discover the frequent patterns in these streams. We conduct an experiment on the car parking lots environment of a typical University with three simulated streams from sensors, smart pay stations and a mobile application. The result indicates the efficiency of applying adaptive learner approaches and modifying the training set to cope with the concept drift in multi-class imbalance scenarios, it also shows the merit of using a dynamic threshold to detect the rare patterns from evolving streams.
引用
收藏
页码:2048 / 2057
页数:10
相关论文
共 50 条
  • [1] Online Learning From Incomplete and Imbalanced Data Streams
    You, Dianlong
    Xiao, Jiawei
    Wang, Yang
    Yan, Huigui
    Wu, Di
    Chen, Zhen
    Shen, Limin
    Wu, Xindong
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (10) : 10650 - 10665
  • [2] Adaptive Learning from Evolving Data Streams
    Bifet, Albert
    Gavalda, Ricard
    [J]. ADVANCES IN INTELLIGENT DATA ANALYSIS VIII, PROCEEDINGS, 2009, 5772 : 249 - 260
  • [3] The Influence of Multiple Classes on Learning from Imbalanced Data Streams
    Lipska, Agnieszka
    Stefanowski, Jerzy
    [J]. FOURTH INTERNATIONAL WORKSHOP ON LEARNING WITH IMBALANCED DOMAINS: THEORY AND APPLICATIONS, VOL 183, 2022, 183 : 187 - 198
  • [4] Learning model trees from evolving data streams
    Ikonomovska, Elena
    Gama, Joao
    Dzeroski, Saso
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2011, 23 (01) : 128 - 168
  • [5] Learning model trees from evolving data streams
    Elena Ikonomovska
    João Gama
    Sašo Džeroski
    [J]. Data Mining and Knowledge Discovery, 2011, 23 : 128 - 168
  • [6] IEBench: Benchmarking Streaming Learners on Imbalanced Evolving Data Streams
    Bernardo, Alessio
    Ziffer, Giacomo
    Della Valle, Emanuele
    [J]. 21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS ICDMW 2021, 2021, : 331 - 340
  • [7] Online semi-supervised active learning ensemble classification for evolving imbalanced data streams
    Guo, Yinan
    Pu, Jiayang
    Jiao, Botao
    Peng, Yanyan
    Wang, Dini
    Yang, Shengxiang
    [J]. APPLIED SOFT COMPUTING, 2024, 155
  • [8] Online Evaluation of Patterns from Evolving Web Data Streams
    Rojas, Carlos
    Nasraoui, Olfa
    [J]. 2009 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCES ON WEB INTELLIGENCE (WI) AND INTELLIGENT AGENT TECHNOLOGIES (IAT), VOL 1, 2009, : 315 - 318
  • [9] Low-Dimensional Representation Learning from Imbalanced Data Streams
    Korycki, Lukasz
    Krawczyk, Bartosz
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2021, PT I, 2021, 12712 : 629 - 641
  • [10] Mining evolving data streams for frequent patterns
    Laur, Pierre-Alain
    Nock, Richard
    Symphor, Jean-Emile
    Poncelet, Pascal
    [J]. PATTERN RECOGNITION, 2007, 40 (02) : 492 - 503