Anensemble method for data stream classification in the presence of concept drift

被引:5
|
作者
Abbaszadeh, Omid [1 ]
Amiri, Ali [1 ]
Khanteymoori, Ali Reza [1 ]
机构
[1] Univ Zanjan, Dept Comp Engn, Zanjan 4537138791, Iran
关键词
Data stream; Classificaion; Ensemble classifiers; Concept drift;
D O I
10.1631/FITEE.1400398
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
One recent area of interest in computer science is data stream management and processing. By 'data stream', we refer to continuous and rapidly generated packages of data. Specific features of data streams are immense volume, high production rate, limited data processing time, and data concept drift; these features differentiate the data stream from standard types of data. An issue for the data stream is classification of input data. A novel ensemble classifier is proposed in this paper. The classifier uses base classifiers of two weighting functions under different data input conditions. In addition, a new method is used to determine drift, which emphasizes the precision of the algorithm. Another characteristic of the proposed method is removal of different numbers of the base classifiers based on their quality. Implementation of a weighting mechanism to the base classifiers at the decision-making stage is another advantage of the algorithm. This facilitates adaptability when drifts take place, which leads to classifiers with higher efficiency. Furthermore, the proposed method is tested on a set of standard data and the results confirm higher accuracy compared to available ensemble classifiers and single classifiers. In addition, in some cases the proposed classifier is faster and needs less storage space.
引用
收藏
页码:1059 / 1068
页数:10
相关论文
共 50 条
  • [41] Data stream mining: methods and challenges for handling concept drift
    Scott Wares
    John Isaacs
    Eyad Elyan
    [J]. SN Applied Sciences, 2019, 1
  • [42] Data stream mining: methods and challenges for handling concept drift
    Wares, Scott
    Isaacs, John
    Elyan, Eyad
    [J]. SN APPLIED SCIENCES, 2019, 1 (11)
  • [43] Multidimensional surrogate stability to detect data stream concept drift
    da Costa, Fausto G.
    Duarte, Felipe S. L. G.
    Vallim, Rosane M. M.
    de Mello, Rodrigo F.
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2017, 87 : 15 - 29
  • [44] Self-adaption neighborhood density clustering method for mixed data stream with concept drift
    Xu, Shuliang
    Feng, Lin
    Liu, Shenglan
    Qiao, Hong
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 89
  • [45] Novel Class Detection with Concept Drift in Data Stream - AhtNODE
    Gandhi, Jay
    Gandhi, Vaibhav
    [J]. INTERNATIONAL JOURNAL OF DISTRIBUTED SYSTEMS AND TECHNOLOGIES, 2020, 11 (01) : 15 - 26
  • [46] Streaming Malware Classification in the Presence of Concept Drift and Class Imbalance
    Kegelmeyer, W. Philip
    Chiang, Ken
    Ingram, Joe
    [J]. 2013 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2013), VOL 2, 2013, : 48 - 53
  • [47] Transcending TRANSCEND: Revisiting Malware Classification in the Presence of Concept Drift
    Barbero, Federico
    Pendlebury, Feargus
    Pierazzi, Fabio
    Cavallaro, Lorenzo
    [J]. 43RD IEEE SYMPOSIUM ON SECURITY AND PRIVACY (SP 2022), 2022, : 805 - 823
  • [48] Overview of Wind and Photovoltaic Data Stream Classification and Data Drift Issues
    Zhu, Xinchun
    Wu, Yang
    Zhao, Xu
    Yang, Yunchen
    Liu, Shuangquan
    Shi, Luyi
    Wu, Yelong
    [J]. ENERGIES, 2024, 17 (17)
  • [49] Flexible decision tree for data stream classification in the presence of concept change, noise and missing values
    Sattar Hashemi
    Ying Yang
    [J]. Data Mining and Knowledge Discovery, 2009, 19 : 95 - 131
  • [50] Flexible decision tree for data stream classification in the presence of concept change, noise and missing values
    Hashemi, Sattar
    Yang, Ying
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 2009, 19 (01) : 95 - 131