Transfer estimation of evolving class priors in data stream classification

被引:25
|
作者
Zhang, Zhihao [1 ]
Zhou, Jie [1 ]
机构
[1] Tsinghua Univ, Dept Automat, State Key Lab Intelligent Technol & Syst, Tsinghua Natl Lab Informat Sci & Technol TNList, Beijing 100084, Peoples R China
基金
中国国家自然科学基金;
关键词
Concept drift; Transfer learning; Prior estimation; PRIOR PROBABILITIES;
D O I
10.1016/j.patcog.2010.03.021
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data stream classification is a hot topic in data mining research. The great challenge is that the class priors may evolve along the data sequence. Algorithms have been proposed to estimate the dynamic class priors and adjust the classifier accordingly. However, the existing algorithms do not perform well on prior estimation due to the lack of samples from the target distribution. Sample size has great effects in parameter estimation and small-sample effects greatly contaminate the estimation performance. In this paper, we propose a novel parameter estimation method called transfer estimation. Transfer estimation makes use of samples not only from the target distribution but also from similar distributions. We apply this new estimation method to the existing algorithms and obtain an improved algorithm. Experiments on both synthetic and real data sets show that the improved algorithm outperforms the existing algorithms on both class prior estimation and classification. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:3151 / 3161
页数:11
相关论文
共 50 条
  • [1] Transfer estimation and the applications in data stream classification
    Zhang, Zhihao
    Zhou, Jie
    MIPPR 2011: PATTERN RECOGNITION AND COMPUTER VISION, 2011, 8004
  • [2] Non-stationary data sequence classification using online class priors estimation
    Yang, Chunyu
    Zhou, Jie
    PATTERN RECOGNITION, 2008, 41 (08) : 2656 - 2664
  • [3] Exploiting evolving micro-clusters for data stream classification with emerging class detection
    Din, Salah Ud
    Shao, Junming
    INFORMATION SCIENCES, 2020, 507 : 404 - 420
  • [4] Evolving Big Data Stream Classification with MapReduce
    Haque, Ahsanul
    Parker, Brandon
    Khan, Latifur
    Thuraisingham, Bhavani
    2014 IEEE 7TH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD), 2014, : 570 - 577
  • [5] Adaptive random forests for evolving data stream classification
    Gomes, Heitor M.
    Bifet, Albert
    Read, Jesse
    Barddal, Jean Paul
    Enembreck, Fabricio
    Pfharinger, Bernhard
    Holmes, Geoff
    Abdessalem, Talel
    MACHINE LEARNING, 2017, 106 (9-10) : 1469 - 1495
  • [6] Streaming Random Patches for Evolving Data Stream Classification
    Gomes, Heitor Murilo
    Read, Jesse
    Bifet, Albert
    2019 19TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM 2019), 2019, : 240 - 249
  • [7] Adaptive regularized ensemble for evolving data stream classification
    Paim, Aldo M.
    Enembreck, Fabricio
    PATTERN RECOGNITION LETTERS, 2024, 180 : 55 - 61
  • [8] Adaptive random forests for evolving data stream classification
    Heitor M. Gomes
    Albert Bifet
    Jesse Read
    Jean Paul Barddal
    Fabrício Enembreck
    Bernhard Pfharinger
    Geoff Holmes
    Talel Abdessalem
    Machine Learning, 2017, 106 : 1469 - 1495
  • [9] Adaptive random tree ensemble for evolving data stream classification
    Paim, Aldo M.
    Enembreck, Fabricio
    KNOWLEDGE-BASED SYSTEMS, 2025, 309
  • [10] Correction to: Adaptive random forests for evolving data stream classification
    Heitor M. Gomes
    Albert Bifet
    Jesse Read
    Jean Paul Barddal
    Fabrício Enembreck
    Bernhard Pfahringer
    Geoff Holmes
    Talel Abdessalem
    Machine Learning, 2019, 108 : 1877 - 1878