Active mining of data streams

Cited by: 0
Authors
Fan, W [1]
Huang, YA [1]
Wang, HX [1]
Yu, PS [1]
Affiliations
[1] IBM Corp, Thomas J Watson Res Ctr, Hawthorne, NY 10532 USA
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Most previously proposed methods for mining data streams make the unrealistic assumption that a "labelled" data stream is readily available and can be mined at any time. However, in most real-world problems, labelled data streams are rarely available immediately. For this reason, models are refreshed periodically, usually in step with the data availability schedule. This "passive periodic refresh" has several undesirable consequences. In this paper, we propose a new concept of demand-driven active data mining. It estimates the error of the model on the new data stream without knowing the true class labels. When a significantly higher error is suspected, it investigates the true class labels of a selected number of examples in the most recent data stream to verify the suspected higher error.
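The abstract does not say how the error is estimated or how examples are selected, so the following Python is only a minimal sketch of the two-step idea (a label-free suspicion signal, then label sampling to verify), not the authors' method. Everything in it is an assumption made for illustration: the shift statistic (total variation distance between predicted-class distributions), the threshold, the uniform random sampling, and the hypothetical names `predicted_class_shift`, `verify_error`, `active_check`, `predict`, and `request_label`. It also assumes non-empty inputs.

```python
import math
import random
from collections import Counter


def predicted_class_shift(train_preds, stream_preds):
    """Label-free signal: total variation distance between the model's
    predicted-class distribution on training data and on the new stream.
    (Illustrative statistic; not taken from the paper.)"""
    classes = set(train_preds) | set(stream_preds)
    p, q = Counter(train_preds), Counter(stream_preds)
    return 0.5 * sum(
        abs(p[c] / len(train_preds) - q[c] / len(stream_preds))
        for c in classes
    )


def verify_error(stream, predict, request_label, n_samples, rng):
    """Request true labels for a small uniform random sample and return
    the sampled error rate with a normal-approximation standard error."""
    sample = rng.sample(stream, min(n_samples, len(stream)))
    wrong = sum(predict(x) != request_label(x) for x in sample)
    err = wrong / len(sample)
    std_err = math.sqrt(err * (1 - err) / len(sample))
    return err, std_err


def active_check(train_preds, stream, predict, request_label,
                 shift_threshold=0.1, n_samples=50, seed=0):
    """Return (suspected, sampled_error, std_err). Labels are requested
    only when the label-free signal exceeds the (assumed) threshold."""
    stream_preds = [predict(x) for x in stream]
    shift = predicted_class_shift(train_preds, stream_preds)
    if shift <= shift_threshold:
        return False, None, None
    err, std_err = verify_error(stream, predict, request_label,
                                n_samples, random.Random(seed))
    return True, err, std_err
```

In this sketch, `request_label` stands in for whatever costly process reveals a true class label, so it is called only on the sampled examples and only after the unlabelled check raises suspicion.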
Pages: 457-461
Page count: 5
Related papers
50 in total
  • [31] Mining evolving data streams for frequent patterns
    Laur, Pierre-Alain
    Nock, Richard
    Symphor, Jean-Emile
    Poncelet, Pascal
    PATTERN RECOGNITION, 2007, 40 (02) : 492 - 503
  • [32] SAMOA: A Platform for Mining Big Data Streams
    De Francisci Morales, Gianmarco
    PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'13 COMPANION), 2013, : 777 - 778
  • [33] Automatic Sequential Pattern Mining in Data Streams
    Kawabata, Koki
    Matsubara, Yasuko
    Sakurai, Yasushi
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 1733 - 1742
  • [34] MFIS - Mining frequent itemsets on data streams
    Xie, Zhi-jun
    Chen, Hong
    Li, Cuiping
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2006, 4093 : 1085 - 1093
  • [35] Mining Robust Frequent Items in Data Streams
    Xia, Rui
    Dai, Haipeng
    Du, Zhanchao
    Li, Meng
    Liu, Alex X.
    Chen, Guihai
    2020 IEEE INTERNATIONAL CONFERENCE ON JOINT CLOUD COMPUTING (JCC 2020), 2020, : 110 - 117
  • [36] Mining discriminative items in multiple data streams
    Zhenhua Lin
    Bin Jiang
    Jian Pei
    Daxin Jiang
    World Wide Web, 2010, 13 : 497 - 522
  • [37] Mining for social processes in intelligence data streams
    Savell, Robert
    Cybenko, George
    SOCIAL COMPUTING, BEHAVIORAL MODELING AND PREDICTION, 2008, : 110 - 119
  • [38] Mining discriminative items in multiple data streams
    Lin, Zhenhua
    Jiang, Bin
    Pei, Jian
    Jiang, Daxin
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2010, 13 (04): : 497 - 522
  • [39] Data Streams Fusion by Frequent Correlations Mining
    Ziembinski, Radoslaw Z.
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2015, 2015, 9375 : 1 - 8
  • [40] Mining emerging patterns and classification in data streams
    Alhammady, H
    Ramamohanarao, K
    2005 IEEE/WIC/ACM International Conference on Web Intelligence, Proceedings, 2005, : 272 - 275