Active mining of data streams

Authors
Fan, W [1]
Huang, YA [1]
Wang, HX [1]
Yu, PS [1]
Affiliation
[1] IBM Corp, Thomas J Watson Res Ctr, Hawthorne, NY 10532 USA
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Most previously proposed methods for mining data streams make the unrealistic assumption that a "labelled" data stream is readily available and can be mined at any time. However, in most real-world problems, labelled data streams are rarely immediately available. For this reason, models are refreshed periodically, usually in step with the schedule on which labelled data becomes available. This "passive periodic refresh" has several undesirable consequences. In this paper, we propose a new concept of demand-driven active data mining. It estimates the error of the model on the new data stream without knowing the true class labels. When a significantly higher error is suspected, it investigates the true class labels of a selected number of examples in the most recent data stream to verify the suspected higher error.
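The two-stage idea in the abstract (estimate the error without labels, then pay for a few labels only when higher error is suspected) can be illustrated with a minimal sketch. The abstract does not say how the unlabelled error estimate is computed, so the estimator below is an assumption: it uses the classifier's own confidence as a stand-in, which is only meaningful if those confidences are roughly calibrated. The names `estimate_error_unlabelled`, `verify_error_by_sampling`, `active_check`, `label_oracle`, `baseline_error`, and `tolerance` are all hypothetical and do not come from the paper.

```python
import math
import random
from typing import Callable, Optional, Sequence


def estimate_error_unlabelled(confidences: Sequence[float]) -> float:
    """Assumed proxy: mean of (1 - confidence in the predicted class).

    This is NOT the paper's estimator; it is a placeholder that needs no labels.
    """
    if not confidences:
        raise ValueError("need at least one example")
    return sum(1.0 - c for c in confidences) / len(confidences)


def verify_error_by_sampling(
    predictions: Sequence[int],
    label_oracle: Callable[[int], int],
    sample_size: int,
    rng: random.Random,
) -> tuple:
    """Query true labels for a random subset and return (error, standard error)."""
    k = min(sample_size, len(predictions))
    indices = rng.sample(range(len(predictions)), k)
    wrong = sum(1 for i in indices if label_oracle(i) != predictions[i])
    p = wrong / k
    return p, math.sqrt(p * (1.0 - p) / k)


def active_check(
    confidences: Sequence[float],
    predictions: Sequence[int],
    baseline_error: float,
    tolerance: float,
    label_oracle: Callable[[int], int],
    sample_size: int = 50,
    seed: int = 0,
) -> dict:
    """Stage 1: label-free suspicion. Stage 2: verify on a small labelled sample."""
    suspected = estimate_error_unlabelled(confidences)
    verified: Optional[float] = None
    stderr: Optional[float] = None
    refresh = False
    if suspected - baseline_error > tolerance:
        # Labels are requested only when the cheap estimate raises suspicion.
        verified, stderr = verify_error_by_sampling(
            predictions, label_oracle, sample_size, random.Random(seed)
        )
        refresh = verified - baseline_error > tolerance
    return {
        "suspected_error": suspected,
        "verified_error": verified,
        "stderr": stderr,
        "refresh": refresh,
    }
```

In this sketch, `label_oracle(i)` stands for whatever costly process reveals the true class of example `i` in the most recent window, and `baseline_error` would be the error measured when the model was last trained. A real system would probably replace the fixed `tolerance` with a statistical test on the sampled error.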
Pages: 457-461
Page count: 5