Active mining of data streams

被引:0
|
作者
Fan, W [1 ]
Huang, YA [1 ]
Wang, HX [1 ]
Yu, PS [1 ]
机构
[1] IBM Corp, Thomas J Watson Res Ctr, Hawthorne, NY 10532 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Most previously proposed mining methods on data streams make an unrealistic assumption that "labelled" data stream is readily available and can be mined at anytime. However, in most real-world problems, labelled data streams are rarely immediately available. Due to this reason, models are refreshed periodically, that is usually synchronized with data availability schedule. There are several undesirable consequences of this "passive periodic refresh". In this paper, we propose a new concept of demand-driven active data mining. It estimates the error of the model on the new data stream without knowing the true class labels. When significantly higher error is suspected, it investigates the true class labels of a selected number of examples in the most recent data stream to verify the suspected higher error.
引用
收藏
页码:457 / 461
页数:5
相关论文
共 50 条
  • [41] Active learning for data streams: a survey
    Cacciarelli, Davide
    Kulahci, Murat
    MACHINE LEARNING, 2024, 113 (01) : 185 - 239
  • [42] Active learning from data streams
    Zhu, Xingquan
    Zhang, Peng
    Lin, Xiaodong
    Shi, Yong
    ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2007, : 757 - +
  • [43] Approximate data mining for sliding window based data streams
    Yin, Kuo-Cheng
    Hsieh, Yu-Lung
    Yang, Don-Lin
    Journal of Computers, 2012, 23 (02): : 1 - 13
  • [44] Incremental Mining of Across-streams Sequential Patterns in Multiple Data Streams
    Yang, Shih-Yang
    Chao, Ching-Ming
    Chen, Po-Zung
    Sun, Chu-Hao
    JOURNAL OF COMPUTERS, 2011, 6 (03) : 449 - 457
  • [45] Efficient mining of frequent itemsets from data streams
    Leung, Carson Kai-Sang
    Brajczuk, Dale A.
    SHARING DATA, INFORMATION AND KNOWLEDGE, PROCEEDINGS, 2008, 5071 : 2 - 14
  • [46] An efficient algorithm for frequent itemset mining on data streams
    Xie Zhi-Jun
    Chen Hong
    Li, Cuiping
    ADVANCES IN DATA MINING: APPLICATIONS IN MEDICINE, WEB MINING, MARKETING, IMAGE AND SIGNAL MINING, 2006, 4065 : 474 - 491
  • [47] Ratio Rules Mining in Concept Drifting Data Streams
    Fan, Wei
    Watanabe, Toyohide
    Asakura, Koichi
    WCECS 2009: WORLD CONGRESS ON ENGINEERING AND COMPUTER SCIENCE, VOLS I AND II, 2009, : 809 - +
  • [48] Mining Strongly Closed Itemsets from Data Streams
    Trabold, Daniel
    Horvath, Tamas
    DISCOVERY SCIENCE, DS 2017, 2017, 10558 : 251 - 266
  • [49] Anytime Frequent Itemset Mining of Transactional Data Streams
    Goyal, Poonam
    Challa, Jagat Sesh
    Shrivastava, Shivin
    Goyal, Navneet
    BIG DATA RESEARCH, 2020, 21
  • [50] A Survey on Closed Frequent Itemset Mining on Data Streams
    Bai, Pavitra . S.
    Kumar, Ravi . G. . K.
    PROCEEDINGS OF THE 2016 2ND INTERNATIONAL CONFERENCE ON CONTEMPORARY COMPUTING AND INFORMATICS (IC3I), 2016, : 542 - 547