Top-k Pattern Matching Using an Information-Theoretic Criterion over Probabilistic Data Streams

被引:2
|
作者
Sugiura, Kento [1 ]
Ishikawa, Yoshiharu [2 ]
机构
[1] Nagoya Univ, Grad Sch Informat Sci, Nagoya, Aichi, Japan
[2] Nagoya Univ, Grad Sch Informat, Nagoya, Aichi, Japan
来源
基金
日本科学技术振兴机构;
关键词
Complex event processing; Probabilistic data streams; Pattern matching; Regular expressions; Information-theoretic criterion; COMPLEX EVENT DETECTION; EFFICIENT;
D O I
10.1007/978-3-319-63579-8_39
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
As the development of data mining technologies for sensor data streams, more sophisticated methods for complex event processing are demanded. In the case of event recognition, since event recognition results may contain errors, we need to deal with the uncertainty of events. We therefore consider probabilistic event data streams with occurrence probabilities of events, and develop a pattern matching method based on regular expressions. In this paper, we first analyze the semantics of pattern matching over non-probabilistic data streams, and then propose the problem of top-k pattern matching over probabilistic data streams. We introduce the use of an information-theoretic criterion to select appropriate matches as the result of pattern matching. Then, we present an efficient algorithm to detect top-k matches, and evaluate the effectiveness of our approach using real and synthetic datasets.
引用
收藏
页码:511 / 526
页数:16
相关论文
共 50 条
  • [11] Diversified Top-k Graph Pattern Matching
    Fan, Wenfei
    Wang, Xin
    Wu, Yinghui
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (13): : 1510 - 1521
  • [12] Diversified Top-k Spatial Pattern Matching
    Xie, Jiahua
    Chen, Hongmei
    Wang, Lizhen
    SPATIAL DATA AND INTELLIGENCE, SPATIALDI 2022, 2022, 13614 : 87 - 98
  • [13] Sliding Window Top-K Monitoring over Distributed Data Streams
    Chen B.
    Lv Z.
    Yu X.
    Liu Y.
    Data Science and Engineering, 2017, 2 (4) : 289 - 300
  • [14] Top-k/w publish/subscribe: A publish/subscribe model for continuous top-k processing over data streams
    Pripuzic, Kresimir
    Zarko, Ivana Podnar
    Aberer, Karl
    INFORMATION SYSTEMS, 2014, 39 : 256 - 276
  • [15] Probabilistic Reverse Top-k Query on Probabilistic Data
    Trieu Minh Nhut Le
    Cao, Jinli
    DATABASES THEORY AND APPLICATIONS, ADC 2023, 2024, 14386 : 30 - 43
  • [16] Finding top-k elements in data streams
    Homem, Nuno
    Carvalho, Joao Paulo
    INFORMATION SCIENCES, 2010, 180 (24) : 4958 - 4974
  • [17] Information-Theoretic Data Discarding for Dynamic Trees on Data Streams
    Anagnostopoulos, Christoforos
    Gramacy, Robert B.
    ENTROPY, 2013, 15 (12) : 5510 - 5535
  • [18] An efficient algorithm for mining top-k closed frequent item sets over data streams over data streams
    Yimin, Mao
    Xiaofang, Xue
    Jinqing, Chen
    Telkomnika - Indonesian Journal of Electrical Engineering, 2013, 11 (07): : 3759 - 3766
  • [19] Mining top-k frequent patterns over data streams sliding window
    Chen, Hui
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2014, 42 (01) : 111 - 131
  • [20] Effective and efficient top-k query processing over incomplete data streams
    Ren, Weilong
    Lian, Xiang
    Ghazinour, Kambiz
    INFORMATION SCIENCES, 2021, 544 : 343 - 371