An Effective Method for Mining Negative Sequential Patterns From Data Streams

被引:2
|
作者
Zhang, Nannan [1 ]
Ren, Xiaoqiang [1 ]
Dong, Xiangjun [1 ]
机构
[1] Qilu Univ Technol, Shandong Acad Sci, Dept Comp Sci & Technol, Jinan 250353, Peoples R China
基金
中国国家自然科学基金;
关键词
Data mining; Behavioral sciences; Real-time systems; Transient analysis; Heuristic algorithms; Clustering algorithms; Classification algorithms; Data stream; transient; sliding window; negative sequential patterns (NSPs);
D O I
10.1109/ACCESS.2023.3262823
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Traditional negative sequential patterns(NSPs) mining algorithms are used to mine static dataset which are stored in equipment and can be scanned many times. Nowadays, with the development of technology, many applications produce a large amount of data at a very high speed, which is called as data stream. Unlike static data, data stream is transient and can usually be read only once. So, traditional NSP mining algorithm cannot be directly applied to data stream. Briefly, the key reasons are: (1) inefficient negative sequential candidates generation method, (2) one-time mining, (3) lack of real-time processing. To solve this problem, this paper proposed a new algorithm mining NSP from data stream, called nsp-DS. First, we present a method to generate positive and negative sequential candidates simultaneously, and a new negative containment definition. Second, we use a sliding window to store sample data in current time. The continuous mining of entire data stream is realized through the continuous replacement of old and new data. Finally, a prefix tree structure is introduced to store sequential patterns. Whenever the user requests, it traverses the prefix tree to output sequential patterns. The experimental results show that nsp-DS may discover NSPs from data streams.
引用
收藏
页码:31842 / 31854
页数:13
相关论文
共 50 条
  • [41] Mining Sequential Patterns with Timelines from Digital Health Data
    Hryhoruk, Connor C. J.
    Leung, Carson K.
    2023 IEEE INTERNATIONAL CONFERENCE ON DIGITAL HEALTH, ICDH, 2023, : 292 - 294
  • [42] Mining sequential patterns with flexible constraints from MOOC data
    Song, Wei
    Ye, Wei
    Fournier-Viger, Philippe
    APPLIED INTELLIGENCE, 2022, 52 (14) : 16458 - 16474
  • [43] Mining sequential patterns with flexible constraints from MOOC data
    Wei Song
    Wei Ye
    Philippe Fournier-Viger
    Applied Intelligence, 2022, 52 : 16458 - 16474
  • [44] Adaptive load shedding for mining frequent patterns from data streams
    Dang, Xuan Hong
    Ng, Wee-Keong
    Ong, Kok-Leong
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2006, 4081 : 342 - 351
  • [45] Relational Frequent Patterns Mining for Novelty Detection from Data Streams
    Ceci, Michelangelo
    Appice, Annalisa
    Loglisci, Corrado
    Caruso, Costantina
    Fumarola, Fabio
    Valente, Carmine
    Malerba, Donato
    MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, 2009, 5632 : 427 - 439
  • [46] On Mining Progressive Positive and Negative Sequential Patterns Simultaneously
    Huang, Jen-Wei
    Wu, Yong-Bin
    Jaysawal, Bijay Prasad
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2020, 36 (01) : 145 - 169
  • [47] NSPIS: Mining Negative Sequential Patterns with Individual Support
    Huang, Gengsen
    Gan, Wensheng
    Huang, Shan
    Chen, Jiahui
    Chen, Chien-Ming
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 5507 - 5516
  • [48] Mining Interesting Negative Sequential Patterns Based on Influence
    Cui, Fengling
    Ren, Xiaoqiang
    Dong, Xiangjun
    IEEE ACCESS, 2023, 11 : 12925 - 12936
  • [49] Mining actionable repetitive positive and negative sequential patterns
    Sun, Chuanhou
    Ren, Xiaoqiang
    Dong, Xiangjun
    Qiu, Ping
    Wu, Xiaoming
    Zhao, Long
    Guo, Ying
    Gong, Yongshun
    Zhang, Chengqi
    KNOWLEDGE-BASED SYSTEMS, 2024, 302
  • [50] False-negative frequent items mining from data streams with bursting
    Chong, ZH
    Yu, JX
    Lu, HJ
    Zhang, ZJ
    Zhou, AY
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2005, 3453 : 422 - 434