A two-phase approach for unexpected pattern mining

被引:2
|
作者
Zhang, Jingtian [1 ]
Shou, Lidan [1 ]
Wu, Sai [1 ]
Chen, Gang [1 ]
Chen, Ke [1 ]
机构
[1] Zhejiang Univ, Dept Comp Sci & Technol, Hangzhou 310027, Zhejiang, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Frequent pattern mining; Subgroup discovery; Multi-dimensional dataset; Data mining; Anomaly detection; SUBGROUP DISCOVERY; FAST ALGORITHM; EFFICIENT; SD;
D O I
10.1016/j.eswa.2019.112946
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A typical mining task is to retrieve all frequent patterns from a multi-dimensional dataset. Those patterns give us a basic idea of how the data look like and the hidden inherent regularities. However, this is only useful for an unfamiliar dataset, while for datasets that are analyzed periodically, "unexpected" patterns are more interesting (e.g., some customers decided to subscribe to long-term deposits despite the burden of housing loan). In this paper, we propose a new mining job, unexpected mining, which targets at retrieving frequent patterns that are not valid in a reference dataset, but are significant enough in a specific subgroup. Given a reference dataset, we step by step generate all unexpected patterns for all subgroups. We extend existing mining approaches to support the new mining job efficiently. In particular, our scheme consists of an offline process and an online process. Offline process generates candidate patterns and builds an index table. Online process can retrieve unexpected patterns from user-defined subgroups and a given support. Experiments on real datasets show that our approach can find interesting patterns and is very efficient compared to existing approaches. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页数:15
相关论文
共 50 条
  • [31] A two-phase approach for the Radiotherapy Scheduling Problem
    Tu-San Pham
    Louis-Martin Rousseau
    Patrick De Causmaecker
    Health Care Management Science, 2022, 25 : 191 - 207
  • [32] A wake tracking approach for two-phase Schlieren
    Davis, Ian
    Meehan, Rudi O'Reilly
    Nolan, Kevin
    Grennan, Kieran
    Murray, Darina
    INTERNATIONAL JOURNAL OF MULTIPHASE FLOW, 2018, 102 : 38 - 48
  • [33] A TWO-PHASE APPROACH TO FUZZY SYSTEM IDENTIFICATION
    Ta-Wei HUNG
    Shu-Cherng FANG
    Henry L.W.NUTTLE
    Journal of Systems Science and Systems Engineering, 2003, (04) : 408 - 423
  • [34] A continuum approach to two-phase porous media
    Mls, J
    TRANSPORT IN POROUS MEDIA, 1999, 35 (01) : 15 - 36
  • [35] A continuum approach to two-phase porous media
    Charles University, Faculty of Science, Albertov 6, CZ-128 43 Praha 2, Czech Republic
    Transp. Porous Media, 1 (15-36):
  • [36] A two-phase approach to fuzzy system identification
    Ta-Wei Hung
    Shu-Cherng Fang
    Henry L. W. Nuttle
    Journal of Systems Science and Systems Engineering, 2003, 12 (4) : 408 - 423
  • [37] A mechanistic approach to developing two phase flow pattern transition maps for two-phase dielectric fluids subject to high voltage polarization
    Nangle-Smith, S.
    Cotton, J. S.
    INTERNATIONAL JOURNAL OF HEAT AND MASS TRANSFER, 2018, 127 : 1233 - 1247
  • [38] A two-phase scheduling approach for grid computing
    Dong, Fangpeng
    Akl, Selim G.
    PROCEEDINGS OF THE 18TH IASTED INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING AND SYSTEMS, 2006, : 587 - +
  • [39] Two-phase flow pattern transitions of short airlift pumps
    Moisidis, Charalampos T.
    Kastrinakis, Eleftherios G.
    JOURNAL OF HYDRAULIC RESEARCH, 2010, 48 (05) : 680 - 685
  • [40] SPIRAL PATTERN FORMATION IN A SIMPLE TWO-PHASE FLOW SYSTEM
    Yoshikawa, H. N.
    Mathis, C.
    Maissa, P.
    Rousseaux, G.
    CHAOS, COMPLEXITY AND TRANSPORT, 2012, : 113 - 122