Fault-tolerant tile mining

被引:0
|
作者
Lu, Haibing [1 ]
Zhu, Wendong [2 ]
Phan, Joseph [3 ]
Ghiassi, M. [4 ]
Fang, Yi [3 ]
Hong, Yuan [5 ]
He, Xiaoyun [6 ]
机构
[1] Santa Clara Univ, Dept Operat Management & Informat Syst, Santa Clara, CA 95053 USA
[2] Global Energy Interconnect Res Inst North Amer, Santa Clara, CA USA
[3] Santa Clara Univ, Dept Comp Engn, Santa Clara, CA 95053 USA
[4] Santa Clara Univ, Dept Operat Management & Informat Syst, Santa Clara, CA 95053 USA
[5] IIT, Dept Comp Sci, Chicago, IL 60616 USA
[6] Auburn Univ, Dept Informat Syst, Montgomery, AL 36117 USA
关键词
Itemset mining; Fault-tolerant; Optimization; Exact algorithm;
D O I
10.1016/j.eswa.2018.02.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Interesting itemset mining is a fundamental research problem in knowledge management and machine learning. It is intended to identify interesting relations between variables in a database using some measures of interestingness and has a number of applications, including market basket analysis, web usage mining, intrusion detection, and many others. This paper proposes a new interestingness measure, the fault-tolerant tile. That is based on two observations: (1) the length of an itemset can be as important as its frequency; (2) knowledge discovery from real-world datasets calls for fault -tolerant data mining (e.g. extracting fault -tolerant association rules, analyzing noisy datasets). Given a user-defined fault tolerance value, we are interested in finding the maximum/top-k fault-tolerant tiles. Due to the exponential search space of candidate itemsets, both problems are NP-hard. While using some monotonic property to prune search space is a common strategy for interesting itemset mining, no monotonic property is available for this problem. To tackle the challenge, we utilize the branch-and-bound search strategy to analyze the characteristics of candidate itemsets at each searching branch and estimating their bounds. Our experimental results show that our algorithms can effectively analyze real datasets and retrieve meaningful results. (C) 2018 Elsevier Ltd. All rights reserved.
引用
下载
收藏
页码:25 / 42
页数:18
相关论文
共 50 条
  • [21] Fault-Tolerant Swarms
    Perez, Ivan
    Goodloe, Alwyn
    Edmonson, William
    2019 IEEE INTERNATIONAL CONFERENCE ON SPACE MISSION CHALLENGES FOR INFORMATION TECHNOLOGY (SMC-IT 2019), 2019, : 47 - 54
  • [22] Fault-tolerant estimation
    Mutuel, LH
    Speyer, AL
    PROCEEDINGS OF THE 2000 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2000, : 3718 - 3722
  • [23] FAULT-TOLERANT SYSTEMS
    AVIZIENIS, A
    IEEE TRANSACTIONS ON COMPUTERS, 1976, 25 (12) : 1304 - 1312
  • [24] FAULT-TOLERANT SYSTEMS
    SINGH, AD
    MURUGESAN, S
    COMPUTER, 1990, 23 (07) : 15 - 17
  • [25] FAULT-TOLERANT BROADCASTS
    SCHNEIDER, FB
    GRIES, D
    SCHLICHTING, RD
    SCIENCE OF COMPUTER PROGRAMMING, 1984, 4 (01) : 1 - 15
  • [26] Universal fault-tolerant quantum computation using fault-tolerant conversion schemes
    Luo, Lan
    Ma, Zhi
    NEW JOURNAL OF PHYSICS, 2019, 21 (08)
  • [27] FlexDM: Simple, parallel and fault-tolerant data mining using WEKA
    Flannery, Madison
    Budden, David M.
    Mendes, Alexandre
    SOURCE CODE FOR BIOLOGY AND MEDICINE, 2015, 10
  • [28] Performance Analysis of a Fault-Tolerant Exact Motif Mining Algorithm on the Cloud
    Nhan Nguyen
    Khan, Mohammad Maifi Hasan
    2013 IEEE 32ND INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2013,
  • [29] Study of Fault-tolerant Control Strategies for a Fault-tolerant Permanent Magnet Motor
    Bai Hongfen
    Zhu Jingwei
    Qin Junfeng
    PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 6455 - 6460
  • [30] Mining statistical information of frequent fault-tolerant patterns in transactional databases
    Poernorno, Ardian Kristanto
    Gopalkrishnan, Vivekanand
    ICDM 2007: PROCEEDINGS OF THE SEVENTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, 2007, : 272 - 281