A Fast Parallel Algorithm for Discovering Frequent Patterns

被引:7
|
作者
Lin, Kawuu W. [1 ]
Luo, Yu-Chin [1 ]
机构
[1] Natl Kaohsiung Univ Appl Sci, Dept Comp Sci & Informat Engn, Kaohsiung 807, Taiwan
关键词
Data mining; cloud computing; association rule mining; frequent pattern mining; privacy preserved;
D O I
10.1109/GRC.2009.5255089
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fast discovery of frequent patterns is the most extensively discussed problem in data mining fields due to its wide applications. As the size of database increases, the computation time and the required memory increase severely. The difficulty of mining large database launched the research of designing parallel and distributed algorithms to solve the problem. Most of the past studies tried to parallelize the computation by dividing the database and distribute the divided database to other nodes for mining. This approach might leak data out and evidently is not suitable to be applied to sensitive domains like health-care. In this paper, we propose a novel data mining algorithm named FD-Mine that is able to efficiently utilize the nodes to discover frequent patterns in cloud computing environments with data privacy preserved. Through empirical evaluations on various simulation conditions, the proposed FD-Mine delivers excellent performance in terms of scalability and execution time.
引用
收藏
页码:398 / 403
页数:6
相关论文
共 50 条
  • [21] A fast and distributed algorithm for mining frequent patterns in congested networks
    Lin, Kawuu W.
    Chung, Sheng-Hao
    Lin, Chun-Cheng
    COMPUTING, 2016, 98 (03) : 235 - 256
  • [22] An Efficient and Fast Algorithm for Mining Frequent Patterns on Multiple Biosequences
    Liu, Wei
    Chen, Ling
    COMPUTER AND COMPUTING TECHNOLOGIES IN AGRICULTURE IV, PT 1, 2011, 344 : 178 - 194
  • [23] Parallel Algorithm for Discovering and Comparing Three-Dimensional Proteins Patterns
    Valdes-Jimenez, Alejandro
    Reyes-Parada, Miguel
    Nunez-Vivanco, Gabriel
    Jimenez-Gonzalez, Daniel
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2024, 21 (03) : 508 - 515
  • [24] A fast algorithm for discovering optimal string patterns in large text databases
    Arimura, H
    Wataki, A
    Fujino, R
    Araikawa, S
    ALGORITHMIC LEARNING THEORY, 1998, 1501 : 247 - 261
  • [25] Discovering Frequent Patterns on Agrometeorological Data with TrieMotif
    Chino, Daniel Y. T.
    Goncalves, Renata R. V.
    Romani, Luciana A. S.
    Traina, Caetano, Jr.
    Traina, Agma J. M.
    ENTERPRISE INFORMATION SYSTEMS, ICEIS 2014, 2015, 227 : 91 - 107
  • [26] A Relational Approach for Discovering Frequent Patterns with Disjunctions
    Loglisci, Corrado
    Ceci, Michelangelo
    Malerba, Donato
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, 2010, 6263 : 263 - 274
  • [27] Discovering maximal frequent patterns in sequence groups
    Guan, JW
    Bell, DA
    Liu, DY
    ROUGH SETS AND CURRENT TRENDS IN COMPUTING, 2004, 3066 : 602 - 609
  • [28] Discovering frequent patterns for in-flight incidents
    Sene, Alsane
    Kamsu-Foguem, Bernard
    Rumeau, Pierre
    COGNITIVE SYSTEMS RESEARCH, 2018, 49 : 97 - 113
  • [29] A FAST PARALLEL ALGORITHM FOR DOT LINKING IN GLASS PATTERNS
    GROSS, A
    HARTLEY, R
    ROSENFELD, A
    PATTERN RECOGNITION LETTERS, 1985, 3 (04) : 263 - 270
  • [30] A MODIFIED FAST PARALLEL ALGORITHM FOR THINNING DIGITAL PATTERNS
    CHEN, YS
    HSU, WH
    PATTERN RECOGNITION LETTERS, 1988, 7 (02) : 99 - 106