MaPle: A fast algorithm for maximal pattern-based clustering

Authors
Pei, J [1]
Zhang, XL [1]
Cho, MJ [1]
Wang, HX [1]
Yu, PS [1]
Affiliations
[1] SUNY Buffalo, Buffalo, NY 14260 USA
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Pattern-based clustering is important in many applications, such as DNA micro-array data analysis, automatic recommendation systems, and target marketing systems. However, pattern-based clustering in large databases is challenging. On the one hand, there can be a huge number of clusters, many of them redundant, which makes pattern-based clustering ineffective. On the other hand, previously proposed methods may not be efficient or scalable in mining large databases. In this paper, we study the problem of maximal pattern-based clustering. Redundant clusters are avoided completely by mining only the maximal pattern-based clusters. MaPle, an efficient and scalable mining algorithm, is developed. It conducts a depth-first, divide-and-conquer search and prunes unnecessary branches smartly. Our extensive performance study on both synthetic and real data sets shows that maximal pattern-based clustering is effective: it reduces the number of clusters substantially. Moreover, MaPle is more efficient and scalable than previously proposed pattern-based clustering methods in mining large databases.
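To make the notion of a "maximal" pattern-based cluster concrete, the following is a minimal brute-force sketch in Python. It is not the MaPle algorithm: it does no depth-first search and no pruning. It assumes the pScore condition commonly used in the pattern-based clustering literature (every 2x2 submatrix of the cluster satisfies |(d_xa - d_xb) - (d_ya - d_yb)| <= delta), which the abstract itself does not spell out. The function names `is_pcluster` and `keep_maximal` are hypothetical and chosen only for illustration.

```python
from itertools import combinations


def is_pcluster(data, rows, cols, delta):
    """Check the assumed pScore condition on every 2x2 submatrix.

    data is a list of rows (lists of numbers); rows and cols are
    iterables of indices; delta is the tolerance threshold.
    """
    for x, y in combinations(rows, 2):
        for a, b in combinations(cols, 2):
            p_score = abs((data[x][a] - data[x][b]) - (data[y][a] - data[y][b]))
            if p_score > delta:
                return False
    return True


def keep_maximal(clusters):
    """Keep only clusters that are not contained in a different cluster.

    Each cluster is a pair (frozenset of rows, frozenset of columns).
    A cluster is redundant if another cluster covers all of its rows
    and all of its columns.
    """
    unique = set(clusters)
    return [
        c
        for c in unique
        if not any(c != o and c[0] <= o[0] and c[1] <= o[1] for o in unique)
    ]
```

In this sketch a cluster such as rows {0, 1} on columns {0, 1} is dropped whenever rows {0, 1} on columns {0, 1, 2} is also a cluster, which is the kind of redundancy the abstract says maximal clusters avoid. The quadratic containment check is only practical for small inputs.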
Pages: 259-266
Number of pages: 8