MaPle:: A fast algorithm for maximal pattern-based clustering

被引:0
|
作者
Pei, J [1 ]
Zhang, XL [1 ]
Cho, MJ [1 ]
Wang, HX [1 ]
Yu, PS [1 ]
机构
[1] SUNY Buffalo, Buffalo, NY 14260 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pattern-based clustering is important in many applications, such as DNA micro-array data analysis, automatic recommendation systems and target marketing systems. However pattern-based clustering in large databases is challenging. On the one hand, there can be a huge number of clusters and many of them can be redundant and thus make the pattern-based clustering ineffective. On the other hand, the previous proposed methods may not be efficient or scalable in mining large databases. In this paper, we study the problem of maximal pattern-based clustering. Redundant clusters are avoided completely by mining only the maximal pattern-based clusters. MaPle, an efficient and scalable mining algorithm is developed. It conducts a depth-first, divide-and-conquer search and prunes unnecessary branches smartly. Our extensive performance study on both synthetic data sets and real data sets shows that maximal pattern-based clustering is effective. It reduces the number of clusters substantially. Moreover MaPle is more efficient and scalable than the previously proposed pattern-based clustering methods in mining large databases.
引用
收藏
页码:259 / 266
页数:8
相关论文
共 50 条
  • [1] O-SM: A fast algorithm for mining candidate clusters in pattern-based clustering
    Guo, Jingfeng
    Ma, Qian
    Liu, Hanfeng
    2007 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, VOLS 1 AND 2, 2007, : 127 - 132
  • [2] GHIC: A hierarchical pattern-based clustering algorithm for grouping Web transactions
    Yang, YH
    Padmanabhan, B
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (09) : 1300 - 1304
  • [3] Pattern-based clustering and attribute analysis
    Gabriela Alexe
    Sorin Alexe
    Peter L. Hammer
    Soft Computing, 2006, 10 : 442 - 452
  • [4] Pattern-based clustering and attribute analysis
    Alexe, G
    Alexe, S
    Hammer, PL
    SOFT COMPUTING, 2006, 10 (05) : 442 - 452
  • [5] A fast subspace clustering algorithm based on pattern similarity
    Gan, Yanglan
    Guan, Jihong
    Wang, Hao
    FOURTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 3, PROCEEDINGS, 2007, : 253 - +
  • [6] Pattern-based clustering problem based on fuzzy measures
    Gutierrez, I
    Barroso, M.
    Gomez, D.
    Castro, C.
    Espinola, R.
    DEVELOPMENTS OF ARTIFICIAL INTELLIGENCE TECHNOLOGIES IN COMPUTATION AND ROBOTICS, 2020, 12 : 412 - 420
  • [7] A pattern-based approach to conceptual clustering in FOL
    Lisi, Francesca A.
    CONCEPTUAL STRUCTURES: INSPIRATION AND APPLICATION, 2006, 4068 : 346 - 359
  • [8] Pattern-Based Corner Detection Algorithm
    Le, Xuesong
    Gonzalez, Ruben
    2009 PROCEEDINGS OF 6TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS (ISPA 2009), 2009, : 242 - +
  • [9] A pattern-based entity resolution algorithm
    Liu, Hui-Ping
    Jin, Che-Qing
    Zhou, Ao-Ying
    Jisuanji Xuebao/Chinese Journal of Computers, 2015, 38 (09): : 1796 - 1808
  • [10] Fast pattern-based algorithms for cutting stock
    Brandao, Filipe
    Pedroso, Joao Pedro
    COMPUTERS & OPERATIONS RESEARCH, 2014, 48 : 69 - 80