Discovering pattern-based subspace clusters by pattern tree

被引:10
|
作者
Guan, Jihong [1 ]
Gan, Yanglan [1 ]
Wang, Hao [2 ]
机构
[1] Tongji Univ, Dept Comp Sci & Technol, Shanghai 201804, Peoples R China
[2] Hefei Univ Technol, Dept Comp Sci & Technol, Hefei 23009, Peoples R China
基金
中国国家自然科学基金;
关键词
Clustering analysis; Subspace clustering; Pattern similarity; Pattern tree;
D O I
10.1016/j.knosys.2009.02.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditional clustering models based on distance similarity are not always effective in capturing correlation among data objects, while pattern-based clustering can do well in identifying correlation hidden among data objects. However, the state-of-the-art pattern-based clustering methods are inefficient and provide no metric to measure the clustering quality. This paper presents a new pattern-based subspace clustering method, which can tackle the problems mentioned above. Observing the analogy between mining frequent itemsets and discovering subspace clusters, we apply pattern tree - a structure used in frequent itemsets mining to determining the target subspaces by scanning the database once, which can be done efficiently in large datasets. Furthermore, we introduce a general clustering quality evaluation model to guide the identifying of meaningful clusters. The proposed new method enables the users to set flexibly proper quality-control parameters to meet different needs. Experimental results on synthetic and real datasets show that our method outperforms the existing methods in both efficiency and effectiveness. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:569 / 579
页数:11
相关论文
共 50 条
  • [31] Pattern-based specification of crowdsourcing applications
    Bozzon, Alessandro
    Brambilla, Marco
    Ceri, Stefano
    Mauri, Andrea
    Volonterio, Riccardo
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8541 : 218 - 235
  • [32] PatEC: Pattern-Based Equivalence Checking
    Jakobs, Marie-Christine
    MODEL CHECKING SOFTWARE (SPIN 2021), 2021, 12864 : 120 - 139
  • [33] Pattern-based reengineering of software systems
    Meyer, Matthias
    13TH WORKING CONFERENCE ON REVERSE ENGINEERING PROCEEDINGS, 2006, : 305 - +
  • [34] A pattern-based approach to elementary algebra
    Stromskag, Heidi
    PROCEEDINGS OF THE NINTH CONFERENCE OF THE EUROPEAN SOCIETY FOR RESEARCH IN MATHEMATICS EDUCATION (CERME9), 2015, : 474 - 480
  • [35] Local pattern-based interval models
    Cholewa, W
    ARTIFICIAL INTELLIGENCE AND SOFT COMPUTING - ICAISC 2004, 2004, 3070 : 948 - 953
  • [36] A Moire Pattern-Based Thread Counter
    Reich, Gary
    PHYSICS TEACHER, 2017, 55 (07): : 426 - 430
  • [37] PATTERN-BASED ONTOLOGY TRANSFORMATION SERVICE
    Svab-Zamazal, Ondrej
    Svatek, Vojtech
    Scharffe, Francois
    KEOD 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON KNOWLEDGE ENGINEERING AND ONTOLOGY DEVELOPMENT, 2009, : 42 - +
  • [38] Pattern-based guidelines for coordination engineering
    Etcheverry, P
    Lopistéguy, P
    Dagorret, P
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, 2001, 2113 : 155 - 164
  • [39] Pattern-based development of communication systems
    Gotzhein, R
    Schaible, P
    ANNALES DES TELECOMMUNICATIONS-ANNALS OF TELECOMMUNICATIONS, 1999, 54 (11-12): : 508 - 525
  • [40] Pattern-based clustering and attribute analysis
    Gabriela Alexe
    Sorin Alexe
    Peter L. Hammer
    Soft Computing, 2006, 10 : 442 - 452