GHIC: A hierarchical pattern-based clustering algorithm for grouping Web transactions

Cited by: 19
Authors
Yang, YH
Padmanabhan, B
Affiliations
[1] Univ Calif Davis, Grad Sch Management, Davis, CA 95616 USA
[2] Univ Penn, Wharton Sch, OPIM Dept, Philadelphia, PA 19104 USA
Keywords
data mining; clustering; classification; association rules; Web mining
DOI
10.1109/TKDE.2005.145
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Grouping customer transactions into segments may help in understanding customers better. The marketing literature has concentrated on identifying important segmentation variables (e.g., customer loyalty) and on using cluster analysis and mixture models for segmentation. The data mining literature has provided various clustering algorithms for segmentation without focusing specifically on clustering customer transactions. Building on the notion that observable customer transactions are generated by latent behavioral traits, in this paper we investigate a pattern-based clustering approach to grouping customer transactions. We define an objective function that we maximize in order to achieve a good clustering of customer transactions, and we present an algorithm, GHIC, that groups customer transactions such that the itemsets generated from each cluster, while similar to each other, are different from those generated from other clusters. We present experimental results on user-centric Web usage data that demonstrate that GHIC generates a highly effective clustering of transactions.
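The idea in the abstract, splitting transactions so that the itemsets found in one cluster differ from those found in another, can be illustrated with a small sketch. This is not the authors' GHIC implementation or their objective function, which the abstract does not spell out. It is a toy version under stated assumptions: the score (total absolute difference in itemset support between two clusters), the rule of splitting on whether a transaction contains a pattern, the brute-force itemset enumeration, and all function names and sample data are hypothetical choices made for illustration.

```python
from itertools import combinations


def support(itemset, transactions):
    """Fraction of transactions that contain every item in `itemset`."""
    if not transactions:
        return 0.0
    return sum(itemset <= t for t in transactions) / len(transactions)


def candidate_itemsets(transactions, max_size=2, min_support=0.2):
    """Brute-force enumeration of itemsets up to `max_size` items that
    meet `min_support`. Only suitable for tiny examples."""
    items = sorted(set().union(*transactions))
    found = []
    for k in range(1, max_size + 1):
        for combo in combinations(items, k):
            itemset = frozenset(combo)
            if support(itemset, transactions) >= min_support:
                found.append(itemset)
    return found


def split_score(left, right, itemsets):
    """Assumed toy objective: how differently the two clusters
    support each candidate itemset, summed over all candidates."""
    return sum(abs(support(s, left) - support(s, right)) for s in itemsets)


def best_split(transactions, itemsets):
    """Try each itemset as a splitting pattern (contains it / does not)
    and keep the split with the highest score. Returns None if no
    pattern separates the transactions into two non-empty groups."""
    best = None
    for pattern in itemsets:
        left = [t for t in transactions if pattern <= t]
        right = [t for t in transactions if not pattern <= t]
        if not left or not right:
            continue
        score = split_score(left, right, itemsets)
        if best is None or score > best[0]:
            best = (score, pattern, left, right)
    return best


def cluster(transactions, max_depth=2, min_support=0.2):
    """Greedy top-down clustering: split recursively until `max_depth`
    is reached or no useful split exists. Returns a list of clusters."""
    if max_depth == 0 or len(transactions) < 2:
        return [transactions]
    itemsets = candidate_itemsets(transactions, min_support=min_support)
    split = best_split(transactions, itemsets)
    if split is None:
        return [transactions]
    _, _, left, right = split
    return (cluster(left, max_depth - 1, min_support)
            + cluster(right, max_depth - 1, min_support))


# Hypothetical Web sessions: each transaction is the set of page types visited.
transactions = [
    frozenset({"news", "sports"}),
    frozenset({"news", "sports", "weather"}),
    frozenset({"news", "sports"}),
    frozenset({"shop", "cart"}),
    frozenset({"shop", "cart", "checkout"}),
    frozenset({"shop", "cart"}),
]

for group in cluster(transactions, max_depth=1):
    print(sorted(sorted(t) for t in group))
```

With `max_depth=1` the sketch performs a single split; larger depths keep dividing each cluster in the same greedy way, which is what makes the procedure hierarchical.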
Pages: 1300-1304
Number of pages: 5
Related papers
50 items in total
  • [21] Research of Matrix Clustering Algorithm Based on Web User Access Pattern
    Bao, Jian
    WEB INFORMATION SYSTEMS AND MINING, PT II, 2011, 6988 : 154 - 159
  • [22] Pattern-based variability management in Web service development
    Jiang, JJ
    Ruokonen, A
    Systä, T
    THIRD EUROPEAN CONFERENCE ON WEB SERVICES, PROCEEDINGS, 2005, : 83 - 94
  • [23] Pattern-based automatic taxonomy learning from the Web
    Sanchez, David
    Moreno, Antonio
    AI COMMUNICATIONS, 2008, 21 (01) : 27 - 48
  • [24] Generating pattern-based web tutorials for Java frameworks
    Hakala, Markku
    Hautamäki, Juha
    Koskimies, Kai
    Savolainen, Pekka
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2003, 2604 : 99 - 110
  • [25] Parallelization of a graph-cut based algorithm for hierarchical clustering of web documents
    Seshadri, Karthick
    Shalinie, S. Mercy
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2015, 27 (17) : 5156 - 5176
  • [26] A pattern-based constraint description approach for Web services
    Wang, Qianxiang
    Li, Min
    Meng, Na
    Liu, Yonggang
    Mei, Hong
    USIC 2007: PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE ON QUALITY SOFTWARE, 2007, : 60 - 69
  • [27] A pattern-based voting approach for concept discovery on the web
    Chen, J
    Zhang, ZG
    Li, Q
    Li, XM
    WEB TECHNOLOGIES RESEARCH AND DEVELOPMENT - APWEB 2005, 2005, 3399 : 109 - 120
  • [28] Supporting pattern-based application authoring for the semantic Web
    Michael, F
    Claudia, N
    Matthias, H
    INTELLIGENT AND ADAPTIVE SYSTEMS AND SOFTWARE ENGINEERING, 2004, : 215 - 220
  • [29] Pattern-based Automatic Parallelization of Representative-based Clustering Algorithms
    Islam, Saiyedul
    Balasubramaniam, Sundar
    Gupta, Shruti
    Brajesh, Shikhar
    Badlani, Rohan
    Labhishetty, Nitin
    Baid, Abhinav
    Goyal, Poonam
    Goyal, Navneet
    2018 IEEE 5TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2018, : 99 - 108
  • [30] UNSUPERVISED GROUPING OF MOVING OBJECTS BASED ON AGGLOMERATIVE HIERARCHICAL CLUSTERING
    Fujinami, Kaori
    INTERNATIONAL JOURNAL ON SMART SENSING AND INTELLIGENT SYSTEMS, 2016, 9 (04) : 2276 - 2296