GHIC: A hierarchical pattern-based clustering algorithm for grouping Web transactions

被引:19
|
作者
Yang, YH
Padmanabhan, B
机构
[1] Univ Calif Davis, Grad Sch Management, Davis, CA 95616 USA
[2] Univ Penn, Wharton Sch, OPIM Dept, Philadelphia, PA 19104 USA
关键词
data mining; clustering; classification; association rules; Web mining;
D O I
10.1109/TKDE.2005.145
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Grouping customer transactions into segments may help understand customers better. The marketing literature has concentrated on identifying important segmentation variables (e.g., customer loyalty) and on using cluster analysis and mixture models for segmentation. The data mining literature has provided various clustering algorithms for segmentation without focusing specifically on clustering customer transactions. Building on the notion that observable customer transactions are generated by latent behavioral traits, in this paper, we investigate using a pattern-based clustering approach to grouping customer transactions. We define an objective function that we maximize in order to achieve a good clustering of customer transactions and present an algorithm, GHIC, that groups customer transactions such that itemsets generated from each cluster, while similar to each other, are different from ones generated from others. We present experimental results from user-centric Web usage data that demonstrates that GHIC generates a highly effective clustering of transactions.
引用
收藏
页码:1300 / 1304
页数:5
相关论文
共 50 条
  • [31] Feature Grouping for Intrusion Detection System Based on Hierarchical Clustering
    Song, Jingping
    Zhu, Zhiliang
    Price, Chris
    AVAILABILITY, RELIABILITY, AND SECURITY IN INFORMATION SYSTEMS, 2014, 8708 : 270 - +
  • [32] Clustering High-Dimensional Data: A Survey on Subspace Clustering, Pattern-Based Clustering, and Correlation Clustering
    Kriegel, Hans-Peter
    Kroeger, Peer
    Zimek, Arthur
    ACM TRANSACTIONS ON KNOWLEDGE DISCOVERY FROM DATA, 2009, 3 (01)
  • [33] Hierarchical pattern-based complex query of temporal knowledge graph
    Zhu, Lin
    Zhang, Heng
    Bai, Luyi
    KNOWLEDGE-BASED SYSTEMS, 2024, 284
  • [34] A pattern-based clustering strategy for object-oriented databases
    Chen, YH
    Lai, JK
    Lee, C
    INFORMATION INTELLIGENCE AND SYSTEMS, VOLS 1-4, 1996, : 971 - 976
  • [35] Hierarchical clustering algorithm based on granularity
    Liang, Jiuzhen
    Li, Guangbin
    GRC: 2007 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, PROCEEDINGS, 2007, : 429 - 432
  • [36] A hierarchical clustering algorithm based on GiST
    Zhou, Bing
    Wang, He-xing
    Wang, Cui-rong
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF CONTEMPORARY INTELLIGENT COMPUTING TECHNIQUES, 2007, 2 : 125 - +
  • [37] Clustering Algorithm of Web Click Stream Frequency Pattern
    Li Yang
    Zhang Liang
    2011 INTERNATIONAL CONFERENCE ON COMPUTERS, COMMUNICATIONS, CONTROL AND AUTOMATION (CCCA 2011), VOL III, 2010, : 388 - 391
  • [38] Pattern-based extraction of addresses from Web page content
    Asadi, Saeid
    Yang, Guowei
    Zhou, Xiaofang
    Shi, Yuan
    Zhai, Boxuan
    Jiang, Wendy Wen-Rong
    PROGRESS IN WWW RESEARCH AND DEVELOPMENT, PROCEEDINGS, 2008, 4976 : 407 - 418
  • [39] Generating pattern-based web tutorials for Java']Java frameworks
    Hakala, M
    Hautamäki, J
    Koskimies, K
    Savolainen, P
    SCIENTIFIC ENGINEERING FOR DISTRIBUTED JAVA APPLICATIONS, 2002, 2604 : 99 - 110
  • [40] A fuzzy pattern-based filtering algorithm for botnet detection
    Wang, Kuochen
    Huang, Chun-Ying
    Lin, Shang-Jyh
    Lin, Ying-Dar
    COMPUTER NETWORKS, 2011, 55 (15) : 3275 - 3286