GHIC: A hierarchical pattern-based clustering algorithm for grouping Web transactions

被引:19
|
作者
Yang, YH
Padmanabhan, B
机构
[1] Univ Calif Davis, Grad Sch Management, Davis, CA 95616 USA
[2] Univ Penn, Wharton Sch, OPIM Dept, Philadelphia, PA 19104 USA
关键词
data mining; clustering; classification; association rules; Web mining;
D O I
10.1109/TKDE.2005.145
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Grouping customer transactions into segments may help understand customers better. The marketing literature has concentrated on identifying important segmentation variables (e.g., customer loyalty) and on using cluster analysis and mixture models for segmentation. The data mining literature has provided various clustering algorithms for segmentation without focusing specifically on clustering customer transactions. Building on the notion that observable customer transactions are generated by latent behavioral traits, in this paper, we investigate using a pattern-based clustering approach to grouping customer transactions. We define an objective function that we maximize in order to achieve a good clustering of customer transactions and present an algorithm, GHIC, that groups customer transactions such that itemsets generated from each cluster, while similar to each other, are different from ones generated from others. We present experimental results from user-centric Web usage data that demonstrates that GHIC generates a highly effective clustering of transactions.
引用
收藏
页码:1300 / 1304
页数:5
相关论文
共 50 条
  • [1] Segmenting customer transactions using a pattern-based clustering approach
    Yang, YH
    Padmanabhan, B
    THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2003, : 411 - 418
  • [2] Incremental Hierarchical Clustering of Stochastic Pattern-Based Symbolic Data
    Xu, Xin
    Lu, Jiaheng
    Wang, Wei
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2016, PT II, 2016, 9652 : 156 - 167
  • [3] MaPle:: A fast algorithm for maximal pattern-based clustering
    Pei, J
    Zhang, XL
    Cho, MJ
    Wang, HX
    Yu, PS
    THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2003, : 259 - 266
  • [4] CLEOPATRA: Evolutionary pattern-based clustering of web usage data
    Zhao, Qiankun
    Bhowmick, Sourav S.
    Gruenwald, Le
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2006, 3918 : 323 - 333
  • [5] Aircraft grouping based on improved divisive hierarchical clustering algorithm
    Xia, Qingjun
    Li, Xueming
    Song, Ye
    Zhang, Baocheng
    JOURNAL OF AIR TRANSPORT MANAGEMENT, 2014, 40 : 157 - 162
  • [6] Pattern-based clustering and attribute analysis
    Gabriela Alexe
    Sorin Alexe
    Peter L. Hammer
    Soft Computing, 2006, 10 : 442 - 452
  • [7] Pattern-based clustering and attribute analysis
    Alexe, G
    Alexe, S
    Hammer, PL
    SOFT COMPUTING, 2006, 10 (05) : 442 - 452
  • [8] A term-based algorithm for hierarchical clustering of web documents
    Schenker, A
    Last, M
    Kandel, A
    JOINT 9TH IFSA WORLD CONGRESS AND 20TH NAFIPS INTERNATIONAL CONFERENCE, PROCEEDINGS, VOLS. 1-5, 2001, : 3076 - 3081
  • [9] Pattern-based clustering problem based on fuzzy measures
    Gutierrez, I
    Barroso, M.
    Gomez, D.
    Castro, C.
    Espinola, R.
    DEVELOPMENTS OF ARTIFICIAL INTELLIGENCE TECHNOLOGIES IN COMPUTATION AND ROBOTICS, 2020, 12 : 412 - 420
  • [10] A pattern-based approach to conceptual clustering in FOL
    Lisi, Francesca A.
    CONCEPTUAL STRUCTURES: INSPIRATION AND APPLICATION, 2006, 4068 : 346 - 359