Efficient and anonymous web-usage mining for web personalization

Cited by: 18
Authors
Shahabi, C [1]
Banaei-Kashani, F [1]
Affiliations
[1] University of Southern California, Department of Computer Science, Integrated Media Systems Center, Los Angeles, CA 90089, USA
Keywords
web-usage mining; data mining; personalization; pattern discovery
DOI
10.1287/ijoc.15.2.123.14444
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
The World Wide Web (WWW) is the largest distributed information space and has grown to encompass diverse information resources. Although the web is growing exponentially, the individual's capacity to read and digest content is essentially fixed. The full economic potential of the web will not be realized unless enabling technologies are provided to facilitate access to web resources. Currently, web personalization is the most promising approach to remedy this problem, and web mining, particularly web-usage mining, is considered a crucial component of any efficacious web-personalization system. In this paper, we describe a complete framework for web-usage mining to satisfy the challenging requirements of web-personalization applications. For online and anonymous web personalization to be effective, web-usage mining must be accomplished in real time as accurately as possible. On the other hand, web-usage mining should allow a compromise between scalability and accuracy to be applicable to real-life websites with numerous visitors. Within our web-usage-mining framework, we introduce a distributed user-tracking approach for accurate, scalable, and implicit collection of the usage data. We also propose a new model, the feature-matrices (FM) model, to discover and interpret users' access patterns. With FM, various spatial and temporal features of usage data can be captured with flexible precision so that we can trade off accuracy for scalability based on the specific application requirements. Moreover, the tunable complexity of the FM model allows real-time and adaptive access-pattern discovery from usage data. We define a novel similarity measure based on FM that is specifically designed for accurate classification of partial navigation patterns in real time. Our extensive experiments with both synthetic and real data verify the correctness and efficacy of our web-usage-mining framework for anonymous and efficient web personalization.
Pages: 123 - 147
Number of pages: 25
Related papers
50 in total
  • [1] A web usage mining algorithm for web personalization
    Picariello, Antonio
    Sansone, Carlo
    [J]. INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2008, 2 (04) : 219 - 230
  • [2] Linguistic object-oriented web-usage mining
    Hong, Tzung-Pei
    Huang, Cheng-Ming
    Horng, Shi-Jinn
    [J]. INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2008, 48 (01) : 47 - 61
  • [3] Feature matrices: A model for efficient and anonymous Web usage mining
    Shahabi, C
    Banaei-Kashani, F
    Faruque, J
    Faisal, A
    [J]. ELECTRONIC COMMERCE AND WEB TECHNOLOGIES, 2001, 2115 : 280 - 294
  • [4] Web Usage Mining as a Tool for Personalization: A Survey
    Dimitrios Pierrakos
    Georgios Paliouras
    Christos Papatheodorou
    Constantine D. Spyropoulos
    [J]. User Modeling and User-Adapted Interaction, 2003, 13 : 311 - 372
  • [5] Web usage mining as a tool for personalization: A survey
    Pierrakos, D
    Paliouras, G
    Papatheodorou, C
    Spyropoulos, CD
    [J]. USER MODELING AND USER-ADAPTED INTERACTION, 2003, 13 (04) : 311 - 372
  • [6] A Web Usage Lattice Based Mining Approach for Intelligent Web Personalization
    Zhou, Baoyao
    Hui, Siu Cheung
    Fong, Alvis C. M.
    [J]. INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2005, 1 (03) : 137 - +
  • [7] Mining web usage data for automatic site personalization
    Mobasher, B
    [J]. CLASSIFICATION, AUTOMATION, AND NEW MEDIA, 2002, : 299 - 312
  • [8] Toward recommendation based on ontology-powered web-usage mining
    Adda, Mehdi
    Valtchev, Petko
    Missaoui, Rokia
    Djeraba, Chabane
    [J]. IEEE INTERNET COMPUTING, 2007, 11 (04) : 45 - 52
  • [9] Building Web Personalization System with Time-Driven Web Usage Mining
    Ramya, P. T.
    Sajeev, G. P.
    [J]. PROCEEDING OF THE THIRD INTERNATIONAL SYMPOSIUM ON WOMEN IN COMPUTING AND INFORMATICS (WCI-2015), 2015, : 38 - 43
  • [10] Web Usage Mining and Text Mining in the Environment of Web Personalization for Ontology Development of Recommender Systems
    Bhattacharya, Tanya
    Jaiswal, Arunima
    Nagpal, Vaibhav
    [J]. 2016 5TH INTERNATIONAL CONFERENCE ON RELIABILITY, INFOCOM TECHNOLOGIES AND OPTIMIZATION (TRENDS AND FUTURE DIRECTIONS) (ICRITO), 2016, : 78 - 84