An OLAP-based scalable web access analysis engine

被引:0
|
作者
Chen, Q [1 ]
Dayal, U [1 ]
Hsu, M [1 ]
机构
[1] Hewlett Packard Corp, HP Labs, Palo Alto, CA 94303 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Collecting and mining web log records (WLRs) from e-commerce web sites has become increasingly important for targeted marketing, promotions, and traffic analysis. In this paper, we describe a scalable data warehousing and OLAP-based engine for analyzing WLRs. We have to address several scalability and performance challenges in developing such a framework. Because an active web site may generate hundreds of millions of WLRs daily, we have to deal with Huge data volumes and data flow rates. To support fine-grained analysis, e.g., individual users' access profiles, we end up with huge, sparse data cubes defined over very large-sized dimensions (there may be hundreds of thousands of visitors to the site and tens of thousands of pages). While OLAP servers store sparse cubes quite efficiently, rolling up a very large cube can take prohibitively long. We have applied several non-traditional approaches to deal with this problem, which allow us to speed up WLR analysis by 3 orders of magnitude. Our framework supports multilevel and multidimensional pattern extraction, analysis and feature ranking, and in addition to the typical OLAP operations, supports data mining operations such as extended multilevel and multidimensional association rules.
引用
收藏
页码:210 / 223
页数:14
相关论文
共 50 条
  • [1] Effectiveness of OLAP-based Sales Analysis in Retail Enterprises
    Ju, Chunhua
    Han, Minghua
    [J]. 2008 ISECS INTERNATIONAL COLLOQUIUM ON COMPUTING, COMMUNICATION, CONTROL, AND MANAGEMENT, VOL 3, PROCEEDINGS, 2008, : 240 - 244
  • [2] Multidimensional modeling and analysis of large and complex watercourse data: an OLAP-based solution
    Boulil, Kamal
    Le Ber, Florence
    Bimonte, Sandro
    Grac, Corinne
    Cemesson, Flavie
    [J]. ECOLOGICAL INFORMATICS, 2014, 24 : 90 - 106
  • [3] Towards Indoor Radon Analytics: An OLAP-based Multidimensional Approach
    Azevedo, Rolando
    Silva, Joaquim P.
    Lopes, Nuno
    Curado, Antonio
    Nunes, Leonel J. R.
    Lopes, Sergio Ivan
    [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON DATA SCIENCE, TECHNOLOGY AND APPLICATIONS (DATA), 2022, : 361 - 369
  • [4] SOLA: Stream OLAP-based Analytical Framework for Roadway Maintenance
    Komamizu, Takahiro
    Amagasa, Toshiyuki
    Shaikh, Salman Ahmed
    Shiokawa, Hiroaki
    Kitagawa, Hiroyuki
    [J]. 9TH INTERNATIONAL CONFERENCE ON MANAGEMENT OF EMERGENT DIGITAL ECOSYSTEMS (MEDES 2017), 2017, : 35 - 41
  • [5] Research and Application on OLAP-based Farm Products Examination Model
    Han, Minghua
    Ju, Chunhua
    [J]. PROCEEDINGS OF THE INTERNATIONAL SYMPOSIUM ON ELECTRONIC COMMERCE AND SECURITY, 2008, : 858 - 861
  • [6] Multiagent reinforcement learning using OLAP-based association rules mining
    Kaya, M
    Alhajj, R
    [J]. IEEE/WIC INTERNATIONAL CONFERENCE ON INTELLIGENT AGENT TECHNOLOGY, PROCEEDINGS, 2003, : 584 - 587
  • [7] Development of a responsive logistics workflow system: an OLAP-based hybrid approach
    Lau, Henry C. W.
    Ho, G. T. S.
    Ip, A. W. H.
    Lee, C. K. M.
    Ning, A.
    [J]. INTERNATIONAL JOURNAL OF SERVICES TECHNOLOGY AND MANAGEMENT, 2006, 7 (5-6) : 568 - 581
  • [8] Effectiveness of OLAP-based cost data management in construction cost estimate
    Moon, S. W.
    Kim, J. S.
    Kwon, K. N.
    [J]. AUTOMATION IN CONSTRUCTION, 2007, 16 (03) : 336 - 344
  • [9] A hybrid intelligent system to enhance logistics workflow: an OLAP-based GA approach
    Ho, GTS
    Lee, CKM
    Lau, HCW
    Ip, AWH
    [J]. INTERNATIONAL JOURNAL OF COMPUTER INTEGRATED MANUFACTURING, 2006, 19 (01) : 69 - 78
  • [10] OSim: An OLAP-Based Similarity Search Service Solver for Dynamic Information Networks
    Niu, Xiaoguang
    Zhang, Yihao
    Huang, Ting
    Wu, Xiaoping
    [J]. WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, WASA 2016, 2016, 9798 : 536 - 547