Mining high-quality cases for hypertext prediction and prefetching

Cited by: 0
Authors
Yang, Q [1]
Li, ITY [1]
Zhang, HH [1]
Affiliation
[1] Simon Fraser Univ, Sch Comp Sci, Burnaby, BC V5A 1S6, Canada
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Case-based reasoning (CBR) aims to use past experience to solve new problems. A strong requirement for its application is that an extensive experience base exists that provides statistically significant justification for new applications. Such extensive experience bases have been rare, confining most CBR applications to small-scale problems involving a single user or a few users, or even to toy problems. In this work, we present an application of CBR in the domain of web document prediction and retrieval, whereby a server-side application can predict, with high accuracy and coverage, a user's next request for hypertext documents based on past requests. An application program can then use the prediction knowledge to prefetch or presend web objects to reduce latency and network load. Through this application, we demonstrate the feasibility of applying CBR in the web-document retrieval context, exposing the vast possibility of using web-log files that contain document retrieval experiences from millions of users. In this framework, a CBR system is embedded within an overall web-server application. A novelty of the work is that data mining and case-based reasoning are combined in a seamless manner, allowing cases to be mined efficiently. In addition, we developed techniques to allow different case bases to be combined in order to yield an overall case base of higher quality than any individual one. We validate our work through experiments using realistic, large-scale web logs.
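The idea of predicting a user's next request from past requests, and of combining several case bases, can be illustrated with a minimal sketch. This is not the authors' algorithm: it assumes a simple n-gram scheme in which a case maps the last n requested pages to the most frequent next page, with a confidence equal to the observed fraction of matching sessions. The names `mine_cases`, `predict`, `min_support`, and the toy sessions are all hypothetical.

```python
from collections import Counter, defaultdict

def mine_cases(sessions, n, min_support=2):
    """Mine cases: last n requests -> (most frequent next request, confidence).

    Assumed scheme for illustration only, not the method from the paper.
    """
    counts = defaultdict(Counter)
    for session in sessions:
        for i in range(len(session) - n):
            key = tuple(session[i:i + n])
            counts[key][session[i + n]] += 1
    cases = {}
    for key, nexts in counts.items():
        total = sum(nexts.values())
        if total < min_support:
            continue  # too few observations to justify a case
        page, hits = nexts.most_common(1)[0]
        cases[key] = (page, hits / total)
    return cases

def predict(case_bases, history):
    """Combine case bases by returning the matching case with highest confidence."""
    best = None
    for n, cases in case_bases.items():
        if len(history) < n:
            continue
        case = cases.get(tuple(history[-n:]))
        if case is not None and (best is None or case[1] > best[1]):
            best = case
    return best[0] if best else None

# Toy sessions standing in for parsed web-log data (hypothetical).
sessions = [
    ["/index", "/products", "/cart"],
    ["/index", "/products", "/cart"],
    ["/index", "/products", "/help"],
    ["/news", "/products", "/help"],
    ["/news", "/products", "/help"],
]

# One case base per context length; the combined predictor consults both.
case_bases = {n: mine_cases(sessions, n) for n in (1, 2)}
next_page = predict(case_bases, ["/index", "/products"])
```

In this sketch a server could prefetch `next_page` when a prediction is returned and do nothing when `predict` returns `None`. Picking the highest-confidence case across case bases is only one possible combination rule.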
Pages: 744-755
Page count: 12