BT plus -tree: A New Index for Temporal Information in Web Pages

被引:0
|
作者
Chen, Hong [1 ]
Li, Qiang [1 ]
Jin, Peiquan [1 ]
机构
[1] Univ Sci & Technol China, Sch Comp Sci & Technol, Hefei 230027, Peoples R China
关键词
B+-TREES;
D O I
暂无
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
With the growth of Web information, traditional search engines, which are built on the text-based search technology, are unable to meet users demands on Web search. As many queries are time-related, and most Web pages contain time information, it has been an important issue to develop time-aware Web search engines. Based on this view, in this paper we study the indexing mechanism of the temporal information in Web pages. Our work is based on the assumption that each Web page only has one primary time, which will be utilized in time-based Web search. We present a new index structure called BT+-tree which is based on the MAP21-tree. However, unlike MAP21-tree's double-tree structure, BT+-tree only uses one tree structure. Furthermore, duplicated keys can be effectively treated in BT+-tree, while the MAP21-tree has little consideration on duplicated keys. After discussing the index structure as well as manipulation algorithms of BT+-tree, we design a testing program to measure the performance of BT+-tree. The experimental results show that BT+-tree is effective for indexing temporal information in Web pages.
引用
收藏
页码:68 / 78
页数:11
相关论文
共 50 条
  • [41] Chemicke Listy Have New Web Pages
    Vyskocil, Vlastimil
    [J]. CHEMICKE LISTY, 2018, 112 (04): : 270 - 271
  • [42] Webformer: Pre-training with Web Pages for Information Retrieval
    Guo, Yu
    Ma, Zhengyi
    Mao, Jiaxin
    Qian, Hongjin
    Zhang, Xinyu
    Jiang, Hao
    Cao, Zhao
    Dou, Zhicheng
    [J]. PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 1502 - 1512
  • [43] Term frequency occurrences on web pages for textual information retrieval
    Sivapathasundaram, Karthika
    Cheng, Xiaochun
    Petridis, Miltos
    [J]. DATA SCIENCE AND KNOWLEDGE ENGINEERING FOR SENSING DECISION SUPPORT, 2018, 11 : 585 - 590
  • [44] Relating Web Pages to Enable Information-Gathering Tasks
    Bagchi, Amitabha
    Lahoti, Garima
    [J]. 20TH ACM CONFERENCE ON HYPERTEXT AND HYPERMEDIA (HYPERTEXT 2009), 2009, : 109 - 118
  • [45] Differences in information processing from print ads and web pages
    Unni, R
    [J]. ADVANCES IN CONSUMER RESEARCH, VOLUME XXXI, 2004, 31 : 263 - 264
  • [46] Web information seeking by pages: an observational study of moving and stopping
    Kari, J
    [J]. INFORMATION RESEARCH-AN INTERNATIONAL ELECTRONIC JOURNAL, 2004, 9 (04):
  • [47] A color selection tool for the readability of textual information on Web pages
    Zuffia, Silvia
    Beretta, Giordano
    Brambilla, Carla
    [J]. INTERNET IMAGING VII, 2006, 6061
  • [48] Effectively finding relevant Web pages from linkage information
    Hou, JY
    Zhang, YC
    [J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2003, 15 (04) : 940 - 951
  • [49] Noise elimination from web pages for efficacious information retrieval
    Uma, R.
    Latha, B.
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 6): : 14583 - 14602
  • [50] Automatic MEDLINE searching: Integrating medical information into Web pages
    Worel, S
    [J]. ECONTENT, 1999, 22 (04) : 38 - 42