XM-Tree, a new index for Web Information Retrieval

被引:0
|
作者
Deco, Claudia [1 ]
Pierangeli, Guillermo [1 ]
Bender, Cristina [1 ]
Reyes, Nora [2 ]
机构
[1] Univ Nacl Rosario, Fac Ciencias Exactas Ingn Agrimensura, Dept Sistemas & Informat, RA-2000 Rosario, Argentina
[2] Univ Nacl San Luis, Dept Informat, RA-5700 San Luis, Argentina
来源
关键词
Metric Spaces; Similarity Searching; M-Tree; XM-Tree;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Web Information Retrieval is another problem of searching elements of a set that are closest to a given query under a certain similarity criterion. It is of interest to take advantage of metric spaces in order to solve a search in an effective and efficient way. In this article, we present an extension of the M-Tree index, called XM-Tree, in order to improve search results. This index allows dynamic insertion of new data, reduces search costs using pruning and precalculated distances, and uses a tolerable amount of space, which makes this index apt for the extensive and dynamic Web. The proposed extension indexes Web documents, uses L-2 as indexing distance and L-infinity as similarity criterion to solve queries. We also present experiments validating the results.
引用
收藏
页码:78 / 84
页数:7
相关论文
共 50 条
  • [1] BT plus -tree: A New Index for Temporal Information in Web Pages
    Chen, Hong
    Li, Qiang
    Jin, Peiquan
    [J]. DATABASE THEORY AND APPLICATION, BIO-SCIENCE AND BIO-TECHNOLOGY, 2010, 118 : 68 - 78
  • [2] TB±tree: Index Structure for Information Retrieval Systems
    Fekihal, Mabruk
    Jaluta, Ibrahim
    Saini, Dinesh Kumar
    [J]. 2015 SECOND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE, COMPUTER ENGINEERING, AND SOCIAL MEDIA (CSCESM), 2015, : 182 - 186
  • [3] A new web usage model for information retrieval
    Zhou Hong-fang
    Feng Bo-qin
    Yue Hui
    Lv Lin-tao
    [J]. 2006 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY, PTS 1 AND 2, PROCEEDINGS, 2006, : 1456 - 1459
  • [4] Information retrieval on the Web
    Yang, KD
    [J]. ANNUAL REVIEW OF INFORMATION SCIENCE AND TECHNOLOGY, 2005, 39 : 33 - 80
  • [5] Information retrieval on the Web
    Kobayashi, M
    Takeda, K
    [J]. ACM COMPUTING SURVEYS, 2000, 32 (02) : 144 - 173
  • [6] New challenges of web crawler technology for information retrieval
    Blazquez Ochando, Manuel
    [J]. METODOS DE INFORMACION, 2013, 4 (07): : 115 - 128
  • [7] New Web Information Retrieval paradigm based on a Multi-Space Interpretation Index and Projection operations
    Adda, Mehdi
    Hannech, Amel
    Mcheick, Hamid
    [J]. 2013 XXIV INTERNATIONAL SYMPOSIUM ON INFORMATION, COMMUNICATION AND AUTOMATION TECHNOLOGIES (ICAT), 2013,
  • [8] MULTITERM INDEX - A NEW CONCEPT IN INFORMATION STORAGE AND RETRIEVAL
    SKOLNIK, H
    [J]. ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 1969, (SEP): : CH17 - &
  • [9] MULTITERM INDEX - A NEW CONCEPT IN INFORMATION STORAGE AND RETRIEVAL
    SKOLNIK, H
    [J]. JOURNAL OF CHEMICAL DOCUMENTATION, 1970, 10 (02): : 81 - &
  • [10] Semantic information retrieval on the web
    Sezer, Ebru
    Yazici, Adnan
    Yarimagan, Unal
    [J]. ADVANCES IN INFORMATION SYSTEMS, PROCEEDINGS, 2006, 4243 : 158 - 167