The index-based XXL search engine for querying XML data with relevance ranking

Cited by: 0
Authors
Theobald, A [1]
Weikum, G [1]
Affiliations
[1] Univ Saarland, Saarbrucken, Germany
DOI: not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Query languages for XML such as XPath or XQuery support Boolean retrieval: a query result is a (possibly restructured) subset of XML elements or entire documents that satisfy the search conditions of the query. This search paradigm works for highly schematic XML data collections such as electronic catalogs. However, for searching information in open environments such as the Web or intranets of large corporations, ranked retrieval is more appropriate: a query result is a ranked list of XML elements in descending order of (estimated) relevance. Web search engines, which are based on the ranked retrieval paradigm, do not, however, consider the additional information and rich annotations provided by the structure of XML documents and their element names. This paper presents the XXL search engine, which supports relevance ranking on XML data. XXL is particularly geared toward path queries with wildcards that can span multiple XML collections and contain both exact-match and semantic-similarity search conditions. In addition, ontological information and suitable index structures are used to improve search efficiency and effectiveness. XXL is fully implemented as a suite of Java servlets. Experiments with a variety of structurally diverse XML data demonstrate the efficiency of the XXL search engine and underline its effectiveness for ranked retrieval.
Pages: 477-495
Page count: 19
Related papers
50 in total
  • [31] A new multi-search engine for querying data through an Internet search service on CORBA
    Chang, YS
    Yuan, SM
    Lo, W
    COMPUTER NETWORKS-THE INTERNATIONAL JOURNAL OF COMPUTER AND TELECOMMUNICATIONS NETWORKING, 2000, 34 (03): : 467 - 480
  • [32] Usage-based Ranking of Distributed XML Data
    Constantin, Camelia
    Amann, Bernd
    APPLIED COMPUTING 2008, VOLS 1-3, 2008, : 1008 - 1012
  • [33] Towards index-based similarity search for protein structure databases
    Çamoglu, O
    Kahveci, T
    Singh, AK
    PROCEEDINGS OF THE 2003 IEEE BIOINFORMATICS CONFERENCE, 2003, : 148 - 158
  • [34] Index-Based Densest Clique Percolation Community Search in Networks
    Yuan, Long
    Qin, Lu
    Zhang, Wenjie
    Chang, Lijun
    Yang, Jianye
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (05) : 922 - 935
  • [35] Index-based Densest Clique Percolation Community Search in Networks
    Yuan, Long
    Qin, Lu
    Zhang, Wenjie
    Chang, Lijun
    Yang, Jianye
    2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, : 2161 - 2162
  • [36] Index-based fast search algorithm of image database on internet
    Yeh, CH
    Kuo, CJ
    2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000, : 1195 - 1198
  • [37] Index-Based Search Scheme in Peer-to-Peer Networks
    Bo, Jin
    Zhao, Juping
    COMPUTER SCIENCE FOR ENVIRONMENTAL ENGINEERING AND ECOINFORMATICS, PT 2, 2011, 159 : 102 - 106
  • [38] Index-Based Approach to Similarity Search in Protein and Nucleotide Databases
    Hoksza, David
    Skopal, Tomas
    DATESO 2007 - DATABASES, TEXTS, SPECIFICATIONS, OBJECTS: PROCEEDINGS OF THE 7TH ANNUAL INTERNATIONAL WORKSHOP, 2007, 235 : 67 - 80
  • [39] An encoding scheme based on fractional number for querying and updating XML data
    Mirabi, Meghdad
    Ibrahim, Hamidah
    Udzir, Nur Izura
    Mamat, Ali
    JOURNAL OF SYSTEMS AND SOFTWARE, 2012, 85 (08) : 1831 - 1851
  • [40] A structure-based approach of keyword querying for fuzzy XML data
    Li, Ting
    Ma, Zongmin
    INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2018, 22 (02) : 125 - 140