The index-based XXL search engine for querying XML data with relevance ranking

Cited by: 0
Authors
Theobald, A [1]
Weikum, G [1]
Affiliation
[1] Univ Saarland, Saarbrucken, Germany
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Query languages for XML such as XPath or XQuery support Boolean retrieval: a query result is a (possibly restructured) subset of XML elements or entire documents that satisfy the search conditions of the query. This search paradigm works for highly schematic XML data collections such as electronic catalogs. However, for searching information in open environments such as the Web or intranets of large corporations, ranked retrieval is more appropriate: a query result is a ranked list of XML elements in descending order of (estimated) relevance. Web search engines, which are based on the ranked retrieval paradigm, do not, however, consider the additional information and rich annotations provided by the structure of XML documents and their element names. This paper presents the XXL search engine, which supports relevance ranking on XML data. XXL is particularly geared for path queries with wildcards that can span multiple XML collections and contain both exact-match and semantic-similarity search conditions. In addition, ontological information and suitable index structures are used to improve the search efficiency and effectiveness. XXL is fully implemented as a suite of Java servlets. Experiments with a variety of structurally diverse XML data demonstrate the efficiency of the XXL search engine and underline its effectiveness for ranked retrieval.
Pages: 477-495
Page count: 19