The index-based XXL search engine for querying XML data with relevance ranking

Authors
Theobald, A [1]
Weikum, G [1]
Affiliation
[1] Univ Saarland, Saarbrucken, Germany
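Keywords
DOI
Not available
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract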
Query languages for XML such as XPath or XQuery support Boolean retrieval: a query result is a (possibly restructured) subset of XML elements or entire documents that satisfy the search conditions of the query. This search paradigm works for highly schematic XML data collections such as electronic catalogs. However, for searching information in open environments such as the Web or intranets of large corporations, ranked retrieval is more appropriate: a query result is a ranked list of XML elements in descending order of (estimated) relevance. Web search engines, which are based on the ranked retrieval paradigm, do not, however, consider the additional information and rich annotations provided by the structure of XML documents and their element names. This paper presents the XXL search engine, which supports relevance ranking on XML data. XXL is particularly geared for path queries with wildcards that can span multiple XML collections and contain both exact-match and semantic-similarity search conditions. In addition, ontological information and suitable index structures are used to improve search efficiency and effectiveness. XXL is fully implemented as a suite of Java servlets. Experiments with a variety of structurally diverse XML data demonstrate the efficiency of the XXL search engine and underline its effectiveness for ranked retrieval.
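The contrast between Boolean and ranked retrieval drawn in the abstract can be illustrated with a minimal Python sketch. It is not the XXL implementation: the sample document, the `SIMILAR` table standing in for ontological information, the helper `ranked_search`, and the scoring rule (name similarity times a 0/1 keyword match) are all hypothetical simplifications chosen for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample data; not taken from the paper.
DOC = """
<library>
  <book><title>XML indexing</title><chapter><heading>XML path indexes</heading></chapter></book>
  <article><title>Ranking web pages</title><section><caption>XML retrieval</caption></section></article>
</library>
"""

# Toy stand-in for ontological information: assumed similarity of element
# names to the query name "heading".
SIMILAR = {"heading": 1.0, "caption": 0.7, "title": 0.5}


def ranked_search(root, keyword):
    """Return (score, tag, text) tuples for matching elements, best first.

    Score = element-name similarity * (1.0 if the keyword occurs in the
    element's own text, else 0.0). Elements scoring 0 are dropped.
    """
    results = []
    for elem in root.iter():  # wildcard-style path: any depth below the root
        name_score = SIMILAR.get(elem.tag, 0.0)
        text = (elem.text or "").strip()
        content_score = 1.0 if keyword.lower() in text.lower() else 0.0
        score = name_score * content_score
        if score > 0:
            results.append((score, elem.tag, text))
    return sorted(results, key=lambda r: r[0], reverse=True)


root = ET.fromstring(DOC)

# Boolean retrieval: only elements named exactly "heading", as an unranked set.
boolean_hits = [e.text for e in root.iter("heading")]

# Ranked retrieval: similarly named elements also qualify, ordered by score.
ranked_hits = ranked_search(root, "xml")
```

In this sketch the Boolean query returns only the single `heading` element, while the ranked query also surfaces the `caption` and `title` elements that mention the keyword, ordered by the assumed name similarity.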
Pages: 477-495
Page count: 19