The index-based XXL search engine for querying XML data with relevance ranking

Cited by: 0

Authors:

Theobald, A [1]

Weikum, G [1]

Affiliations:

[1] Univ Saarland, Saarbrucken, Germany

Source:

ADVANCES IN DATABASE TECHNOLOGY - EDBT 2002 | 2002 / Vol. 2287

Keywords:

DOI:

Not available

Chinese Library Classification:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

Query languages for XML such as XPath or XQuery support Boolean retrieval: a query result is a (possibly restructured) subset of XML elements or entire documents that satisfy the search conditions of the query. This search paradigm works for highly schematic XML data collections such as electronic catalogs. However, for searching information in open environments such as the Web or intranets of large corporations, ranked retrieval is more appropriate: a query result is a ranked list of XML elements in descending order of (estimated) relevance. Web search engines, which are based on the ranked retrieval paradigm, do not, however, consider the additional information and rich annotations provided by the structure of XML documents and their element names. This paper presents the XXL search engine that supports relevance ranking on XML data. XXL is particularly geared for path queries with wildcards that can span multiple XML collections and contain both exact-match and semantic-similarity search conditions. In addition, ontological information and suitable index structures are used to improve the search efficiency and effectiveness. XXL is fully implemented as a suite of Java servlets. Experiments with a variety of structurally diverse XML data demonstrate the efficiency of the XXL search engine and underline its effectiveness for ranked retrieval.


Pages: 477 - 495

Number of pages: 19

50 items in total

[31] A new multi-search engine for querying data through an Internet search service on CORBA
Chang, YS
Yuan, SM
Lo, W
COMPUTER NETWORKS-THE INTERNATIONAL JOURNAL OF COMPUTER AND TELECOMMUNICATIONS NETWORKING, 2000, 34 (03) : 467 - 480
[32] Usage-based Ranking of Distributed XML Data
Constantin, Camelia
Amann, Bernd
APPLIED COMPUTING 2008, VOLS 1-3, 2008 : 1008 - 1012
[33] Towards index-based similarity search for protein structure databases
Çamoglu, O
Kahveci, T
Singh, AK
PROCEEDINGS OF THE 2003 IEEE BIOINFORMATICS CONFERENCE, 2003 : 148 - 158
[34] Index-Based Densest Clique Percolation Community Search in Networks
Yuan, Long
Qin, Lu
Zhang, Wenjie
Chang, Lijun
Yang, Jianye
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (05) : 922 - 935
[35] Index-based Densest Clique Percolation Community Search in Networks
Yuan, Long
Qin, Lu
Zhang, Wenjie
Chang, Lijun
Yang, Jianye
2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019 : 2161 - 2162
[36] Index-based fast search algorithm of image database on internet
Yeh, CH
Kuo, CJ
2000 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, PROCEEDINGS VOLS I-III, 2000 : 1195 - 1198
[37] Index-Based Search Scheme in Peer-to-Peer Networks
Bo, Jin
Zhao, Juping
COMPUTER SCIENCE FOR ENVIRONMENTAL ENGINEERING AND ECOINFORMATICS, PT 2, 2011, 159 : 102 - 106
[38] Index-Based Approach to Similarity Search in Protein and Nucleotide Databases
Hoksza, David
Skopal, Tomas
DATESO 2007 - DATABASES, TEXTS, SPECIFICATIONS, OBJECTS: PROCEEDINGS OF THE 7TH ANNUAL INTERNATIONAL WORKSHOP, 2007, 235 : 67 - 80
[39] An encoding scheme based on fractional number for querying and updating XML data
Mirabi, Meghdad
Ibrahim, Hamidah
Udzir, Nur Izura
Mamat, Ali
JOURNAL OF SYSTEMS AND SOFTWARE, 2012, 85 (08) : 1831 - 1851
[40] A structure-based approach of keyword querying for fuzzy XML data
Li, Ting
Ma, Zongmin
INTERNATIONAL JOURNAL OF KNOWLEDGE-BASED AND INTELLIGENT ENGINEERING SYSTEMS, 2018, 22 (02) : 125 - 140
