Querying documents in object databases

被引:315
|
作者
Abiteboul S. [1 ]
Cluet S. [1 ]
Christophides V. [1 ]
Milo T. [2 ]
Moerkotte G. [3 ]
Siméon J. [1 ]
机构
[1] INRIA-Rocquencourt, F-78153 Le Chesnay Cedex
[2] Tel Aviv University, Ramat Aviv
[3] Lehrstuhl für Praktische Informatik III, Seminargebäude A5, Universität Mannheim
关键词
Generalized path expressions; ODMG; OQL; Pattern matching;
D O I
10.1007/s007990050001
中图分类号
学科分类号
摘要
We consider the problem of storing and accessing documents (SGML and HTML, in particular) using database technology. To specify the database image of documents, we use structuring schemas that consist in grammars annotated with database programs. To query documents, we introduce an extension of OQL, the ODMG standard query language for object databases. Our extension (named OQL-doc) allows us to query documents without a precise knowledge of their structure using in particular generalized path expressions and pattern matching. This allows us to introduce in a declarative language (in the style of SQL or OQL), navigational and information retrieval styles of accessing data. Query processing in the context of documents and path expressions leads to challenging implementation issues. We extend an object algebra with new operators to deal with generalized path expressions. We then consider two essential complementary optimization techniques. We show that almost standard database optimization techniques can be used to answer queries without having to load the entire document into the database. We also consider the interaction of full-text indexes (e.g., inverted files) with standard database collection indexes (e.g., B-trees) that provide important speed-up. © Springer-Verlag 1997.
引用
收藏
页码:5 / 19
页数:14
相关论文
共 50 条
  • [1] QUERYING OBJECT DATABASES
    LOOMIS, MES
    JOURNAL OF OBJECT-ORIENTED PROGRAMMING, 1994, 7 (03): : 56 - &
  • [2] An approach to querying multiple object Databases
    Koh, JL
    Chen, ALP
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2002, 18 (02) : 281 - 310
  • [3] A survey on querying encrypted XML documents for databases as a service
    Unay, Ozan
    Gundem, Taflan I.
    SIGMOD RECORD, 2008, 37 (01) : 12 - 20
  • [4] Relational databases for querying XML documents: Limitations and opportunities
    Shanmugasundaram, J
    Tufte, K
    He, G
    Zhang, C
    DeWitt, D
    Naughton, J
    PROCEEDINGS OF THE TWENTY-FIFTH INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, 1999, : 302 - 314
  • [5] Foundations of a new proposal for querying relational databases and XML documents
    García, AF
    Larrea, MJG
    Aguilar, LJ
    Labrador, JLD
    WEB ENGINEERING, PROCEEDINGS, 2003, 2722 : 411 - 412
  • [6] A middleware approach to storing and querying XML documents in relational databases
    Sevkli, Z
    Mercan, M
    Kurt, A
    ADVANCES IN INFORMATION SYSTEMS, PROCEEDINGS, 2004, 3261 : 223 - 233
  • [7] Flexible querying in multimedia databases with an object query language
    Bosc, P
    Connan, F
    Rocacher, D
    1998 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AT THE IEEE WORLD CONGRESS ON COMPUTATIONAL INTELLIGENCE - PROCEEDINGS, VOL 1-2, 1998, : 1308 - 1313
  • [8] Storing and querying XML documents using a path table in relational databases
    Shin, BJ
    Jin, M
    CONCEPTUAL MODELING FOR NOVEL APPLICATION DOMAINS, PROCEEDINGS, 2003, 2814 : 285 - 296
  • [9] Performance of querying temporal attributes in object-relational databases
    Kleiner, C
    Lipeck, UW
    NINTH INTERNATIONAL SYMPOSIUM ON TEMPORAL REPRESENTATION AND REASONING, PROCEEDINGS, 2002, : 58 - 60
  • [10] Representing and querying space object registration data using graph databases
    Le May, S.
    Carter, B.A.
    Gehly, S.
    Flegel, S.
    Jah, M.
    Acta Astronautica, 2020, 173 : 392 - 403