Querying documents in object databases

被引:315
|
作者
Abiteboul S. [1 ]
Cluet S. [1 ]
Christophides V. [1 ]
Milo T. [2 ]
Moerkotte G. [3 ]
Siméon J. [1 ]
机构
[1] INRIA-Rocquencourt, F-78153 Le Chesnay Cedex
[2] Tel Aviv University, Ramat Aviv
[3] Lehrstuhl für Praktische Informatik III, Seminargebäude A5, Universität Mannheim
关键词
Generalized path expressions; ODMG; OQL; Pattern matching;
D O I
10.1007/s007990050001
中图分类号
学科分类号
摘要
We consider the problem of storing and accessing documents (SGML and HTML, in particular) using database technology. To specify the database image of documents, we use structuring schemas that consist in grammars annotated with database programs. To query documents, we introduce an extension of OQL, the ODMG standard query language for object databases. Our extension (named OQL-doc) allows us to query documents without a precise knowledge of their structure using in particular generalized path expressions and pattern matching. This allows us to introduce in a declarative language (in the style of SQL or OQL), navigational and information retrieval styles of accessing data. Query processing in the context of documents and path expressions leads to challenging implementation issues. We extend an object algebra with new operators to deal with generalized path expressions. We then consider two essential complementary optimization techniques. We show that almost standard database optimization techniques can be used to answer queries without having to load the entire document into the database. We also consider the interaction of full-text indexes (e.g., inverted files) with standard database collection indexes (e.g., B-trees) that provide important speed-up. © Springer-Verlag 1997.
引用
收藏
页码:5 / 19
页数:14
相关论文
共 50 条
  • [41] Modeling and querying video databases
    Decleir, C
    Hacid, MS
    Kouloumdjian, J
    24TH EUROMICRO CONFERENCE - PROCEEDING, VOLS 1 AND 2, 1998, : 492 - 498
  • [42] Querying Encrypted Graph Databases
    Aburawi, Nahla
    Lisitsa, Alexei
    Coenen, Frans
    ICISSP: PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON INFORMATION SYSTEMS SECURITY AND PRIVACY, 2018, : 447 - 451
  • [43] Functional Querying in Graph Databases
    Pokorny, Jaroslav
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2017, PT I, 2017, 10191 : 291 - 301
  • [44] Querying and learning in probabilistic databases
    Dylla, Maximilian
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8714 : 313 - 368
  • [45] Keyword Querying and Ranking in Databases
    Chaudhuri, Surajit
    Das, Gautam
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2009, 2 (02): : 1658 - 1659
  • [46] Querying Large Graph Databases
    Ke, Yiping
    Cheng, James
    Yu, Jeffrey Xu
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT II, PROCEEDINGS, 2010, 5982 : 487 - +
  • [47] Querying sequence databases with transducers
    Anthony J. Bonner
    Giansalvatore Mecca
    Acta Informatica, 2000, 36 : 511 - 544
  • [48] QUERYING DESIGN AND PLANNING DATABASES
    IMIELINSKI, T
    NAQVI, S
    VADAPARTY, K
    LECTURE NOTES IN COMPUTER SCIENCE, 1991, 566 : 524 - 545
  • [49] Automated querying of genome databases
    Schattner, Peter
    PLOS COMPUTATIONAL BIOLOGY, 2007, 3 (01) : 3 - 8
  • [50] Modelling and querying sensor databases
    Zechinelli-Martini, Jose-Luis
    Elias-Morales, Ilian
    ENC 2007: EIGHTH MEXICAN INTERNATIONAL CONFERENCE ON CURRENT TRENDS IN COMPUTER SCIENCE, PROCEEDINGS, 2007, : 138 - +