Schemas for Unordered XML on a DIME

被引:10
|
作者
Boneva, Iovka [1 ]
Ciucanu, Radu [1 ]
Staworko, Sawek [1 ]
机构
[1] Univ Lille, INRIA, Lille, France
关键词
Schemas for XML; Unordered XML; Regular expressions; Twig queries; Semi-structured data; DECISION-PROBLEMS; COMPLEXITY; INCLUSION; LOGIC;
D O I
10.1007/s00224-014-9593-1
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We investigate schema languages for unordered XML having no relative order among siblings. First, we propose unordered regular expressions (UREs), essentially regular expressions with unordered concatenation instead of standard concatenation, that define languages of unordered words to model the allowed content of a node (i.e., collections of the labels of children). However, unrestricted UREs are computationally too expensive as we show the intractability of two fundamental decision problems for UREs: membership of an unordered word to the language of a URE and containment of two UREs. Consequently, we propose a practical and tractable restriction of UREs, disjunctive interval multiplicity expressions (DIMEs). Next, we employ DIMEs to define languages of unordered trees and propose two schema languages: disjunctive interval multiplicity schema (DIMS), and its restriction, disjunction-free interval multiplicity schema (IMS). We study the complexity of the following static analysis problems: schema satisfiability, membership of a tree to the language of a schema, schema containment, as well as twig query satisfiability, implication, and containment in the presence of schema. Finally, we study the expressive power of the proposed schema languages and compare them with yardstick languages of unordered trees (FO, MSO, and Presburger constraints) and DTDs under commutative closure. Our results show that the proposed schema languages are capable of expressing many practical languages of unordered trees and enjoy desirable computational properties.
引用
下载
收藏
页码:337 / 376
页数:40
相关论文
共 50 条
  • [31] On semantic weighting and decomposition techniques for XML schemas
    Chen, YF
    Kuo, CCJ
    INTERNET MULTIMEDIA MANAGEMENT SYSTEMS V, 2004, 5601 : 100 - 110
  • [32] SEARCH-BASED EVOLUTION OF XML SCHEMAS
    Teodoro Silva, Julio Cesar
    Ramirez Pozo, Aurora Trinidad
    Vergilio, Silvia Regina
    Musicante, Martin A.
    COMPUTING AND INFORMATICS, 2012, 31 (03) : 573 - 595
  • [33] Using regular tree automata as XML schemas
    Chidlovskii, B
    IEEE ADVANCES IN DIGITAL LIBRARIES 2000, PROCEEDINGS, 2000, : 89 - 98
  • [34] An approach for clustering semantically heterogeneous XML Schemas
    De Meo, P
    Quattrone, G
    Terracina, G
    Ursino, D
    ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS 2005: COOPIS, DOA, AND ODBASE, PT 1, PROCEEDINGS, 2005, 3760 : 329 - 346
  • [35] A novel mining approach for schemas in XML documents
    Wang, Tong
    Liu, Daxin
    Sun, Wei
    FIRST INTERNATIONAL MULTI-SYMPOSIUMS ON COMPUTER AND COMPUTATIONAL SCIENCES (IMSCCS 2006), PROCEEDINGS, VOL 2, 2006, : 731 - +
  • [36] A software tool for object and XML schemas generation
    Chankuang, N
    Chittayasothorn, S
    2003 IEEE PACIFIC RIM CONFERENCE ON COMMUNICATIONS, COMPUTERS, AND SIGNAL PROCESSING, VOLS 1 AND 2, CONFERENCE PROCEEDINGS, 2003, : 675 - 678
  • [37] XML: Model, schemas, types, logics, and queries
    Klarlund, N
    Schwentick, T
    Suciu, D
    LOGICS FOR EMERGING APPLICATIONS OF DATABASES, 2004, : 1 - 41
  • [38] Mapping relational schemas to XML DTDs with constraints
    Lv, Teng
    Yan, Ping
    FIRST INTERNATIONAL MULTI-SYMPOSIUMS ON COMPUTER AND COMPUTATIONAL SCIENCES (IMSCCS 2006), PROCEEDINGS, VOL 2, 2006, : 528 - +
  • [39] Integration of XML Schemas at various "severity" levels
    De Meo, Pasquale
    Quattrone, Giovanni
    Terracina, Giorgio
    Ursino, Domenico
    INFORMATION SYSTEMS, 2006, 31 (06) : 397 - 434
  • [40] Hierarchical approach for datatype matching in XML schemas
    Hong-Minh, Tran
    Smith, Dan
    WORKSHOPS OF THE TWENTY FOURTH BRITISH NATIONAL CONFERENCE ON DATABASES, WORKSHOP PROCEEDINGS, 2007, : 120 - +