Generating compact redundancy-free XML documents from conceptual-model hypergraphs

Cited by: 13
Authors
Mok, Wai Yin [1]
Embley, David W. [2]
Affiliations
[1] Univ Alabama, Dept Econ & Informat Syst, Huntsville, AL 35899 USA
[2] Brigham Young Univ, Dept Comp Sci, Provo, UT 84602 USA
Funding
National Science Foundation (USA)
Keywords
XML data redundancy; compact XML storage structures; XML scheme generation
DOI
10.1109/TKDE.2006.125
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
As XML data becomes more and more prevalent and as larger quantities of data find their way into XML documents, the need for quality XML data organization will only increase. One standard way of structuring data well is to reduce and, if possible, eliminate redundancy, while at the same time making the storage structures as compact as possible. In this paper, we present a methodology to generate XML storage structures where conforming XML documents are redundancy-free, and for most practical cases, are also fully compact. Our methodology assumes the input is a conceptual-model hypergraph. For the special case that every edge in the hypergraph is binary, we present a simple algorithm, guaranteed to always generate redundancy-free storage structures. We show, however, that generating a minimum number of redundancy-free storage structures is NP-hard. We therefore provide heuristics to guide the process and observe that these heuristics result in satisfactory solutions, which are often optimal. We then present a general algorithm for n-ary edges and show that it generates redundancy-free storage structures. The general algorithm must overcome several problems that do not arise in the special case.
Pages: 1082-1096
Number of pages: 15