Structural and semantic aspects of similarity of Document Type Definitions and XML schemas

Citations: 19
Authors
Wojnar, Ales [1 ]
Mlynkova, Irena [1 ]
Dokulil, Jiri [1 ]
Affiliation
[1] Charles Univ Prague, Dept Software Engn, Fac Math & Phys, CR-11800 Prague 1, Czech Republic
Keywords
XML schema; DTD; XSD; Similarity; Data semantics; Structural analysis; PERFORMANCE; METHODOLOGY; ALGORITHM;
DOI
10.1016/j.ins.2009.12.024
Chinese Library Classification
TP [Automation Technology, Computer Technology];
Discipline classification code
0812 ;
Abstract
The natural optimization strategy for XML-to-relational mapping methods is exploitation of similarity of XML data. However, none of the current similarity evaluation approaches is suitable for this purpose. While the key emphasis is currently put on semantic similarity of XML data, the main aspect of XML-to-relational mapping methods is analysis of their structure. In this paper we propose an approach that utilizes a verified strategy for structural similarity evaluation - tree edit distance - to DTD constructs. This approach is able to cope with the fact that DTDs involve several types of nodes and can form general graphs. In addition, it is optimized for the specific features of XML data and, if required, it enables one to exploit the semantics of element/attribute names. Using a set of experiments we show the impact of these extensions on similarity evaluation. And, finally, we discuss how this approach can be extended for XSDs, which involve plenty of "syntactic sugar", i.e. constructs that are structurally or semantically equivalent. (C) 2010 Elsevier Inc. All rights reserved.
Pages: 1817-1836
Page count: 20
Related papers
50 in total
  • [21] A SEMANTIC APPROACH TO INTEGRATING XML SCHEMAS USING DOMAIN ONTOLOGIES
    Kang, Haeran
    Lee, Kyong-Ho
    COMPUTING AND INFORMATICS, 2011, 30 (04) : 857 - 879
  • [22] Simplify the Design of XML Schemas by Type Dependencies
    Liu, Jia
    Liao, Husheng
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2016, PT II, 2016, 9828 : 445 - 453
  • [23] A Hybrid Method to Evaluate Similarity of XML Document
    Dai, Yubiao
    Ren, Xueli
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON EDUCATION, MANAGEMENT, COMPUTER AND SOCIETY, 2016, 37 : 677 - 680
  • [24] A matching algorithm for measuring the structural similarity between an XML document and a DTD and its applications
    Bertino, E
    Guerrini, G
    Mesiti, M
    INFORMATION SYSTEMS, 2004, 29 (01) : 23 - 46
  • [25] Using a semantic model and XML for document annotation
    Czejdo, BD
    Sobaniec, C
    INTELLIGENT PROBLEM SOLVING: METHODOLOGIES AND APPROACHES, PROCEEDINGS, 2000, 1821 : 236 - 241
  • [26] Similarity Measure for Semantic Document Interconnections
    Hwang, Myunggwon
    Choi, Dongjin
    Choi, Junho
    Kim, Hanil
    Kim, Pankoo
    INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2010, 13 (02): : 253 - 267
  • [27] Almost automatic and semantic integration of XML Schemas at various "severity" levels
    De Meo, P
    Quattrone, G
    Terracina, G
    Ursino, D
    ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS 2003: COOPIS, DOA, AND ODBASE, 2003, 2888 : 4 - 21
  • [28] XML document similarity measure in terms of the structure and contents
    Kim, Woosaeng
    PROCEEDINGS OF THE 2ND WSEAS INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATIONS: MODERN TOPICS OF COMPUTER SCIENCE, 2008, : 205 - 212
  • [29] Fast detection of XML structural similarity
    Flesca, S
    Manco, G
    Masciari, E
    Pontieri, L
    Pugliese, A
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (02) : 160 - 175
  • [30] An XML document generator for semantic query optimization experimentation
    Geng, Ke
    Dobbie, Gillian
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2007, 3 (1-2) : 26 - +