Efficient extraction of schemas for XML documents

被引:28
|
作者
Min, JK [1 ]
Ahn, JY [1 ]
Chung, CW [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Dept Elect Engn & Comp Sci, Div Comp Sci, Yusong Gu, Taejon 305701, South Korea
关键词
XML; automatic schema extraction; DTD; XML schema; databases;
D O I
10.1016/S0020-0190(02)00345-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present a technique for efficient extraction of concise and accurate schemas for XML documents. By restricting the schema form and applying some heuristic rules, we achieve the efficiency and conciseness. The result of an experiment with real-life DTDs shows that our approach attains high accuracy and is 20 to 200 times faster than existing approaches. (C) 2002 Elsevier Science B.V All rights reserved.
引用
收藏
页码:7 / 12
页数:6
相关论文
共 50 条
  • [1] Transforming XML Documents as Schemas Evolve
    Kwietniewski, Marcin
    Gryz, Jarek
    Hazlewood, Stephanie
    Van Run, Paul
    [J]. PROCEEDINGS OF THE VLDB ENDOWMENT, 2010, 3 (02): : 1577 - 1580
  • [2] A Framework of Summarizing XML Documents with Schemas
    Lv, Teng
    Yan, Ping
    [J]. INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2013, 10 (01) : 18 - 27
  • [3] A novel mining approach for schemas in XML documents
    Wang, Tong
    Liu, Daxin
    Sun, Wei
    [J]. FIRST INTERNATIONAL MULTI-SYMPOSIUMS ON COMPUTER AND COMPUTATIONAL SCIENCES (IMSCCS 2006), PROCEEDINGS, VOL 2, 2006, : 731 - +
  • [4] A Security Framework for XML Schemas and Documents for Healthcare
    Algarin, Alberto De la Rosa
    Demurjian, Steven A.
    Berhe, Solomon
    Pavlich-Mariscal, Jaime A.
    [J]. 2012 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS (BIBMW), 2012,
  • [5] Efficient schema extraction from a large collection of XML documents
    Xing, Guangming
    Parthepan, Vijayeandra
    [J]. PROCEEDINGS OF THE 49TH ANNUAL ASSOCIATION FOR COMPUTING MACHINERY SOUTHEAST CONFERENCE (ACMSE '11), 2011, : 92 - 96
  • [6] Using schemas to simplify access control for XML documents
    Ray, I
    Muller, M
    [J]. DISTRIBUTED COMPUTING AND INTERNET TECHNOLOGY, PROCEEDINGS, 2004, 3347 : 363 - 368
  • [7] Updating XML Schemas and Associated Documents through EXup
    Cavalieri, Federico
    Guerrini, Giovanna
    Mesiti, Marco
    [J]. IEEE 27TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2011), 2011, : 1320 - 1323
  • [8] An efficient algorithm for clustering XML schemas
    Rhim, TW
    Lee, KH
    Ko, MC
    [J]. WEB INFORMATION SYSTEMS - WISE 2004, PROCEEDINGS, 2004, 3306 : 372 - 377
  • [9] Schemas for Safe and Efficient XML Processing
    Colazzo, Dario
    Ghelli, Giorgio
    Sartiani, Carlo
    [J]. IEEE 27TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2011), 2011, : 1378 - +
  • [10] Computing simple and complex matchings between XML schemas for transforming XML documents
    Lee, Jun-Seung
    Lee, Kyong-Ho
    [J]. INFORMATION AND SOFTWARE TECHNOLOGY, 2006, 48 (09) : 937 - 946