Efficient extraction of schemas for XML documents

被引:28
|
作者
Min, JK [1 ]
Ahn, JY [1 ]
Chung, CW [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Dept Elect Engn & Comp Sci, Div Comp Sci, Yusong Gu, Taejon 305701, South Korea
关键词
XML; automatic schema extraction; DTD; XML schema; databases;
D O I
10.1016/S0020-0190(02)00345-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we present a technique for efficient extraction of concise and accurate schemas for XML documents. By restricting the schema form and applying some heuristic rules, we achieve the efficiency and conciseness. The result of an experiment with real-life DTDs shows that our approach attains high accuracy and is 20 to 200 times faster than existing approaches. (C) 2002 Elsevier Science B.V All rights reserved.
引用
收藏
页码:7 / 12
页数:6
相关论文
共 50 条
  • [21] Efficient memory representation of XML documents
    Busatto, G
    Lohrey, M
    Maneth, S
    [J]. DATABASE PROGRAMMING LANGUAGES, 2005, 3774 : 199 - 216
  • [22] Efficient incremental validation of XML documents
    Barbosa, D
    Mendelzon, AO
    Libkin, L
    Mignet, L
    Arenas, M
    [J]. 20TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2004, : 671 - 682
  • [23] Efficient Change Control of XML Documents
    Roennau, Sebastian
    Philipp, Geraint
    Borghoff, Uwe M.
    [J]. DOCENG'09: PROCEEDINGS OF THE 2009 ACM SYMPOSIUM ON DOCUMENT ENGINEERING, 2009, : 3 - 12
  • [24] Efficient fragmentation of large XML documents
    Bonifati, Angela
    Cuzzocrea, Alfredo
    [J]. Database and Expert Systems Applications, Proceedings, 2007, 4653 : 539 - 550
  • [25] Learning Concise Relax NG Schemas Supporting Interleaving from XML Documents
    Li, Yeting
    Mou, Xiaoying
    Chen, Haiming
    [J]. ADVANCED DATA MINING AND APPLICATIONS, ADMA 2018, 2018, 11323 : 303 - 317
  • [26] Information extraction and automatic markup for XML documents
    Abolhassani, M
    Fuhr, N
    Gövert, N
    [J]. INTELLIGENT SEARCH ON XML DATA: APPLICATIONS, LANGUAGES, MODELS IMPLEMENTATIONS AND BENCHMARKS, 2003, 2818 : 159 - 174
  • [27] Efficient inclusion checking for deterministic tree automata and XML schemas
    Champavere, Jerome
    Gilleron, Remi
    Lemay, Aurelien
    Niehren, Joachim
    [J]. INFORMATION AND COMPUTATION, 2009, 207 (11) : 1181 - 1208
  • [28] Efficient filtering of XML documents with XPath expressions
    Chan, CY
    Felber, P
    Garofalakis, M
    Rastogi, R
    [J]. VLDB JOURNAL, 2002, 11 (04): : 354 - 379
  • [29] Efficient filtering of XML documents with XPath expressions
    C.-Y. Chan
    P. Felber
    M. Garofalakis
    R. Rastogi
    [J]. The VLDB Journal, 2002, 11 : 354 - 379
  • [30] Simple yet efficient approach for maximal frequent subtrees extraction from a collection of XML documents
    Paik, Juryon
    Kim, Ung Mo
    [J]. WEB INFORMATION SYSTEMS - WISE 2006 WORKSHOPS, PROCEEDINGS, 2006, 4256 : 94 - 103