Improving XML Data Quality with Functional Dependencies

被引:0
|
作者
Tan, Zijing [1 ]
Zhang, Liyong [1 ]
机构
[1] Fudan Univ, Sch Comp Sci, Shanghai 200433, Peoples R China
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We study the problem of repairing XML functional dependency violations by making the smallest value modifications in terms of repair cost. Our cost model assigns a weight to each leaf node in the XML document, and the cost of a repair is measured by the total weight of the modified nodes. We show that it is beyond reach in practice to find optimum repairs: this problem is already NP-complete for a setting with a fixed DTD, a fixed set of functional dependencies, and equal weights for all the nodes in the XML document. To this end we provide an efficient two-step heuristic method to repair XML functional dependency violations. First, the initial violations are captured and fixed by leveraging the conflict hypergraph. Second, the remaining conflicts are resolved by modifying the violating nodes and their related nodes called determinants, in a way that guarantees no new violations. The experimental results demonstrate that our algorithm scales well and is effective in improving data quality.
引用
收藏
页码:450 / 465
页数:16
相关论文
共 50 条
  • [1] Fast Detection of Functional Dependencies in XML Data
    Shi, Hang
    Amagasa, Toshiyuki
    Kitagawa, Hiroyuki
    [J]. DATABASE AND XML TECHNOLOGIES, 2010, 6309 : 113 - +
  • [2] Functional Dependencies for XML
    Chen, Haitao
    Liao, Husheng
    Gao, Zengqi
    [J]. WEB-AGE INFORMATION MANAGEMENT, 2010, 6185 : 110 - 115
  • [3] Functional dependencies for XML
    Vincent, MW
    Liu, JX
    [J]. WEB TECHNOLOGIES AND APPLICATIONS, 2003, 2642 : 22 - 34
  • [4] On the Existence of Armstrong Data Trees for XML Functional Dependencies
    Hartmann, Sven
    Koehler, Henning
    Trinh, Thu
    [J]. FOUNDATIONS OF INFORMATION AND KNOWLEDGE SYSTEMS, PROCEEDINGS, 2010, 5956 : 94 - +
  • [5] Repairs and consistent answers for XML data with functional dependencies
    Flesca, S
    Furfaro, F
    Greco, S
    Zumpano, E
    [J]. DATABASE AND XML TECHNOLOGIES, 2003, 2824 : 238 - 253
  • [6] Functional dependencies in XML documents
    Yan, P
    Lv, T
    [J]. ADVANCED WEB AND NETWORK TECHNOLOGIES, AND APPLICATIONS, PROCEEDINGS, 2006, 3842 : 29 - 37
  • [7] More functional dependencies for XML
    Hartmann, S
    Link, S
    [J]. ADVANCES IN DATABASES AND INFORMATION SYSTEMS, PROCEEDINGS, 2003, 2798 : 355 - 369
  • [8] Functional dependencies for XML databases
    Dong Dong
    Wuwongse, Vilas
    [J]. ICCSE'2006: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION: ADVANCED COMPUTER TECHNOLOGY, NEW EDUCATION, 2006, : 249 - 255
  • [9] Designing functional dependencies for XML
    Lee, ML
    Ling, TW
    Low, WL
    [J]. ADVANCES IN DATABASE TECHNOLOGY - EDBT 2002, 2002, 2287 : 124 - 141
  • [10] FOX: Inference of approximate functional dependencies from XML data
    Fassetti, Fabio
    Fazzinga, Bettina
    [J]. DEXA 2007: 18TH INTERNATIONAL CONFERENCE ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2007, : 10 - +