Reconciling inconsistent data in probabilistic XML data integration

被引:0
|
作者
Pankowski, Tadeusz [1 ]
机构
[1] Poznan Univ Tech, Inst Control & Informat Engn, Poznan, Poland
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The problem of dealing with inconsistent data while integrating XML data from different sources is an important task, necessary to improve data integration quality. Typically, in order to remove inconsistencies, i.e. conflicts between data, data cleaning (or repairing) procedures are applied. In this paper, we present a probabilistic XML data integration setting. A probability is assigned to each data source and its probability models the reliability level of the data source. In this way, an answer (a tuple of values of XML trees) has a probability assigned to it. The problem is how to compute such probability, especially when the same answer is produced by many sources. We consider three semantics for computing such probabilistic answers: by-peer, by-sequence, and by-subtree semantics. The probabilistic answers can be used for resolving a class of inconsistencies violating XML functional dependencies defined over the target schema. Having a probability distribution over a set of conflicting answers, we can choose the one for which the probability of being correct is the highest.
引用
收藏
页码:75 / 86
页数:12
相关论文
共 50 条
  • [1] A probabilistic XML approach to data integration
    van Keulen, M
    de Keijzer, A
    Alink, W
    ICDE 2005: 21ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2005, : 459 - 470
  • [2] A probabilistic XML approach to data integration
    Van Keulen, M. (m.vankeulen@utwente.nl), IEEE Computer Society; The Database Society of Japan, DBSJ; Information Processing Society of Japan, IPSJ; Institute of Electronics, Info. Commun. Engineers, IEICE (Institute of Electrical and Electronics Engineers Computer Society):
  • [3] Consistent data for inconsistent XML document
    Tan, Zijing
    Zhang, Zijun
    Wang, Wei
    Shi, Baile
    INFORMATION AND SOFTWARE TECHNOLOGY, 2007, 49 (9-10) : 947 - 959
  • [4] Querying and repairing inconsistent XML data
    Flesca, S
    Furfaro, F
    Greco, S
    Zumpano, E
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2005, 2005, 3806 : 175 - 188
  • [5] Repairing inconsistent merged XML data
    Ng, W
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2003, 2736 : 244 - 255
  • [6] Converting probabilistic relational data to probabilistic XML data tree
    Wang J.
    Hao Z.
    Information Technology Journal, 2010, 9 (08) : 1706 - 1712
  • [7] XML and data integration
    Bertino, E
    Ferrari, E
    IEEE INTERNET COMPUTING, 2001, 5 (06) : 75 - 76
  • [8] Integration of XML data
    Saccol, DD
    Heuser, CA
    EFFICIENCY AND EFFECTIVENESS OF XML TOOLS AND TECHNIQUES AND DATA INTEGRATION OVER THE WEB, 2003, 2590 : 68 - 80
  • [9] XML data integration with identification
    Poggi, A
    Abiteboul, S
    DATABASE PROGRAMMING LANGUAGES, 2005, 3774 : 106 - 121
  • [10] AN IMPLEMENTATION OF XML DATA INTEGRATION
    Pan, Weidong
    Liu, Jixue
    Tian, Jiashen
    ICEIS 2008: PROCEEDINGS OF THE TENTH INTERNATIONAL CONFERENCE ON ENTERPRISE INFORMATION SYSTEMS, VOL DISI: DATABASES AND INFORMATION SYSTEMS INTEGRATION, 2008, : 111 - 116