Reconciling inconsistent data in probabilistic XML data integration

Cited by: 0
Author
Pankowski, Tadeusz [1]
Affiliation
[1] Poznan Univ Tech, Inst Control & Informat Engn, Poznan, Poland
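Keywords
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract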
Dealing with inconsistent data while integrating XML data from different sources is an important task, necessary to improve data integration quality. Typically, data cleaning (or repairing) procedures are applied to remove inconsistencies, i.e. conflicts between data. In this paper, we present a probabilistic XML data integration setting. A probability is assigned to each data source and models the reliability level of that source. In this way, each answer (a tuple of values of XML trees) has a probability assigned to it. The problem is how to compute this probability, especially when the same answer is produced by many sources. We consider three semantics for computing such probabilistic answers: by-peer, by-sequence, and by-subtree semantics. The probabilistic answers can be used to resolve a class of inconsistencies that violate XML functional dependencies defined over the target schema. Given a probability distribution over a set of conflicting answers, we can choose the answer with the highest probability of being correct.
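The last step of the abstract, choosing the most probable answer among conflicting ones, can be illustrated with a minimal Python sketch. It does not implement the paper's by-peer, by-sequence, or by-subtree semantics, which the abstract does not define. Instead it assumes, purely for illustration, that sources are independent and that the support for an answer produced by several sources is combined with a noisy-or rule. The function names `combine_noisy_or` and `resolve_conflict` and the sample reliabilities are hypothetical.

```python
from collections import defaultdict


def combine_noisy_or(reliabilities):
    """Probability that at least one of several sources is right.

    ASSUMPTION: sources are independent. This is an illustrative rule,
    not one of the three semantics named in the abstract.
    """
    p_all_wrong = 1.0
    for r in reliabilities:
        p_all_wrong *= (1.0 - r)
    return 1.0 - p_all_wrong


def resolve_conflict(candidates):
    """Pick the most probable answer among conflicting ones.

    candidates: list of (answer, source_reliability) pairs that all share
    the same left-hand side of a functional dependency and therefore
    should agree on the answer.
    Returns (best_answer, distribution), where distribution maps each
    distinct answer to its normalized probability.
    """
    by_answer = defaultdict(list)
    for answer, reliability in candidates:
        by_answer[answer].append(reliability)

    # Combine the support of all sources that produced the same answer.
    scores = {a: combine_noisy_or(rs) for a, rs in by_answer.items()}

    total = sum(scores.values())
    if total == 0:
        raise ValueError("no candidate has positive support")

    # Normalize so the conflicting answers form a probability distribution.
    distribution = {a: s / total for a, s in scores.items()}
    best = max(distribution, key=distribution.get)
    return best, distribution


# Hypothetical conflict: three sources disagree on a publication year.
best, dist = resolve_conflict([("2007", 0.9), ("2008", 0.6), ("2008", 0.5)])
```

In this hypothetical case the answer "2007" wins: its single source has reliability 0.9, while the two weaker sources backing "2008" combine to 0.8 under the noisy-or assumption.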
Pages: 75-86
Number of pages: 12
Related papers
50 in total
  • [31] Semantic integration of XML heterogeneous data sources
    Reynaud, C
    Sirot, JP
    Vodislav, D
    2001 INTERNATIONAL DATABASE ENGINEERING & APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2001, : 199 - 208
  • [32] GRAPH QUERIES FOR DATA INTEGRATION USING XML
    Tulchinsky, V. G.
    Yushchenko, A. K.
    Yushchenko, R. A.
    CYBERNETICS AND SYSTEMS ANALYSIS, 2008, 44 (02) : 292 - 303
  • [33] Semantic integration of heterogeneous XML data sources
    Kim, HH
    Park, SS
    OBJECT-ORIENTED INFORMATION SYSTEMS, PROCEEDINGS, 2002, 2425 : 95 - 107
  • [34] XML Data Integration Using Fragment Join
    Gong, Jian
    Cheung, David W.
    Mamoulis, Nikos
    Kao, Ben
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2009, 5463 : 334 - 338
  • [35] On Transiting Key in XML Data Transformation for Integration
    Shahriar, Md. Sumon
    Liu, Jixue
    INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS, 2009, 3 (01): : 101 - 115
  • [36] Data integration based WWW with XML and CORBA
    Lu, ZD
    Zhang, SZ
    WEB ENGINEERING, PROCEEDINGS, 2003, 2722 : 455 - 458
  • [37] XML data integration with OWL: Experiences & challenges
    Lehti, P
    Fankhauser, P
    2004 INTERNATIONAL SYMPOSIUM ON APPLICATIONS AND THE INTERNET, PROCEEDINGS, 2004, : 160 - 167
  • [38] An XML Schema-Based Data Integration
    Ran, Chong-Shan
    Wang, Ma-Chuan
    PROCEEDINGS OF 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (ICCSIT 2010), VOL 7, 2010, : 100 - 102
  • [39] Fuzzy Keyword Search over Probabilistic XML Data
    Zhao, Yue
    Wang, Guoren
    Yuan, Ye
    2015 12TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2015, : 2523 - 2527
  • [40] A web data integration technique based on XML
    Feng Shaorong
    Xiao Wenjun
    Feng Shaorong
    ICCSE'2006: PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION: ADVANCED COMPUTER TECHNOLOGY, NEW EDUCATION, 2006, : 648 - 651