Handling probabilistic integrity constraints in pay-as-you-go reconciliation of data models

被引:2
|
作者
Nguyen Quoc Viet Hung [1 ]
Weidlich, Matthias [2 ]
Nguyen Thanh Tam [3 ]
Miklos, Zoltan [4 ]
Aberer, Karl [3 ]
Gal, Avigdor [5 ]
Stantic, Bela [1 ]
机构
[1] Griffith Univ, Gold Coast, Australia
[2] Humboldt Univ, Berlin, Germany
[3] Ecole Polytech Fed Lausanne, Lausanne, Switzerland
[4] Univ Rennes 1, Rennes, France
[5] Israel Inst Technol, Haifa, Israel
关键词
Data integration; Probabilistic constraints; Model reconciliation; WEB; ALIGNMENT; PATTERNS;
D O I
10.1016/j.is.2019.04.002
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data models capture the structure and characteristic properties of data entities, e.g., in terms of a database schema or an ontology. They are the backbone of diverse applications, reaching from information integration, through peer-to-peer systems and electronic commerce to social networking. Many of these applications involve models of diverse data sources. Effective utilisation and evolution of data models, therefore, calls for matching techniques that generate correspondences between their elements. Various such matching tools have been developed in the past. Yet, their results are often incomplete or erroneous, and thus need to be reconciled, i.e., validated by an expert. This paper analyses the reconciliation process in the presence of large collections of data models, where the network induced by generated correspondences shall meet consistency expectations in terms of integrity constraints. We specifically focus on how to handle data models that show some internal structure and potentially differ in terms of their assumed level of abstraction. We argue that such a setting calls for a probabilistic model of integrity constraints, for which satisfaction is preferred, but not required. In this work, we present a model for probabilistic constraints that enables reasoning on the correctness of individual correspondences within a network of data models, in order to guide an expert in the validation process. To support pay-as-you-go reconciliation, we also show how to construct a set of high-quality correspondences, even if an expert validates only a subset of all generated correspondences. We demonstrate the efficiency of our techniques for real-world datasets comprising database schemas and ontologies from various application domains. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:166 / 180
页数:15
相关论文
共 50 条
  • [41] REDISTRIBUTION AND THE EFFICIENCY OF THE PAY-AS-YOU-GO PENSION SYSTEM
    BRUNNER, JK
    [J]. JOURNAL OF INSTITUTIONAL AND THEORETICAL ECONOMICS-ZEITSCHRIFT FUR DIE GESAMTE STAATSWISSENSCHAFT, 1994, 150 (03): : 511 - 523
  • [42] The case for Pay-As-You-Go pensions in a service economy
    van Groezen, Bas
    Meijdam, Lex
    Verbon, Harrie A. A.
    [J]. SCOTTISH JOURNAL OF POLITICAL ECONOMY, 2007, 54 (02) : 151 - 165
  • [43] Disk based pay-as-you-go record linkage
    Chenchen Sun
    Derong Shen
    [J]. Frontiers of Computer Science, 2022, 16
  • [44] Pay-as-you-go searching - A review of HaveltAll IR
    Rydzak, JW
    Bourassa, PN
    [J]. SPECTROSCOPY, 2001, 16 (02) : 18 - +
  • [45] Disk based pay-as-you-go record linkage
    Sun, Chenchen
    Shen, Derong
    [J]. FRONTIERS OF COMPUTER SCIENCE, 2022, 16 (04)
  • [46] POSSIBLE REFORMS OF PAY-AS-YOU-GO PENSION SYSTEMS
    Banyar, Jozsef
    [J]. EUROPEAN JOURNAL OF SOCIAL SECURITY, 2016, 18 (03) : 286 - 308
  • [47] Pay-as-you-go pensions and the political power of the retirees
    Casamatta, G
    [J]. REVUE ECONOMIQUE, 2000, 51 : 133 - 142
  • [48] Repayment performance for pay-as-you-go solar lamps
    Guajardo, Jose A.
    [J]. ENERGY FOR SUSTAINABLE DEVELOPMENT, 2021, 63 : 78 - 85
  • [49] Longevity, health spending, and pay-as-you-go pensions
    Pestieau, Pierre
    Ponthiere, Gregory
    Sato, Motohiro
    [J]. FINANZARCHIV, 2008, 64 (01): : 1 - 18
  • [50] Pay-As-You-Go Pension, Bargaining Power, and Fertility
    Komura, Mizuki
    Ogawa, Hikaru
    [J]. FINANZARCHIV, 2018, 74 (02): : 235 - 259