Information retrieval and machine learning for probabilistic schema matching

被引:22
|
作者
Nottelmann, Henrik
Straccia, Umberto
机构
[1] CNR, ISTI, I-56124 Pisa, Italy
[2] Univ Duisburg Essen, Dept Informat, D-47048 Duisburg, Germany
关键词
schema matching; data exchange; probability theory; sPLMap;
D O I
10.1016/j.ipm.2006.10.014
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Schema matching is the problem of finding correspondences (mapping rules, e.g. logical formulae) between heterogeneous schemas e.g. in the data exchange domain, or for distributed IR in federated digital libraries. This paper introduces a probabilistic framework, called sPLMap, for automatically learning schema mapping rules, based on given instances of both schemas. Different techniques, mostly from the IR and machine learning fields, are combined for finding suitable mapping candidates. Our approach gives a probabilistic interpretation of the prediction weights of the candidates, selects the rule set with highest matching probability, and outputs probabilistic rules which are capable to deal with the intrinsic uncertainty of the mapping process. Our approach with different variants has been evaluated on several test sets. (c) 2006 Elsevier Ltd. All rights reserved.
引用
收藏
页码:552 / 576
页数:25
相关论文
共 50 条
  • [1] Semantic Retrieval of Learning Objects with Schema Matching
    Di Martino, Beniamino
    [J]. JOURNAL OF E-LEARNING AND KNOWLEDGE SOCIETY, 2009, 5 (03): : 49 - 58
  • [2] Thesaurus Performance with Information Retrieval: Schema Matching as A Case Study
    Sabbah, Thabit
    Selamat, Ali
    [J]. 2013 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2013), 2013, : 4494 - 4498
  • [3] Interface Schema Matching with the Machine Learning for Deep Web
    Zhu, Guanwen
    Wang, Hongbin
    Wang, Nianbin
    Jiao, QianQian
    [J]. PROCEEDINGS OF 2012 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2012), 2012, : 822 - 825
  • [4] SchemaLogix: Advancing Interoperability with Machine Learning in Schema Matching
    Raoui, Mohamed
    Ennaouri, Mohammed
    El Yazidi, Moulay Hafid
    Zellou, Ahmed
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (05) : 420 - 430
  • [5] Machine Learning for Information Retrieval
    Si, Luo
    Jin, Rong
    [J]. PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), 2011, : 1293 - 1293
  • [6] A study on machine learning techniques for the schema matching network problem
    Rodrigues, Diego
    Silva, Altigran da
    [J]. Journal of the Brazilian Computer Society, 2021, 27 (01)
  • [7] sPLMap: A probabilistic approach to schema matching
    Nottelmann, H
    Straccia, U
    [J]. ADVANCES IN INFORMATION RETRIEVAL, 2005, 3408 : 81 - 95
  • [8] Probabilistic model for schema understanding and matching
    Ratinov, LA
    Shimony, SE
    Gudes, E
    [J]. 2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7, 2004, : 4768 - 4773
  • [9] Applications of machine learning in information retrieval
    Cunningham, SJ
    Witten, IH
    Littin, J
    [J]. ANNUAL REVIEW OF INFORMATION SCIENCE AND TECHNOLOGY, 1999, 34 : 341 - 384
  • [10] PROBABILISTIC RECORD MATCHING USING MACHINE LEARNING TECHNIQUES
    Ross, J. M.
    Cota, Fermin R. N.
    Zaric, G. S.
    [J]. VALUE IN HEALTH, 2016, 19 (03) : A82 - A83