Semantic integration of heterogeneous information sources

Cited by: 165
Authors
Bergamaschi, S
Castano, S
Vincini, M
Beneventano, D
Affiliations
[1] Univ Modena & Reggio Emilia, Dipt Sci Ingn, I-41100 Modena, Italy
[2] CNR, CSITE, I-40126 Bologna, Italy
[3] Univ Milan, Milan, Italy
Keywords
information extraction; information integration; semistructured data; semantic heterogeneity; Description Logics; clustering techniques
DOI
10.1016/S0169-023X(00)00047-1
Chinese Library Classification number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Developing intelligent tools for the integration of information extracted from multiple heterogeneous sources is a challenging issue to effectively exploit the numerous sources available on-line in global information systems. In this paper, we propose intelligent, tool-supported techniques for information extraction and integration from both structured and semistructured data sources. An object-oriented language with an underlying Description Logic, called ODLI3 and derived from the ODMG standard, is introduced for information extraction. ODLI3 descriptions of the source schemas are exploited first to set a Common Thesaurus for the sources. Information integration is then performed in a semiautomatic way by exploiting the knowledge in the Common Thesaurus and ODLI3 descriptions of source schemas with a combination of clustering techniques and Description Logics. This integration process gives rise to a virtual integrated view of the underlying sources for which mapping rules and integrity constraints are specified to handle heterogeneity. Integration techniques described in the paper are provided in the framework of the MOMIS system based on a conventional wrapper/mediator architecture. (C) 2001 Elsevier Science B.V. All rights reserved.
Pages: 215-249
Page count: 35
Related papers
50 in total
  • [41] Semantic integration of manufacturing data sources
    Wang, Mingwei
    Zhang, Shusheng
    Zhou, Jingtao
    Zhao, Han
    [J]. ADVANCES IN MATERIALS MANUFACTURING SCIENCE AND TECHNOLOGY II, 2006, 532-533 : 1156 - +
  • [42] Semantic Based Query Rewriting in Heterogeneous Sources
    Aslam, Ammara
    Khan, Sharifullah
    Latif, Khalid
    [J]. 2008 INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES, PROCEEDINGS, 2008, : 294 - 299
  • [43] Semantic Summarization of News from Heterogeneous Sources
    Amato, Flora
    d'Acierno, Antonio
    Colace, Francesco
    Moscato, Vincenzo
    Penta, Antonio
    Picariello, Antonio
    [J]. ADVANCES ON P2P, PARALLEL, GRID, CLOUD AND INTERNET COMPUTING, 2017, 1 : 305 - 314
  • [44] Authors semantic disambiguation on heterogeneous bibliographic sources
    Ortiz, Jose
    Segarra, Jose
    Sumba, Xavier
    Cullcay, Jose
    Espinoza, Mauricio
    Saquicela, Victor
    [J]. 2017 XLIII LATIN AMERICAN COMPUTER CONFERENCE (CLEI), 2017,
  • [45] Semantic matching across heterogeneous data sources
    Zhao, Huimin
    [J]. COMMUNICATIONS OF THE ACM, 2007, 50 (01) : 45 - 50
  • [46] A Semantic Matching Approach for Mediating Heterogeneous Sources
    Schneider, Michel
    Bejaoui, Lotfi
    Bertin, Guillaume
    [J]. METADATA AND SEMANTICS, 2009, : 537 - +
  • [47] A Semantic Integration System for Heterogeneous Bioinformatics Data
    Dai, Weidi
    Cheng, Jianlai
    Wang, Qiuwen
    [J]. PROCEEDINGS OF 2012 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT 2012), 2012, : 1072 - 1076
  • [48] A Semantic Web Approach to Heterogeneous Metadata Integration
    Liao, Shu-Hsien
    Huang, Hong-Chu
    Chen, Ya-Ning
    [J]. COMPUTATIONAL COLLECTIVE INTELLIGENCE: TECHNOLOGIES AND APPLICATIONS, PT I, 2010, 6421 : 205 - +
  • [49] Semantic Integration of Heterogeneous and Complex Spreadsheet Tables
    Bonfitto, Sara
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2021), PT III, 2021, 12683 : 643 - 646
  • [50] Integration of pre-existing heterogeneous information sources in a knowledge management system
    Staniszkis, W
    Kalka, E
    Nittner, G
    Staniszkis, E
    Strychowski, J
    [J]. ELECTRONIC GOVERNMENT, PROCEEDINGS, 2004, 3183 : 507 - 514