Semantic integration of heterogeneous information sources

被引:165
|
作者
Bergamaschi, S
Castano, S
Vincini, M
Beneventano, D
机构
[1] Univ Modena & Reggio Emilia, Dipt Sci Ingn, I-41100 Modena, Italy
[2] CNR, CSITE, I-40126 Bologna, Italy
[3] Univ Milan, Milan, Italy
关键词
information extraction; information integration; semistructured data; semantic heterogeneity; Description Logics; clustering techniques;
D O I
10.1016/S0169-023X(00)00047-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Developing intelligent tools for the integration of information extracted from multiple heterogeneous sources is a challenging issue to effectively exploit the numerous sources available on-line in global information systems. In this paper, we propose intelligent, tool-supported techniques to information extraction and integration from both structured and semistructured data sources. An object-oriented language, with an underlying Description Logic, called ODLI3, derived from the standard ODMG is introduced for information extraction. ODLI3 descriptions of the source schemas are exploited first to set a Common Thesaurus for the sources. Information integration is then performed in a semiautomatic way by exploiting the knowledge in the Common Thesaurus and ODLI3 descriptions of source schemas with a combination of clustering techniques and Description Logics. This integration process gives rise to a virtual integrated view of the underlying sources for which mapping rules and integrity constraints are specified to handle heterogeneity. Integration techniques described in the paper are provided in the framework of the MOMIS system based on a conventional wrapper/mediator architecture. (C) 2001 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:215 / 249
页数:35
相关论文
共 50 条
  • [1] Integration of heterogeneous information sources
    Kaukal, M
    Werthner, H
    [J]. INFORMATION AND COMMUNICATION TECHNOLOGIES IN TOURISM 2000, 2000, : 81 - 92
  • [2] Semantic integration of heterogeneous information sources using a knowledge-based system
    Adams, T
    Dullea, J
    Clark, P
    Sripada, S
    Barrett, T
    [J]. PROCEEDINGS OF THE FIFTH JOINT CONFERENCE ON INFORMATION SCIENCES, VOLS 1 AND 2, 2000, : 289 - 294
  • [3] An approach for semantic integration of heterogeneous data sources
    Fusco, Giuseppe
    Aversano, Lerina
    [J]. PEERJ COMPUTER SCIENCE, 2020, PeerJ Inc. (2020): : 1 - 30
  • [4] Semantic integration of XML heterogeneous data sources
    Reynaud, C
    Sirot, JP
    Vodislav, D
    [J]. 2001 INTERNATIONAL DATABASE ENGINEERING & APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2001, : 199 - 208
  • [5] Semantic integration of heterogeneous XML data sources
    Kim, HH
    Park, SS
    [J]. OBJECT-ORIENTED INFORMATION SYSTEMS, PROCEEDINGS, 2002, 2425 : 95 - 107
  • [6] Heterogeneous information integration method based on information semantic
    Dong, Mingzhe
    Zhang, Tongjun
    [J]. Jisuanji Gongcheng/Computer Engineering, 2005, 31 (02): : 202 - 203
  • [7] Integration of heterogeneous information sources in InfoWeaver
    Kitagawa, H
    Morishima, A
    Mizuguchi, H
    [J]. ADVANCES IN MULTIMEDIA AND DATABASES FOR THE NEW CENTURY: A SWISS/JAPANESE PERSPECTIVE, 2000, 10 : 124 - 137
  • [8] A semantic information gathering approach for heterogeneous information sources on WWW
    Arch-int, N
    Sophatsathit, P
    [J]. JOURNAL OF INFORMATION SCIENCE, 2003, 29 (05) : 357 - 374
  • [9] MISSION: an agent-based system for semantic integration of heterogeneous distributed statistical information sources
    McClean, S
    Scotney, B
    Rutjes, H
    Hartkamp, J
    Karali, I
    Hatzopoulos, M
    Lamb, J
    Ma, DF
    [J]. 16TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS, 2004, : 337 - 340
  • [10] Towards intelligent integration of heterogeneous information sources
    Navathe, SB
    Donahoo, MJ
    [J]. DATABASE REENGINEERING AND INTEROPERABILITY, 1996, : 275 - 282