Using ontologies for measuring semantic similarity in data warehouse schema matching process

被引:0
|
作者
Banek, M. [1 ]
Vrdoljak, B. [1 ]
Tjoa, A. M. [2 ]
机构
[1] Univ Zagreb, Fac Elect Engn & Comp, Zagreb, Croatia
[2] Vienna Univ Technol, Inst Software Technol & Interact Syst, Vienna, Austria
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The key step of data warehouse integration is the construction of mappings that link mutually compatible components of data warehouse schemas: dimensions, aggregation levels, attributes and facts. In order to perform the integration process in a semi-automated manner, we must define similarity functions that compare the names and substructures of those structure elements. During the last decade, many approaches to measuring semantic similarity between lexical terms have been introduced, most of them based either on the taxonomy of WordNet, a large lexical and thesaurus database of English language, or on the previously measured language statistic corpus. This paper presents a novel semantic similarity technique, based on edge counting, which combines WordNet and domain ontologies written in OWL and is implemented as a Java software. Ontologies are designed by domain experts and thus provide a better and more trustworthy source for calculating similarity, and the fact that the terms are related closer than in WordNet results in a higher similarity.
引用
收藏
页码:227 / +
页数:2
相关论文
共 50 条
  • [21] Semantic Similarity Analysis of XML Schema Using Grid Computing
    Kim, Jaewook
    Lee, Sookyoung
    Halem, Milton
    Peng, Yun
    PROCEEDINGS OF THE 2009 IEEE INTERNATIONAL CONFERENCE ON INFORMATION REUSE AND INTEGRATION, 2008, : 57 - 62
  • [22] Automating the schema matching process for heterogeneous data warehouses
    Banek, Marko
    Vrdoljak, Boris
    Tjoa, A. Min
    Skocir, Zoran
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2007, 4654 : 45 - +
  • [23] Ontology-based semantic matching in distributed Active data warehouse
    Hu, Hua
    Ji, Lidan
    Xu, Bin
    Yuan, Chenxiang
    DCABES 2006 PROCEEDINGS, VOLS 1 AND 2, 2006, : 160 - 164
  • [24] Measuring concept similarity in ontologies using weighted concept paths
    Rusu, Delia
    Fortuna, Blaz
    Mladenic, Dunja
    APPLIED ONTOLOGY, 2014, 9 (01) : 65 - 95
  • [25] Measuring Taxonomic Relationships in Ontologies Using Lexical Semantic Relatedness
    Wu, Fang
    Lu, Zhao
    Yan, Yu
    Gu, Junzhong
    2009 SECOND INTERNATIONAL CONFERENCE ON THE APPLICATIONS OF DIGITAL INFORMATION AND WEB TECHNOLOGIES (ICADIWT 2009), 2009, : 784 - 789
  • [26] Measuring quality of similarity functions in approximate data matching
    da Silva, Roberto
    Stasiu, Raquel
    Orengo, Viviane Moreira
    Heuser, Carlos A.
    JOURNAL OF INFORMETRICS, 2007, 1 (01) : 35 - 46
  • [27] Searching for services on the semantic web using process ontologies
    Klein, M
    Bernstein, A
    EMERGING SEMANTIC WEB, 2002, 75 : 153 - 166
  • [28] XML application schema matching using similarity measure and relaxation labeling
    Yi, SZ
    Huang, B
    Chan, WT
    INFORMATION SCIENCES, 2005, 169 (1-2) : 27 - 46
  • [29] Interpreting similarity measures: Bridging the gap between schema matching and data integration
    Gal, Avigdor
    2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, VOLS 1 AND 2, 2008, : 345 - 352
  • [30] Using process data to populate ontologies
    Venkataraman, P
    Mendonça, D
    2003 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS, VOLS 1-5, CONFERENCE PROCEEDINGS, 2003, : 2156 - 2161