Using ontologies for measuring semantic similarity in data warehouse schema matching process

被引:0
|
作者
Banek, M. [1 ]
Vrdoljak, B. [1 ]
Tjoa, A. M. [2 ]
机构
[1] Univ Zagreb, Fac Elect Engn & Comp, Zagreb, Croatia
[2] Vienna Univ Technol, Inst Software Technol & Interact Syst, Vienna, Austria
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The key step of data warehouse integration is the construction of mappings that link mutually compatible components of data warehouse schemas: dimensions, aggregation levels, attributes and facts. In order to perform the integration process in a semi-automated manner, we must define similarity functions that compare the names and substructures of those structure elements. During the last decade, many approaches to measuring semantic similarity between lexical terms have been introduced, most of them based either on the taxonomy of WordNet, a large lexical and thesaurus database of English language, or on the previously measured language statistic corpus. This paper presents a novel semantic similarity technique, based on edge counting, which combines WordNet and domain ontologies written in OWL and is implemented as a Java software. Ontologies are designed by domain experts and thus provide a better and more trustworthy source for calculating similarity, and the fact that the terms are related closer than in WordNet results in a higher similarity.
引用
收藏
页码:227 / +
页数:2
相关论文
共 50 条
  • [31] SLCO and DLCO: Two Ontologies for Detecting and Resolving Schema and Data-Level Semantic Conflicts
    Wen, Jing
    Zhang, Shidong
    Yan, Zhongmin
    ICIA: 2009 INTERNATIONAL CONFERENCE ON INFORMATION AND AUTOMATION, VOLS 1-3, 2009, : 622 - 627
  • [32] Measuring product semantic similarity by exploiting a manufacturing process ontology
    Bruno, Giulia
    2015 INTERNATIONAL CONFERENCE ON INDUSTRIAL ENGINEERING AND SYSTEMS MANAGEMENT (IESM), 2015, : 1251 - 1257
  • [33] The semantic measures library and toolkit: fast computation of semantic similarity and relatedness using biomedical ontologies
    Harispe, Sebastien
    Ranwez, Sylvie
    Janaqi, Stefan
    Montmain, Jacky
    BIOINFORMATICS, 2014, 30 (05) : 740 - 742
  • [34] Data warehouse integration using best fit matching
    Holmes, D
    Maxwell, D
    IKE'03: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE ENGINEERING, VOLS 1 AND 2, 2003, : 177 - 181
  • [35] Measuring Semantic Similarity between Words Using Wikipedia
    Lu Zhiqiang
    Shao Werimin
    Yu Zhenhua
    WISM: 2009 INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND MINING, PROCEEDINGS, 2009, : 251 - +
  • [36] Measuring Semantic Similarity between Words Using HowNet
    Dai, Liuling
    Liu, Bin
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, 2008, : 601 - +
  • [37] Measuring Semantic Similarity Using Web Search Engine
    Shanmugapriya
    Latha, K.
    2013 INTERNATIONAL CONFERENCE ON ADVANCED NANOMATERIALS AND EMERGING ENGINEERING TECHNOLOGIES (ICANMEET), 2013, : 639 - 644
  • [38] Evaluating Taxonomic Relationships Using Semantic Similarity Measures on Sensor Domain Ontologies
    Tovar Vidal, Mireya
    Hernandez Garcia, Aimee Cecilia
    Lavalle Martinez, Jose de Jesus
    Reyes-Ortiz, Jose A.
    Vilarino Ayala, Dames
    ADVANCES IN INFORMATION AND COMMUNICATION, VOL 2, 2020, 1130 : 282 - 294
  • [39] Towards the estimation of feature-based semantic similarity using multiple ontologies
    Sole-Ribalta, Albert
    Sanchez, David
    Batet, Montserrat
    Serratosa, Francesc
    KNOWLEDGE-BASED SYSTEMS, 2014, 55 : 101 - 113
  • [40] Selecting a semantic similarity measure for concepts in two different CAD model data ontologies
    Lu, Wenlong
    Qin, Yuchu
    Qi, Qunfen
    Zeng, Wenhan
    Zhong, Yanru
    Liu, Xiaojun
    Jiang, Xiangqian
    ADVANCED ENGINEERING INFORMATICS, 2016, 30 (03) : 449 - 466