Ontology Alignment Based on Word Embedding and Random Forest Classification

被引:9
|
作者
Nkisi-Orji, Ikechukwu [1 ]
Wiratunga, Nirmalie [1 ]
Massie, Stewart [1 ]
Hui, Kit-Ying [1 ]
Heaven, Rachel [2 ]
机构
[1] Robert Gordon Univ, Aberdeen, Scotland
[2] British Geol Survey, Nottingham, England
关键词
Ontology alignment; Word embedding; Machine classification; Semantic web; AGGREGATION;
D O I
10.1007/978-3-030-10925-7_34
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ontology alignment is crucial for integrating heterogeneous data sources and forms an important component of the semantic web. Accordingly, several ontology alignment techniques have been proposed and used for discovering correspondences between the concepts (or entities) of different ontologies. Most alignment techniques depend on string-based similarities which are unable to handle the vocabulary mismatch problem. Also, determining which similarity measures to use and how to effectively combine them in alignment systems are challenges that have persisted in this area. In this work, we introduce a random forest classifier approach for ontology alignment which relies on word embedding for determining a variety of semantic similarity features between concepts. Specifically, we combine string-based and semantic similarity measures to form feature vectors that are used by the classifier model to determine when concepts align. By harnessing background knowledge and relying on minimal information from the ontologies, our approach can handle knowledge-light ontological resources. It also eliminates the need for learning the aggregation weights of a composition of similarity measures. Experiments using Ontology Alignment Evaluation Initiative (OAEI) dataset and real-world ontologies highlight the utility of our approach and show that it can outperform state-of-the-art alignment systems. Code related to this paper is available at: https://bitbucket.org/paravariar/rafcom.
引用
收藏
页码:557 / 572
页数:16
相关论文
共 50 条
  • [31] Forest resource classification based on random forest and object oriented method
    Wang M.
    Zhang X.
    Wang J.
    Sun Y.
    Jian G.
    Pan C.
    Cehui Xuebao/Acta Geodaetica et Cartographica Sinica, 2020, 49 (02): : 235 - 244
  • [32] aiai at the FinSim-2 task: Finance Domain Terms Automatic Classification Via Word Ontology and Embedding
    Ke Tian
    Hua Chen
    WEB CONFERENCE 2021: COMPANION OF THE WORLD WIDE WEB CONFERENCE (WWW 2021), 2021, : 320 - 322
  • [33] Word Embedding-based Method for Entity Category Alignment of Geographic Knowledge Base
    Xu Z.
    Zhu Y.
    Song J.
    Sun K.
    Wang S.
    Zhu, Yunqiang (zhuyq@igsnrr.ac.cn); Zhu, Yunqiang (zhuyq@igsnrr.ac.cn), 1600, Science Press (23): : 1372 - 1381
  • [34] Standardization of Robot Instruction Elements Based on Conditional Random Fields and Word Embedding
    Hengsheng Wang
    Zhengang Zhang
    Jin Ren
    Tong Liu
    Journal of Harbin Institute of Technology(New Series), 2019, 26 (05) : 32 - 40
  • [35] MANDARIN STOPS CLASSIFICATION BASED ON RANDOM FOREST APPROACH
    Lin, Chi-Yueh
    Wang, Hsiao-Chuan
    2008 6TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2008, : 241 - 244
  • [36] Classification of cattle breeds based on the random forest approach
    Kasarda, Radovan
    Moravcikova, Nina
    Meszaros, Gabor
    Simcic, Mojca
    Zaborski, Daniel
    LIVESTOCK SCIENCE, 2023, 267
  • [37] Random Forest Classifier Based ECG Arrhythmia Classification
    Mahesh, V.
    Kandaswamy, A.
    Vimal, C.
    Sathish, B.
    INTERNATIONAL JOURNAL OF HEALTHCARE INFORMATION SYSTEMS AND INFORMATICS, 2010, 5 (02) : 1 - 10
  • [38] Random forest ensemble classification based fuzzy logic
    Ben Ayed, Abdelkarim
    Benhammouda, Marwa
    Ben Halima, Mohamed
    Alimi, Adel M.
    NINTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2016), 2017, 10341
  • [39] A classification based on random forest for partial discharge sources
    Pu, Senlin
    Zhang, Huajun
    Mao, Cuimin
    Yang, Guang
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 2307 - 2311
  • [40] Discriminative Word Alignment with Conditional Random Fields
    Blunsom, Phil
    Cohn, Trevor
    COLING/ACL 2006, VOLS 1 AND 2, PROCEEDINGS OF THE CONFERENCE, 2006, : 65 - 72