Linking Datasets Using Semantic Textual Similarity

被引:12
|
作者
McCrae, John P. [1 ]
Buitelaar, Paul [1 ]
机构
[1] Natl Univ Ireland Galway, Insight Ctr Data Analyt, Galway H91 A06C, Ireland
基金
欧盟地平线“2020”;
关键词
Linked data; link discovery; ontology alignment; semantic textual similarity; structural similarity; NLP architectures;
D O I
10.2478/cait-2018-0010
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Linked data has been widely recognized as an important paradigm for representing data and one of the most important aspects of supporting its use is discovery of links between datasets. For many datasets, there is a significant amount of textual information in the form of labels, descriptions and documentation about the elements of the dataset and the fundament of a precise linking is in the application of semantic textual similarity to link these datasets. However, most linking tools so far rely on only simple string similarity metrics such as Jaccard scores. We present an evaluation of some metrics that have performed well in recent semantic textual similarity evaluations and apply these to linking existing datasets.
引用
收藏
页码:109 / 123
页数:15
相关论文
共 50 条
  • [31] Collective Human Opinions in Semantic Textual Similarity
    Wang, Yuxia
    Tao, Shimin
    Xie, Ning
    Yang, Hao
    Baldwin, Timothy
    Verspoor, Karin
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2023, 11 : 997 - 1013
  • [32] Czech news dataset for semantic textual similarity
    Sido, Jakub
    Sejak, Michal
    Prazak, Ondrej
    Konopik, Miloslav
    Moravec, Vaclav
    LANGUAGE RESOURCES AND EVALUATION, 2024,
  • [33] Interpretable semantic textual similarity of sentences using alignment of chunks with classification and regression
    Majumder, Goutam
    Pakray, Partha
    Das, Ranjita
    Pinto, David
    APPLIED INTELLIGENCE, 2021, 51 (10) : 7322 - 7349
  • [34] Predicting Semantic Textual Similarity of Arabic Question Pairs using Deep Learning
    Einea, Omar
    Elnagar, Ashraf
    2019 IEEE/ACS 16TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA 2019), 2019,
  • [35] Interpretable semantic textual similarity of sentences using alignment of chunks with classification and regression
    Goutam Majumder
    Partha Pakray
    Ranjita Das
    David Pinto
    Applied Intelligence, 2021, 51 : 7322 - 7349
  • [36] Semantic textual similarity for modern standard and dialectal Arabic using transfer learning
    Sulaiman, Mansour Al
    Moussa, Abdullah M.
    Abdou, Sherif
    Elgibreen, Hebah
    Faisal, Mohammed
    Rashwan, Mohsen
    PLOS ONE, 2022, 17 (08):
  • [37] Evaluation of semantic similarity using vector space model based on textual corpus
    Hssina, Badr
    Bouikhalene, Belaid
    Merbouha, Abdelkrim
    2016 13TH INTERNATIONAL CONFERENCE ON COMPUTER GRAPHICS, IMAGING AND VISUALIZATION (CGIV), 2016, : 295 - 300
  • [38] Cross-Lingual Semantic Textual Similarity Modeling Using Neural Networks
    Li, Xia
    Chen, Minping
    Zeng, Zihang
    MACHINE TRANSLATION, CWMT 2018, 2019, 954 : 52 - 62
  • [39] Evaluating Question generation models using QA systems and Semantic Textual Similarity
    Shaheer, Safwan
    Hossain, Ishmam
    Sarna, Sudipta Nandi
    Mehedi, Md Humaion Kabir
    Rasel, Annajiat Alim
    2023 IEEE 13TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE, CCWC, 2023, : 431 - 435
  • [40] A Multi-Layer System for Semantic Textual Similarity
    Ngoc Phuoc An Vo
    Popescu, Octavian
    KDIR: PROCEEDINGS OF THE 8TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT - VOL. 1, 2016, : 56 - 67