TermitUp: Generation and enrichment of linked terminologies

被引:2
|
作者
Martin-Chozas, Patricia [1 ]
Vazquez-Flores, Karen [1 ]
Calleja, Pablo [1 ]
Montiel-Ponsoda, Elena [1 ]
Rodriguez-Doncel, Victor [1 ]
机构
[1] Univ Politecn Madrid, Ontol Engn Grp, Madrid, Spain
基金
欧盟地平线“2020”;
关键词
Terminology generation; terminology enrichment; linguistic linked data; Multilingualism; THESAURUS; INFORMATION; WEB;
D O I
10.3233/SW-222885
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Domain-specific terminologies play a central role in many language technology solutions. Substantial manual effort is still involved in the creation of such resources, and many of them are published in proprietary formats that cannot be easily reused in other applications. Automatic term extraction tools help alleviate this cumbersome task. However, their results are usually in the form of plain lists of terms or as unstructured data with limited linguistic information. Initiatives such as the Linguistic Linked Open Data cloud (LLOD) foster the publication of language resources in open structured formats, specifically RDF, and their linking to other resources on the Web of Data. In order to leverage the wealth of linguistic data in the LLOD and speed up the creation of linked terminological resources, we propose TermitUp, a service that generates enriched domain specific terminologies directly from corpora, and publishes them in open and structured formats. TermitUp is composed of five modules performing terminology extraction, terminology post-processing, terminology enrichment, term relation validation and RDF publication. As part of the pipeline implemented by this service, existing resources in the LLOD are linked with the resulting terminologies, contributing in this way to the population of the LLOD cloud. TermitUp has been used in the framework of European projects tackling different fields, such as the legal domain, with promising results. Different alternatives on how to model enriched terminologies are considered and good practices illustrated with examples are proposed.
引用
收藏
页码:967 / 986
页数:20
相关论文
共 50 条
  • [41] Gatica: Linked Sensed Data Enrichment and Analytics Middleware for IoT Gateways
    Qanbari, Soheil
    Behinaein, Negar
    Rahimzadeh, Rabee
    Dustdar, Schahram
    2015 3RD INTERNATIONAL CONFERENCE ON FUTURE INTERNET OF THINGS AND CLOUD (FICLOUD) AND INTERNATIONAL CONFERENCE ON OPEN AND BIG (OBD), 2015, : 38 - 43
  • [42] LogMap plus : Relational data enrichment and linked data resources matching
    Zitnik, Slavko
    Bajec, Marko
    Lavbic, Dejan
    2017 11TH INTERNATIONAL CONFERENCE ON RESEARCH CHALLENGES IN INFORMATION SCIENCE (RCIS), 2017, : 267 - 275
  • [43] Enrichment and Ranking of the YouTube Tag Space and Integration with the Linked Data Cloud
    Choudhury, Smitashree
    Breslin, John G.
    Passant, Alexandre
    SEMANTIC WEB - ISWC 2009, PROCEEDINGS, 2009, 5823 : 747 - 762
  • [44] The gaseous phase enrichment techniques in hydride generation (review)
    Guo, XM
    Guo, XW
    Huang, BL
    SPECTROSCOPY AND SPECTRAL ANALYSIS, 2000, 20 (04) : 533 - 536
  • [45] The Challenges of Linked Open Data Semantic Enrichment, Discovery, and Dissemination (GRID)
    Akatkin, Yu.
    Yasinovskaya, E.
    Bich, M.
    Shilin, A.
    PHYSICS OF PARTICLES AND NUCLEI, 2024, 55 (03) : 538 - 540
  • [46] Improving Bibliographic Search through Dataset Enrichment Using Linked Data
    Zarrinkalam, Fattane
    Kahani, Mohsen
    2011 1ST INTERNATIONAL ECONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2011, : 254 - 259
  • [47] Deaths linked to third-generation contraceptives
    Coney, S
    LANCET, 1999, 353 (9150): : 389 - 389
  • [48] Linked Open Data Driven Game Generation
    Warren, Rob
    Champion, Erik
    SEMANTIC WEB - ISWC 2014, PT II, 2014, 8797 : 358 - 373
  • [49] Influence of linked data in the knowledge generation and management
    Avila Barrientos, Eder
    E-CIENCIAS DE LA INFORMACION, 2021, 11 (01):
  • [50] Linked Data Generation from Digital Libraries
    Dimou, Anastasia
    Heyvaert, Pieter
    Demeester, Ben
    DIGITAL LIBRARIES FOR OPEN KNOWLEDGE, TPDL 2018, 2018, 11057 : 389 - 389