TermitUp: Generation and enrichment of linked terminologies

被引:2
|
作者
Martin-Chozas, Patricia [1 ]
Vazquez-Flores, Karen [1 ]
Calleja, Pablo [1 ]
Montiel-Ponsoda, Elena [1 ]
Rodriguez-Doncel, Victor [1 ]
机构
[1] Univ Politecn Madrid, Ontol Engn Grp, Madrid, Spain
基金
欧盟地平线“2020”;
关键词
Terminology generation; terminology enrichment; linguistic linked data; Multilingualism; THESAURUS; INFORMATION; WEB;
D O I
10.3233/SW-222885
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Domain-specific terminologies play a central role in many language technology solutions. Substantial manual effort is still involved in the creation of such resources, and many of them are published in proprietary formats that cannot be easily reused in other applications. Automatic term extraction tools help alleviate this cumbersome task. However, their results are usually in the form of plain lists of terms or as unstructured data with limited linguistic information. Initiatives such as the Linguistic Linked Open Data cloud (LLOD) foster the publication of language resources in open structured formats, specifically RDF, and their linking to other resources on the Web of Data. In order to leverage the wealth of linguistic data in the LLOD and speed up the creation of linked terminological resources, we propose TermitUp, a service that generates enriched domain specific terminologies directly from corpora, and publishes them in open and structured formats. TermitUp is composed of five modules performing terminology extraction, terminology post-processing, terminology enrichment, term relation validation and RDF publication. As part of the pipeline implemented by this service, existing resources in the LLOD are linked with the resulting terminologies, contributing in this way to the population of the LLOD cloud. TermitUp has been used in the framework of European projects tackling different fields, such as the legal domain, with promising results. Different alternatives on how to model enriched terminologies are considered and good practices illustrated with examples are proposed.
引用
收藏
页码:967 / 986
页数:20
相关论文
共 50 条
  • [1] Quality assurance and enrichment of biological and biomedical ontologies and terminologies
    Ankur Agrawal
    Licong Cui
    BMC Medical Informatics and Decision Making, 20
  • [2] Quality assurance and enrichment of biological and biomedical ontologies and terminologies
    Agrawal, Ankur
    Cui, Licong
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2020, 20 (Suppl 10)
  • [3] BioPortal as a dataset of linked biomedical ontologies and terminologies in RDF
    Salvadores, Manuel
    Alexander, Paul R.
    Musen, Mark A.
    Noy, Natalya F.
    SEMANTIC WEB, 2013, 4 (03) : 277 - 284
  • [4] Towards a new generation of terminologies and coding systems
    Mori, AR
    MEDICAL INFORMATICS EUROPE '96: HUMAN FACETS IN INFORMATION TECHNOLOGIES, 1996, 34 : 208 - 212
  • [5] Neuroanatomical term generation and comparison between two terminologies
    Prashanti R. Srinivas
    Daniel Gusfield
    Oliver Mason
    Michael Gertz
    Michael Hogarth
    James Stone
    Edward G. Jones
    Fredric A. Gorin
    Neuroinformatics, 2003, 1 : 177 - 192
  • [6] Neuroanatomical term generation and comparison between two terminologies
    Srinivas, PR
    Gusfield, D
    Mason, O
    Gertz, M
    Hogarth, M
    Stone, J
    Jones, EG
    Gorin, FA
    NEUROINFORMATICS, 2003, 1 (02) : 177 - 192
  • [7] Special supplement issue on quality assurance and enrichment of biological and biomedical ontologies and terminologies
    Cui, Licong
    Agrawal, Ankur
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 23 (SUPPL 1)
  • [8] Semantic Enrichment of Linked Archival Materials
    Chen, Shu-Jiun
    KNOWLEDGE ORGANIZATION, 2019, 46 (07): : 530 - 547
  • [9] Review of Approaches for Linked Data Ontology Enrichment
    Subhashree, S.
    Irny, Rajeev
    Kumar, P. Sreenivasa
    DISTRIBUTED COMPUTING AND INTERNET TECHNOLOGY (ICDCIT 2018), 2018, 10722 : 27 - 49
  • [10] Development Issues on Linked Data Weblog Enrichment
    Ruiz-Rube, Ivan
    Cornejo, Carlos M.
    Dodero, Juan Manuel
    Garcia, Vicente M.
    METADATA AND SEMANTIC RESEARCH, 2010, 108 : 235 - +