Semantic transference for enriching multilingual biomedical knowledge resources

Cited by: 3
Authors
Perez, Maria [1]
Berlanga, Rafael [1]
Affiliations
[1] Univ Jaume 1, Dept Comp Languages & Syst, Avda Vicent Sos Baynat S-N, Castellon de La Plana 12071, Spain
Keywords
Semantic transference; Multilingual biomedical knowledge resources; Term alignment; WORD ALIGNMENT; ONTOLOGY; UMLS; WEB;
DOI
10.1016/j.jbi.2015.08.026
Chinese Library Classification number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Biomedical knowledge resources (KRs) are mainly expressed in English, and many applications using them suffer from the scarcity of knowledge in non-English languages. The goal of the present work is to make the most of existing multilingual biomedical KR lexicons to enrich their non-English counterparts. We propose to combine different automatic methods to generate pair-wise language alignments. More specifically, we use two well-known translation methods (GIZA++ and Moses), and we propose a new ad hoc method specially devised for multilingual KRs. The resulting alignments are then used to transfer semantics between KRs across their languages. Transference quality is ensured by checking the semantic coherence of the generated alignments. Experiments have been carried out over the Spanish, French and German UMLS Metathesaurus counterparts. As a result, the enriched Spanish KR can grow up to 1,514,217 concepts (originally 286,659), the French KR up to 1,104,968 concepts (originally 83,119), and the German KR up to 1,136,020 concepts (originally 86,842). © 2015 Elsevier Inc. All rights reserved.
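The transference step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how semantic coherence is checked, so the rule used here (every concept transferred to a target term must share one semantic type) is an assumption, as are the function name `transfer_concepts`, the `min_score` threshold, the placeholder concept IDs, and the toy data. The alignments are taken as already computed, for example by a tool such as GIZA++ or Moses, and no such tool is called here.

```python
from collections import defaultdict


def transfer_concepts(source_lexicon, alignments, semantic_type, min_score=0.5):
    """Transfer concept IDs from source terms to aligned target terms.

    source_lexicon: dict mapping a source-language term to a set of concept IDs
    alignments:     iterable of (source_term, target_term, score) tuples
    semantic_type:  dict mapping a concept ID to a coarse semantic type label
    min_score:      hypothetical cut-off below which an alignment is ignored
    """
    # Collect every concept reachable from a target term through an alignment
    candidates = defaultdict(set)
    for src, tgt, score in alignments:
        if score >= min_score:
            candidates[tgt] |= source_lexicon.get(src, set())

    enriched = {}
    for tgt, concepts in candidates.items():
        types = {semantic_type[c] for c in concepts if c in semantic_type}
        # Assumed coherence rule: keep the target term only when all
        # transferred concepts agree on a single semantic type
        if len(types) == 1:
            enriched[tgt] = concepts
    return enriched


# Toy data with placeholder concept IDs (not real UMLS identifiers)
english_lexicon = {
    "heart attack": {"C001"},
    "myocardial infarction": {"C001"},
    "cold": {"C002", "C003"},
}
semantic_type = {"C001": "Disease", "C002": "Disease", "C003": "Phenomenon"}
alignments = [
    ("heart attack", "infarto de miocardio", 0.9),
    ("myocardial infarction", "infarto de miocardio", 0.8),
    ("cold", "frío", 0.7),       # ambiguous source term, mixed semantic types
    ("cold", "resfriado", 0.3),  # below the score threshold
]

enriched = transfer_concepts(english_lexicon, alignments, semantic_type)
print(enriched)
```

In this toy run only "infarto de miocardio" receives a concept: "frío" is rejected because its candidate concepts mix two semantic types, and "resfriado" is dropped by the score threshold.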
Pages: 1-10
Number of pages: 10