Reuse of termino-ontological resources and text corpora for building a multilingual domain ontology: An application to Alzheimer's disease

Cited by: 23
Authors
Drame, Khadim [1]
Diallo, Gayo [1]
Delva, Fleur [1]
Dartigues, Jean Francois [1]
Mouillet, Evelyne [1]
Salamon, Roger [1]
Mougin, Fleur [1]
Affiliations
[1] Univ Bordeaux, ISPED, Ctr INSERM Epidemiol Biostat U897, F-33000 Bordeaux, France
Keywords
Ontology development; Alzheimer's disease; Ontological resource reuse; Term alignment; Parallel corpus; METHODOLOGY;
DOI
10.1016/j.jbi.2013.12.013
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203; 0835;
Abstract
Ontologies are useful tools for sharing and exchanging knowledge. However, ontology construction is complex and often time-consuming. In this paper, we present a method for building a bilingual domain ontology from textual and termino-ontological resources, intended for semantic annotation and information retrieval of textual documents. This method combines two approaches: ontology learning from texts and the reuse of existing terminological resources. It consists of four steps: (i) term extraction from domain-specific corpora (in French and English) using textual analysis tools, (ii) clustering of terms into concepts organized according to the UMLS Metathesaurus, (iii) ontology enrichment through the alignment of French and English terms using parallel corpora and the integration of new concepts, and (iv) refinement and validation of results by domain experts. These validated results are formalized into a domain ontology dedicated to Alzheimer's disease and related syndromes, which is available online (http://lesim.isped.u-bordeaux2.fr/SemBiP/ressources/ontoAD.owl). The ontology currently includes 5765 concepts linked by 7499 taxonomic relationships and 10,889 non-taxonomic relationships. Among these results, 439 concepts absent from the UMLS were created and 608 new synonymous French terms were added. The proposed method is sufficiently flexible to be applied to other domains. © 2013 Elsevier Inc. All rights reserved.
Pages: 171-182
Page count: 12
Related papers
1 in total
  • [1] Design of the formalized and integrated Alzheimer's Disease Ontology and its application in retrieving textual data via text mining
    Zhang, Bide
    Lage-Rupprecht, Vanessa
    Wegner, Philipp
    Sargsyan, Astghik
    Gebel, Stephan
    Jacobs, Marc
    Klein, Juergen
    Hofmann-Apitius, Martin
    Kodamullil, Alpha Tom
    [J]. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2023, 2023