Validating multilingual hybrid automatic term extraction for search engine optimisation: the use case of EBM-GUIDELINES

Cited by: 0
Authors
Terryn, Ayla Rigouts [1]
Hoste, Veronique [1]
Buysschaert, Joost [1]
Vander Stichele, Robert [1]
Van Campen, Elise [2]
Lefever, Els [1]
Affiliations
[1] Univ Ghent, Ghent, Belgium
[2] Ebpracticenet, Leuven, Belgium
Source
Keywords
automatic terminology extraction; ATR; terminology
DOI
Not available
Chinese Library Classification (CLC) number
H0 [Linguistics]
Subject classification codes
030303; 0501; 050102
Abstract
Tools that automatically extract terms and their equivalents in other languages from parallel corpora can contribute to multilingual professional communication in more than one way. By means of a use case with data from a medical website with point-of-care evidence summaries (Ebpracticenet), we illustrate how hybrid multilingual automatic term extraction from parallel corpora works and how it can be used in a practical application such as search engine optimisation. The original aim was to use the results of the extraction to improve the recall of a search engine by allowing automated multilingual searches. Two additional possible applications were found while considering the data: searching via related forms and searching via strongly semantically related words. The second stage of this research was to find the most suitable format for the required manual validation of the raw extraction results and to compare the validation process when performed by a domain expert with the process when performed by a terminologist.
Pages: 93-108
Page count: 16