Automatic Data Processing;
Medical Informatics/methods;
Reproducibility of Results;
Terminology as Topic;
SEMANTIC SIMILARITY;
ONTOLOGY;
D O I:
10.3233/978-1-61499-101-4-153
中图分类号:
R19 [保健组织与事业(卫生事业管理)];
学科分类号:
摘要:
Within the biomedical area over one hundred terminologies exist and are merged in the Unified Medical Language System Metathesaurus, which gives over 1 million concepts. When such huge terminological resources are available, the users must deal with them and specifically they must deal with irrelevant parts of these terminologies. We propose to exploit seed terms and semantic distance algorithms in order to customize the terminologies and to limit within them a semantically homogeneous space. An evaluation performed by a medical expert indicates that the proposed approach is relevant for the customization of terminologies and that the extracted terms are mostly relevant to the seeds. It also indicates that different algorithms provide with similar or identical results within a given terminology. The difference is due to the terminologies exploited. A special attention must be paid to the definition of optimal association between the semantic similarity algorithms and the thresholds specific to a given terminology.
机构:
Univ Penn Hlth Syst, Dept Rehabil Med, Philadelphia, PA 19104 USA
Univ Penn Hlth Syst, Inst Aging, Philadelphia, PA USA
Univ Penn Hlth Syst, Inst Med & Bioengn, Philadelphia, PA USAUniv Penn Hlth Syst, Dept Rehabil Med, Philadelphia, PA 19104 USA