Decisions in thesaurus construction and use

被引:7
|
作者
Losee, Robert M. [1 ]
机构
[1] Univ N Carolina, Chapel Hill, NC 27599 USA
关键词
thesaurus; ontology; evaluation; performance measurement; controlled vocabulary;
D O I
10.1016/j.ipm.2006.08.011
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A thesaurus and an ontology provide a set of structured terms, phrases, and metadata, often in a hierarchical arrangement, that may be used to index, search, and mine documents. We describe the decisions that should be made when including a term, deciding whether a term should be subdivided into its subclasses, or determining which of more than one set of possible subclasses should be used. Based on retrospective measurements or estimates of future performance when using thesaurus terms in document ordering, decisions are made so as to maximize performance. These decisions may be used in the automatic construction of a thesaurus. The evaluation of an existing thesaurus is described, consistent with the decision criteria developed here. These kinds of user-focused decision-theoretic techniques may be applied to other hierarchical applications, such as faceted classification systems used in information architecture or the use of hierarchical terms in "breadcrumb navigation". (c) 2006 Elsevier Ltd. All rights reserved.
引用
收藏
页码:958 / 968
页数:11
相关论文
共 50 条