AN APPROACH TO THE AUTOMATIC CONSTRUCTION OF GLOBAL THESAURI

Cited by: 64
Author
CROUCH, CJ
Affiliation
[1] Department of Computer Science, University of Minnesota, Duluth
DOI
10.1016/0306-4573(90)90106-C
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
The benefits of a well-constructed thesaurus to an information retrieval system have long been recognized by both researchers and practitioners in the field. Previous experiments have investigated the construction of thesauri by manual, semiautomatic, and automatic means. Automatic thesaurus generation in particular has proven to be an especially difficult problem. This paper examines both early and current approaches to automatic thesaurus construction and describes an approach to the automatic generation of global thesauri based on the term discrimination value model of Salton, Yang, and Yu and on an appropriate clustering algorithm. This method has been implemented and applied to two document collections. Preliminary results indicate that this method, which produces improvements in retrieval performance in excess of 10 and 15 percent in the test collections, is viable and worthy of continued investigation. © 1990.
Pages: 629-640
Page count: 12