EXTENDING THE DEWEY DECIMAL CLASSIFICATION VIA KEYWORD CLUSTERING - THE SCIENCE LIBRARY CATALOG PROJECT

被引:0
|
作者
ROSENBERG, JB [1 ]
BORGMAN, CL [1 ]
机构
[1] UNIV CALIF LOS ANGELES,GRAD SCH LIB & INFORMAT SCI,LOS ANGELES,CA 90024
来源
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Science Library Catalog is a direct-manipulation browsing-oriented online catalog intended for use by children. Our research goals are to construct a system that provides an innovative interface tailored to children's cognitive development, interests and capabilities, yet can be implemented by loading standard MARC records from extant library collections. We have addressed these goals by reorganizing the catalog databases of individual libraries based on the Dewey Decimal Classification, creating a radically different visualization of the bibliographic data from that of standard online catalogs. We report here on the clustering algorithms used to extend the Dewey class number assignments on a large database of more than 8200 bibliographic records from a catalog developed over a century-long period. By combining clustering algorithms with the assistance of a human editor, we reassigned 7076 records from classification numbers containing an average of 126 items each (maximum 1140 records in one classification) into 959 clusters averaging 7.4 items each, producing a manageable retrieval hierarchy. Our results suggest that considerable information retrieval improvements can be made on extant bibliographic data without the expense of re-cataloging and reclassifying established collections.
引用
收藏
页码:171 / 184
页数:14
相关论文
共 31 条