A SENSITIVITY ANALYSIS OF A PROBABILISTIC INFORMATION-RETRIEVAL SYSTEM

Cited by: 0
Author
THOMPSON, P [1]
Affiliation
[1] DREXEL UNIV, COLL INFORMAT STUDIES, PHILADELPHIA, PA 19104
DOI
10.1002/(SICI)1097-4571(199007)41:5<348::AID-ASI6>3.0.CO;2-K
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Results of a set of exploratory simulations to test the effects of errors in estimation of individual term probabilities on the performance of a probabilistic information retrieval system are presented. Searches were executed with various levels of term error on a test collection of probabilistically indexed information sources. The amount of error in the final probability of relevance used to rank sources introduced by these errors was analytically determined. The first simulation analyzed simulated rankings obtained by simulating probabilities of relevance according to the error distribution; in the second simulation measures of rank correlation between simulated rankings and the system's ranking were calculated; in the third, the effect of the error on retrieval performance using the measure expected search length was determined. It was found that substantial error was introduced into final probabilities of relevance, but that for low levels of term error the impact on ranking and retrieval performance was moderate, while even with high levels the actual ranking performed significantly better than a random ranking of retrieved sources. © 1990 John Wiley & Sons, Inc.
Pages: 348-358
Number of pages: 11