ON MODELING INFORMATION-RETRIEVAL WITH PROBABILISTIC INFERENCE

被引:80
|
作者
WONG, SKM [1 ]
YAO, YY [1 ]
机构
[1] LAKEHEAD UNIV, DEPT MATH SCI, THUNDER BAY, ON P7B 5E1, CANADA
关键词
BOOLEAN MODEL; DOCUMENT REPRESENTATION; FUZZY SET MODEL; MAXIMUM AND MINIMUM ENTROPY PRINCIPLES; PROBABILISTIC INFERENCE; PROBABILISTIC MODEL; RELEVANCE; SIMILARITY MEASURES; SUBJECTIVE PROBABILITY; UNCERTAIN IMPLICATIONS; VECTOR SPACE MODEL;
D O I
10.1145/195705.195713
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article examines and extends the logical models of information retrieval in the context of probability theory. The fundamental notions of term weights and relevance are given probabilistic interpretations. A unified framework is developed for modeling the retrieval process with probabilistic inference. This new approach provides a common conceptual and mathematical basis for many retrieval models, such as the Boolean, fuzzy set, vector space, and conventional probabilistic models. Within this framework, the underlying assumptions employed by each model are identified, and the inherent relationships between these models are analyzed. Although this article is mainly a theoretical analysis of probabilistic inference for information retrieval, practical methods for estimating the required probabilisties are provided by simple examples.
引用
收藏
页码:38 / 68
页数:31
相关论文
共 50 条