Citation-Key:
Fuhr/Huether:89
Title:
Optimum Probability Estimation from Empirical Distributions
Author(s):
N. Fuhr
H. Hüther
Journal:
Information Processing and Management
Volume:
25
Number:
5
Page(s):
493--507
Year:
1989

Abstract:
Probability estimation is important for the application of probabilistic models as well as for any evaluation in IR. We discuss the interdependencies between parameter estimation and certain properties of probabilistic models: dependence assumptions, binary vs.\ non-binary features, estimation sample selection. Then we define an optimum estimate for binary features which can be applied to various typical estimation problems in IR. A method for computing this estimate using empirical data is described. Some experiments show the applicability of our method, whereas comparable approaches are partially based on false assumptions or yield biased estimates.
Classification(s):
H.3.3, G.3
Subject descriptor(s):
probability estimation, Retrieval models
Keywords:
Probabilistic retrieval

BibTeX entry

Fulltext as PDF