Citation-Key:
Fuhr:99a
Title:
Probabilistic Datalog: Implementing Logical Information Retrieval for Advanced Applications
Author(s):
Norbert Fuhr
Journal:
Journal of the American Society for Information Science
Volume:
51
Number:
2
Page(s):
95--110
Year:
2000

Abstract:
In the logical approach to information retrieval (IR), retrieval is considered as uncertain inference. Whereas classical IR models are based on propositional logic, we combine Datalog (function-free Horn clause predicate logic) with probability theory. Therefore, probabilistic weights may be attached to both facts and rules. The underlying semantics extends the well-founded semantics of modularly stratified Datalog to a possible worlds semantics. By using default independence assumptions with explicit specification of disjoint events, the inference process always yields point probabilities. We describe an evaluation method and present an implementation. This approach allows for easy formulation of specific retrieval models for arbitrary applications, and classical probabilistic IR models can be implemented by specifying the appropriate rules. In comparison to other approaches, the possibility of recursive rules allows for more powerful inferences, and predicate logic gives the expressiveness required for multimedia retrieval. Furthermore, probabilistic Datalog can be used as a query language for integrated information retrieval and database systems.

BibTeX entry

Fulltext as PS

Fulltext as PDF