Citation-Key:
Fuhr:02a
Title:
XML Information Retrieval and Information Extraction
Author(s):
Norbert Fuhr
Editor(s):
F. Franke
G. Nakhaeizadeh
I. Renz
Publisher:
Physica Verlag
In:
Text Mining. Theoretical Aspects and Applications
Page(s):
21--32
Year:
2003

Abstract:
We present a new query language for information retrieval in XML documents and discuss its combination with information extraction methods. XIRQL is an XML query language which implements IR-related features such as weighting and ranking, relevance-oriented search, datatypes with vague predicates, and structural relativism. For information extracted from texts, XIRQL can rank records based on uncertainty weights, and single conditions may be evaluated using vague predicates for fact retrieval. When IE is used for automatic XML markup of plain texts, XIRQL is able to consider uncertainty weights resulting from this process, and the markup leads to increased precision of text searches.

BibTeX entry

Fulltext as PDF