Citation key:
Fuhr:02a
Title:
XML Information Retrieval and Information Extraction
Author(s):
Norbert Fuhr
Editor(s):
J. Franke
G. Nakhaeizadeh
I. Renz
Publisher:
Physica Verlag
In:
Text Mining. Theoretical Aspects and Applications
Page(s):
21–32
Year:
2003

Abstract:
We present a new query language for information retrieval in XML documents and discuss its combination with information extraction methods. XIRQL is an XML query language which implements IR-related features such as weighting and ranking, relevance-oriented search, datatypes with vague predicates, and structural relativism. For information extracted from texts, XIRQL can rank records based on uncertainty weights, and single conditions may be evaluated using vague predicates for fact retrieval. When IE is used for automatic XML markup of plain texts, XIRQL is able to consider uncertainty weights resulting from this process, and the markup leads to increased precision of text searches.
