Universität Duisburg-Essen
Startseite Arbeitsgruppe Informationsysteme

Applying the Divergence From Randomness Approach for Content-Only Search in XML Documents

Zitationsschlüssel:
Abolhassani/Fuhr:04
Titel:
Applying the Divergence From Randomness Approach for Content-Only Search in XML Documents
Autor(en):
M. Abolhassani
N. Fuhr
In:
Zitationsschlüssel:
ECIR:04
Titel:
26th European Conference on Information Retrieval Research (ECIR 2004)
Herausgeber:
Verlag:
Springer
In:
26th European Conference on Information Retrieval Research (ECIR 2004)
Jahr:
2004

BibTeX-Eintrag

Jahr:
2004

Zusammenfassung:
Content-only retrieval of XML documents deals with the problem of locating the smallest XML elements that satisfy the query. In this paper, we investigate the application of a specific language model for this task, namely Amati's approach of divergence from randomness. First, we investigate different ways for applying this model without modification by redefining the concept of an (atomic) document for the XML setting. However, this approach yields a retrieval quality lower than the best method known before. We improved the retrieval quality through extending the basic model by an additional factor that refers to the hierarchical structure of XML documents.

BibTeX-Eintrag

Volltext als PDF