Citation key:
Ernst/Fuhr:07
Title:
Retrieval in text collections with historic spelling using linguistic and spelling variants
Author(s):
Andrea Ernst-Gerlach
Norbert Fuhr
In:
JCDL
In:
Citation key:
JCDL:07
Title:
ACM/IEEE Joint Conference on Digital Libraries, JCDL 2007, Vancouver, BC, Canada, June 18-23, 2007, Proceedings
Editor(s):
Edie M. Rasmussen
Ray R. Larson
Elaine Toms
Shigeo Sugimoto
Publisher:
ACM
In:
JCDL
Year:
2007

Page(s):
333-341
Year:
2007

Abstract:
We present a new approach for the retrieval of texts with non-standard spelling, which is important for historic texts, e.g. in English or German. In this paper, we describe the overall architecture of our system, followed by its evaluation. Given a search term as a lemma, we use a dictionary of contemporary German to find all inflected and derived forms of the lemma. Then we apply transformation rules (derived from training data) to generate historic spelling variants. For the evaluation, we assess the resulting retrieval quality. The experimental results show that we can substantially improve the retrieval quality for historic collections.
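
The two-stage query expansion described in the abstract (lemma to contemporary word forms, then word forms to historic spelling variants) might look roughly like the following minimal sketch. Everything in it is an assumption made for illustration: the tiny `LEXICON`, the two rewrite rules in `RULES`, and the function names are hypothetical and are not taken from the paper, whose rules are derived from training data and whose dictionary covers contemporary German as a whole.

```python
import re

# Hypothetical toy lexicon: lemma -> contemporary inflected/derived forms.
# A real system would query a full dictionary of contemporary German.
LEXICON = {
    "teil": ["teil", "teile", "teilen", "teils"],
}

# Hypothetical rewrite rules (modern pattern -> historic replacement).
# These are illustrative only; they are not the rules learned in the paper.
RULES = [
    (r"^t", "th"),  # word-initial "t" -> "th", e.g. "teil" -> "theil"
    (r"ei", "ey"),  # "ei" -> "ey", e.g. "teil" -> "teyl"
]


def expand_lemma(lemma):
    """Return the known contemporary forms of a lemma (or the lemma itself)."""
    return LEXICON.get(lemma, [lemma])


def historic_variants(word):
    """Apply each rule to every variant found so far, so rules can combine."""
    variants = {word}
    for pattern, replacement in RULES:
        for variant in list(variants):
            variants.add(re.sub(pattern, replacement, variant))
    return variants


def expand_query(lemma):
    """Expand a lemma into all contemporary forms plus their spelling variants."""
    terms = set()
    for form in expand_lemma(lemma):
        terms |= historic_variants(form)
    return sorted(terms)


if __name__ == "__main__":
    print(expand_query("teil"))
```

The expanded term list would then be passed to the retrieval engine as a disjunctive query. Keeping the original modern form in the variant set is a design choice in this sketch, so that texts with contemporary spelling are still matched.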
