Universität Duisburg-Essen
Startseite Arbeitsgruppe Informationsysteme

CLASSIX

Classification and Intelligent Search on Information in XML


Projektzeitraum:
Vom 01. 02. 2002 bis zum 30. 09. 2006
Kontaktpersonen:
Beteiligte Personen:
Gesponsert von:
  • DFG
Teilnehmende Institutionen:

XML can be used for representing all kinds of documents in product cataloges, digital libraries and scientific data repositories, and across the Web. However, merely casting the documents into XML does not necessarily make their semantics explicit and more amenable for effective information searching. To fully leverage XML on a global scale, CLASSIX addresses the following issues:

  • Providing an easy-to-use yet powerful and efficient search language that combines concepts from the current XML pattern-matching languages, such as XPath and XQuery, with ontology-backed information-retrieval style search result ranking.
  • Extracting more semantics from existing document collections by constructing structural and ontological skeletons, e.g., in the form of DTDs or XML schemas that describe the data at a higher semantic level and can also facilitate new forms of indexing for efficiency.
  • Classifying existing documents according to a given thematic or personalized, hierarchical ontology to make searching more effective, e.g., exploit relevance feedback, and efficient, e.g., limit the search focus.

Publikationen

Ingo Frommholz (2008).
A Probabilistic Framework for Information Modelling and Retrieval Based on User Annotations on Digital Objects. PhD thesis

Norbert Fuhr; Mounia Lalmas; Andrew Trotman; Jaap Kamps (Hrsg.) (2008).
Focused access to XML documents: 6th International Workshop of the Initiative for the Evaluation of XML Retrieval (INEX 2007). Number in LNCS, Springer

Ingo Frommholz (2007).
Annotation-based Document Retrieval with Probabilistic Logics. In ECDL:07

Norbert Fuhr; Mounia Lalmas; Andrew Trotman (Hrsg.) (2007).
Comparative Evaluation of XML Information Retrieval Systems, 5th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2006. Number 4518 in LNCSSpringer, Heidelberg et al..

Ingo Frommholz; Norbert Fuhr (2006).
Probabilistic, Object-oriented Logics for Annotation-based Retrieval in Digital Libraries. In JCDL:06

Ingo Frommholz; Norbert Fuhr (2006).
Evaluation of Relevance and Knowledge Augmentation in Discussion Search. In ECDL:06

Norbert Fuhr; Mounia Lalmas; Saadia Malik; Gabriella Kazai (Hrsg.) (2006).
Advances in XML Information Retrieval and Evaluation: Fourth Workshop of the INitiative for the Evaluation of XML Retrieval (INEX 2005), Dagstuhl 28-30 November 2005, Lecture Notes in Computer Science. 3977, Springer-Verlag GmbH

Gudrun Fischer; Igor Jacy Lino Campista (2005).
A Template-Based Approach to Summarize XML Collections. In: Proceedings of LWA 2005, October 10-12, Saarbrcken, Germany

N. Fuhr; N. Goevert (2006).
Retrieval Quality vs. Effectiveness of Specificity-Oriented Search in XML Collections. Information Retrieval 9

Norbert Fuhr; Mounia Lalmas; Saadia Malik; Zoltan Szlavik (Hrsg.) (2005).
Advances in XML Information Retrieval: Third International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2004, Dagstuhl Castle, Germany, December 6-8, 2004, Revised Selected Papers. 3493, Springer-Verlag GmbH

Mohammad Abolhassani; Norbert Fuhr; Saadia Malik (2004).
HyREX at INEX 2003. In INEX:04

M. Abolhassani; N. Fuhr (2004).
Applying the Divergence From Randomness Approach for Content-Only Search in XML Documents. In ECIR:04

N. Fuhr; K. Großjohann (2004).
XIRQL: An XML Query Language Based on Information Retrieval Concepts. ACM Transactions on Information Systems 22

Norbert Fuhr; Mounia Lalmas; Saadia Malik (Hrsg.) (2004).
INitiative for the Evaluation of XML Retrieval (INEX). Proceedings of the Second INEX Workshop. Dagstuhl, Germany, December 15--17, 2003.

M. Theobald; C.-P. Klas (2004).
BINGO! and Daffodil: Personalized Exploration of Digital Libraries and Web Sources. In RIAO:04

M. Abolhassani; N. Fuhr; N. Gövert (2003).
Information Extraction and Automatic Markup for XML documents. In Blanken/etal:03,

Mohammad Abolhassani; Norbert Fuhr; Norbert Gövert (2003).
Applying The Divergence From Randomness Approach For Relevance-Oriented Search In XML Documents. Technischer Bericht, University of Duisburg-Essen

Norbert Fuhr; Norbert Gövert; Mohammad Abolhassani (2003).
Retrieval Quality vs.\ Effectiveness of Relevance-Oriented Search in XML Documents. Technischer Bericht, University of Duisburg-Essen

N. Fuhr; K. Großjohann; S. Kriewel (2003)
A Query Language and User Interface for XML Information Retrieval.

Norbert Gövert; Norbert Fuhr; Mohammad Abolhassani; Kai Großjohann (2003).
Content-oriented XML retrieval with HyREX. In INEX:03

Norbert Gövert; Gabriella Kazai; Norbert Fuhr; Mounia Lalmas (2003).
Evaluating the effectiveness of content-oriented XML retrieval. Technischer Bericht, University of Dortmund, Computer Science 6

Norbert Gövert; Gabriella Kazai (2003).
Overview of the INitiative for the Evaluation of XML retrieval (INEX) 2002. In INEX:03

Norbert Fuhr; Norbert Gövert; Gabriella Kazai; Mounia Lalmas (Hrsg.) (2003).
INitiative for the Evaluation of XML Retrieval (INEX). Proceedings of the First INEX Workshop. Dagstuhl, Germany, December 8--11, 2002. , ERCIM Workshop ProceedingsERCIM, Sophia Antipolis, France.

Gabriella Kazai; Mounia Lalmas; Norbert Fuhr; Norbert Gövert (2003).
A report on the first year of the INitiative for the Evaluation of XML Retrieval (INEX 02). Journal of the American Society for Information Science and Technology 54

G. Kazai; N. Gövert; M. Lalmas; N. Fuhr (2003)
The INEX evaluation initiative.

Mohammad Abolhassani; Norbert Fuhr; Norbert Gövert; Kai Großjohann (2002).
HyREX: Hypermedia Retrieval Engine for XML. Research Report , University of Dortmund, Department of Computer Science, Dortmund, Germany

Norbert Fuhr (2003).
XML Information Retrieval and Information Extraction. In: F. Franke; G. Nakhaeizadeh; I. Renz (Hrsg.): Text Mining. Theoretical Aspects and Applications

Norbert Fuhr; Norbert Gövert; Gabriella Kazai; Mounia Lalmas (2002).
INEX: INitiative for the Evaluation of XML Retrieval. In SIGIR/XML:02

Norbert Fuhr; Norbert Gövert; Kai Großjohann (2002).
HyREX: Hyper-media Retrieval Engine for XML. In SIGIR:02

Norbert Fuhr; Norbert Gövert (2002).
Index Compression vs. Retrieval Time of Inverted Files for XML Documents. In CIKM:02

Norbert Fuhr; Norbert Gövert (2002).
Index Compression vs. Retrieval Time of Inverted Files for XML Documents. Technical Report , University of Dortmund

N. Fuhr; K. Großjohann (2002).
XIRQL: An XML Query Language Based on Information Retrieval Concepts. (Submitted for publication)

N. Fuhr; G. Weikum (2002).
Classification and Intelligent Search on Information in XML. IEEE Data Engineering Bulletin 25(1)

K. Großjohann; N. Fuhr; D. Effing; S. Kriewel (2002).
Query Formulation and Result Visualization for XML Retrieval. In: Proceedings ACM SIGIR 2002 Workshop on XML and Information Retrieval, ACM

K. Großjohann; N. Fuhr; D. Effing; S. Kriewel (2002).
A User Interface for XML Document Retrieval. In: Informatik 2002


Vorträge

Mohammad Abolhassani; Norbert Fuhr; Saadia Malik (2003).
HyREX at INEX 2003. Talk at the INEX Workshop, Dagstuhl

M. Abolhassani; N. Fuhr (2004).
Applying the Divergence From Randomness Approach for Content-Only Search in XML Documents. Talk at the European Conference on Information Retrieval, Sunderland, U.K.

Norbert Fuhr (2004).
XML Information Retrieval - Achievements and Challenges. Talk at the Twente Data Mangement Workshop


Diplom-, Master- und Bachelorarbeiten

Semi-Automatische Inhaltsübersicht für XML-Kollektionen
Abgeschlossene Diplomarbeit

Verwandte Projekte

FOCUS
Focussed retrieval of structured documents
HyREX
Hyper-media Retrieval Engine for XML
INEX
Initiative for the Evaluation of XML retrieval

Projektreffen

14.-15. März 2002
Kick-off-Meeting in Dortmund