G. Ramirez, T. Westerveld, and A.P. de Vries

Structural Features in Content Oriented XML Retrieval

ABSTRACT

The structural features of XML components are an additional source of information that should be exploited in content-oriented retrieval tasks on this type of document. This paper explores three structural features of the INEX collection that could be used in content-oriented search. We analyse the gain that this knowledge could bring to the performance of an information retrieval system, and present a first approach to extracting this structural information from a relevance feedback process for use as priors in a language modelling framework.