Merging XML Indices
نویسندگان
چکیده
Using separate indices for each element and merging their results has proven to be a feasible way of performing XML element retrieval; however, there has been little work on evaluating how the main method parameters affect the results. We study the effect of using different weighting models for computing rankings at the single index level and using different merging techniques for combining such rankings. Our main findings are that (i) there are large variations on retrieval effectiveness when choosing different techniques for weighting and merging, with performance gains up to 102%, and (ii) although there does not seem to be any best weighting model, some merging schemes perform clearly better than others.
منابع مشابه
A Structural Merging Algorithm for XML Documents
Document merging is essential to synchronizing several versions of a document concurrently edited by two or more users. Conventional methods for document merging are designed for (unstructured) text files, thereby the structures of XML documents, modeled by ordered trees, are not handled appropriately. While a few merging methods that can handle the structure of an XML document are being constr...
متن کاملXaver and the SearX-Engine: XML Retrieval in Real World Applications
In this paper we present the merging component of the MUMIS project which combines several xml-documents into one resulting xml-document. The domain of application of the MUMIS project is soccer, and each xml-document contains information about soccer games. The input documents for the merging component contain incomplete, often erroneous, and mutually inconsistent information on the same game....
متن کاملThe First Twente Data Management Workshop TDM ’ 04 on XML Databases and Information Retrieval
In this paper we present the merging component of the MUMIS project which combines several xml-documents into one resulting xml-document. The domain of application of the MUMIS project is soccer, and each xml-document contains information about soccer games. The input documents for the merging component contain incomplete, often erroneous, and mutually inconsistent information on the same game....
متن کاملA Unifying Framework for Merging and Evaluating XML Information
With the ever increasing connection between XML information systems over the Web, users are able to obtain integrated sources of XML information in a cooperative manner, such as developing an XML mediator schema or using eXtensible Stylesheet Language Transformation (XSLT). However, it is not trivial to evaluate the quality of such merged XML data, even when we have the knowledge of the involve...
متن کاملTowards Semantic-based RSS Merging
Merging information can be of key importance in several XML-based applications. For instance, merging the RSS news from different sources and providers can be beneficial for end-users (journalists, economists, etc.) in various scenarios. In this work, we address this issue and mainly explore the relatedness relationships between RSS entities/elements. To validate our approach, we also provide a...
متن کامل