Anytime Ranking on Document-Ordered Indexes

نویسندگان

چکیده

Inverted indexes continue to be a mainstay of text search engines, allowing efficient querying large document collections. While there are number possible organizations, document-ordered the most common, since they amenable various query types, support index updates, and allow for dynamic pruning operations. One disadvantage with is that high-scoring documents can distributed across identifier space, meaning traversal algorithms terminate early might put effectiveness at risk. The alternative impact-ordered indexes, which primarily top- disjunctions but also anytime processing, where terminated any time, quality improving as processing latency increases. Anytime used effectively reduce high-percentile tail essential operational scenarios in service level agreement (SLA) imposes response time requirements. In this work, we show how organized such queried an fashion, enabling strict control effective termination. Our experiments topical segments selected by simple score estimator outperforms existing algorithms, allows runtimes accurately limited comply SLA

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Access-Ordered Indexes

Search engines are an essential tool for modern life. We use them to discover new information on diverse topics and to locate a wide range of resources. The search process in all practical search engines is supported by an inverted index structure that stores all search terms and their locations within the searchable document collection. Inverted indexes are highly optimised, and significant wo...

متن کامل

XML Document Indexes: A Classification

Because choosing the most efficient query execution plan relies on indexing techniques, such techniques play an important role in developing query processors. In the Web context, they’re even more crucial, as XML documents are massively used and frequently queried. Given that XML documents are semistructured, however, general query processing techniques — such as those for relational or objecto...

متن کامل

Re-ranking model based on document clusters

In this paper, we describe a model of information retrieval system that is based on a document reranking method using document clusters. In the ®rst step, we retrieve documents based on the inverted®le method. Next, we analyze the retrieved documents using document clusters, and re-rank them. In this step, we use static clusters and dynamic cluster view. Consequently, we can produce clusters th...

متن کامل

A Complete Survey on Web Document Ranking

Today, web plays a critical role in human life and also simplifies the same to a great extent. However, due to the towering increase in the number of web pages, the challenge of providing quality and relevant information to the users also needs to be addressed. Thus, search engines need to implement such algorithms which spans the pages as per user's interest and satisfaction and rank them...

متن کامل

Latent Document Re-Ranking

The problem of re-ranking initial retrieval results exploring the intrinsic structure of documents is widely researched in information retrieval (IR) and has attracted a considerable amount of time and study. However, one of the drawbacks is that those algorithms treat queries and documents separately. Furthermore, most of the approaches are predominantly built upon graph-based methods, which m...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: ACM Transactions on Information Systems

سال: 2021

ISSN: ['1558-1152', '1558-2868', '1046-8188', '0734-2047']

DOI: https://doi.org/10.1145/3467890