Document Language Models, Query Models, and Risk Minimization for Information Retrieval
نویسندگان
چکیده
منابع مشابه
Why Language Models and Inverse Document Frequency for Information Retrieval?
The issue of term weighting has been traditionally addressed in a heuristic way through TF.IDF. TF.IDF is a term weighting measure which has been developed as a heuristic. This measure can be seen as an information theoretical approach that adds all the information contained in a document set. Statistical language models have been developed as a new form of automatically incorporating term freq...
متن کاملInter-document Similarities, Language Models, and Ad Hoc Information Retrieval
Search engines have become a crucial tool for finding information in repositories containing large amounts of textual data in unstructured form (e.g., the Web). However, the task of ad hoc information retrieval, that is, finding documents within a corpus that are relevant to an information need specified using a query, remains a hard challenge. The language modeling approach to information retr...
متن کاملLanguage Models and Structured Document Retrieval
We discuss possibilities for the use of language models in structured document retrieval. We use a tree-based generative language model for ranking documents and components. Nodes in the tree correspond to document components such as titles, sections, and paragraphs. At each node in the document tree, there is a language model. The language model for a leaf node is estimated directly from the t...
متن کاملSpoken Document Retrieval Using Neighboring Documents and Extended Language Models for Query Likelihood Model
This paper proposes several approaches for NTCIR-12 SpokenQuery & Doc-2[1]. Our methods are based on the query likelihood model which is one of the probabilisrtic language models choosing Dirichlet smoothing. We try to improve the performance by using extended language models. First, this paper develops and uses the language model obtained from related research papers. Second, this paper propos...
متن کاملStatistical Language Models for Information Retrieval
Dependency-based methods for syntactic parsing have become increasingly popular in natural language processing in recent years. This book gives a thorough introduction to the methods that are most widely used today. After an introduction to dependency grammar and dependency parsing, followed by a formal characterization of the dependency parsing problem, the book surveys the three major classes...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: ACM SIGIR Forum
سال: 2017
ISSN: 0163-5840
DOI: 10.1145/3130348.3130375