Query Terms Extraction from Patent Document for Invalidity Search
نویسنده
چکیده
This paper describes our patent retrieval system participated in the NTCIR-5 Patent Retrieval Task, Document Retrieval Subtask. The main scope of our method is the appropriate query expansion to improve recall. We extracted query terms from the topic claim, and expanded query terms extracted from sentences explained in the patent document including the topic claim. The explanation sentences were extracted by the method based on pattern matching and by the method based on the longest common subsequence length.
منابع مشابه
Invalidity Patent Search System of NTT DATA
In this paper, we give an overview of our invalidity patent search system for NTCIR-4 PATENT. The system is based on the document retrieval technique and the new methods that are suitable for the invalidity search; the query term extraction based on characteristics of invention, the retrieval model using components of invention, the ranking using the term weighting based on category information...
متن کاملInvalidity Search for USPTO Patent Documents Using Different Patent Surrogates
This paper describes our work at the sixth NTCIR workshop on the subtask of invalidity search for patent retrieval. We compared different patent surrogates for their effectiveness on invalidity search. Our preliminary results show that the query by the Claims field plus PRF (pseudo relevance feedback) leads to the best results in terms of relevance degree A while the query by all free-text fiel...
متن کاملTREC Chemical IR Track 2009: A Distributed Dimensional Indexing Model for Chemical Patent Search
For the TREC-2009 Chemical IR Track, we explore development of a distributed information retrieval system based on a dimensional data model. The indexing model supports named entity identification and aggregation of term statistics at multiple levels of patent structure including individual words, sentences, claims, descriptions, abstracts, and titles. The system was deployed across 15 Amazon W...
متن کاملمدل جدیدی برای جستجوی عبارت بر اساس کمینه جابهجایی وزندار
Finding high-quality web pages is one of the most important tasks of search engines. The relevance between the documents found and the query searched depends on the user observation and increases the complexity of ranking algorithms. The other issue is that users often explore just the first 10 to 20 results while millions of pages related to a query may exist. So search engines have to use sui...
متن کاملروش جدید متنکاوی برای استخراج اطلاعات زمینه کاربر بهمنظور بهبود رتبهبندی نتایج موتور جستجو
Today, the importance of text processing and its usages is well known among researchers and students. The amount of textual, documental materials increase day by day. So we need useful ways to save them and retrieve information from these materials. For example, search engines such as Google, Yahoo, Bing and etc. need to read so many web documents and retrieve the most similar ones to the user ...
متن کامل