Document Structure Analysis for the NTCIR-5 Patent Retrieval Task
نویسندگان
چکیده
This paper describes our system participated in the Document and Passage Retrieval Subtasks at the NTCIR-5 Patent Retrieval Task. The purpose of these subtasks was the invalidity search, in which a patent application including a target claim is used to search documents that can invalidate the demand in the claim. Our system is characterized by the structure analysis for both target claim and entire application. The target claim is segmented into components, each of which is used to produce an initial query. The structure of the application is used to enhance each query. The candidates of relevant documents are retrieved and ranked on a component-by-component basis. The final document list is obtained by integrating these document lists. All passages in each document are ranked according to the relevance to the target claim. We show the effectiveness of our system experimentally.
منابع مشابه
Overview of Patent Retrieval Task at NTCIR-5
In the Fifth NTCIR Workshop, we organized the Patent Retrieval Task and performed three subtasks; Document Retrieval, Passage Retrieval, and Classification. This paper describes the Document Retrieval Subtask and Passage Retrieval Subtask, both of which were intended for patent-to-patent invalidity search task. We show the evaluation results of the groups participating in those subtasks.
متن کاملPOSTECH at NTCIR-5 Patent Retrieval: Smoothing Experiments in a Language Modeling Approach to Patent Retrieval
This report describes the experimental results of our participation at the Document Retrieval Subtask of NTCIR-5 Patent Retrieval Task. Unlike newspaper articles which belong to the main document type handled in previous information retrieval experiments, patent documents have many different characteristics in terms of length, technicality, structureness, etc. Among these, we focus on the lengt...
متن کاملQuery Terms Extraction from Patent Document for Invalidity Search
This paper describes our patent retrieval system participated in the NTCIR-5 Patent Retrieval Task, Document Retrieval Subtask. The main scope of our method is the appropriate query expansion to improve recall. We extracted query terms from the topic claim, and expanded query terms extracted from sentences explained in the patent document including the topic claim. The explanation sentences wer...
متن کاملA Patent Retrieval Method Using a Hierarchy of Clusters at TUT
To retrieve relevant documents from an enormous document collection, we usually utilize the similarity or distance measure between a query and the documents, or apply document clustering techniques to the document collection and partition it into relevant document groups. For patent retrieval, however, it is difficult to retrieve documents by using query terms only, because complex terminologie...
متن کاملDocument Structure Analysis in Associative Patent Retrieval
This paper describes our retrieval system participated in the Patent Retrieval Task at the Fourth NTCIR Workshop. The main task was an associative patent retrieval task, in which a patent application including a target claim is used to search documents that can invalidate the demand in the claim. Our system can be characterized by the structure analysis for both target claim and entire applicat...
متن کامل