BioTextRetriever: A Tool to Retrieve Relevant Papers

نویسندگان

  • Célia Talma Gonçalves
  • Rui Camacho
  • Eugénio C. Oliveira
چکیده

Whenever new sequences of DNA or proteins have been decoded it is almost compulsory to look at similar sequences and papers describing those sequences in order to both collect relevant information concerning the function and activity of the new sequences and/or know what is known already about similar sequences that might be useful in the explanation of the function or activity of the newly discovered ones. In current web sites and data bases of sequences there are, usually, a set of curated paper references linked to each sequence. Those links are very useful since the papers describe useful information concerning the sequences. They are, therefore, a good starting point to look for relevant information related to a set of sequences. One way is to implement such approach is to do a blast with the new decoded sequences, and collect similar sequences. Then one looks at the papers linked with the similar sequences. Most often the number of retrieved papers is small and one has to search large data bases for relevant papers. In this paper we propose a process of generating a classifier based on the initially set of relevant papers. First we collect similar sequences using an alignment algorithm like Blast. We then use the enlarges set of papers to construct a classifier. Finally we use that classifier to automatically enlarge the set of relevant papers by searching the MEDLINE using the automatically constructed classifier. We have empirically evaluated our proposal and report very promising results.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

BioTextRetriever: yet another Information Retrieval system

It is of capital importance for every researcher to be aware of the work that has been done in his research area. However, finding “interesting/relevant” publications in the overwhelming amount of documents available in the Internet is quite difficult. We propose the use of Text Mining to address this information overload problem by automating the process of extracting relevant papers from very...

متن کامل

A review of 17 years of application of partnership care model on the consequences of chronic diseases: Describing and assessing the quality of the methodology of papers

Background: Regarding the widespread prevalence of chronic diseases, nurses need to understand the choices, priorities, and abilities of patients in reality, their communication, and the social context in order to play their professional role and responsibility. This review study was conducted with two purposes: determining the effect of partnership-care-model (PCM) on the outcomes of chronic d...

متن کامل

A brief review of plagiarism in medical scientific research papers [RETRACTED]

[THIS ARTICLE IS RETRACTED] Plagiarism refers to “adopting someone else’s words, work or ideas and passing them off as one’s own”. It is potentially considered as the most prevalent form of scientific dishonesty discovered in research papers. The present review aims to provide a thorough account of plagiarism to build awareness about all dimensions of plagiarism.The key...

متن کامل

تطابق اهداف ابتدایی و ساختار سازمانی فعلی در نظام ارائه مراقبت بهداشتی اولیه در ایران: مطالعه‌ی مروری نظام‌مند

Background and Aim: In recent years, the family physician plan has been implemented as a main strategy of the health system in Iran. Therefore, the necessity to reform organizational structure based on new goals and strategies is felt more than before. The aim of this study is to review and summarize all cases about Iran’s organizational structure and its challenges in primary healthcare system...

متن کامل

PaperFinder: A Tool for Scalable Search of Digital Libraries

The invention and spread of the World Wide Web made the process of paper publication significantly easier than before, and added a large repository of on-line (electronic) papers to our body of knowledge. To make the process of information gathering easier for scientists, Digital Libraries usually maintain Search Engines, which can be used to find papers about a specified topic. However, the us...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IJKDB

دوره 2  شماره 

صفحات  -

تاریخ انتشار 2011