Integrating Multiple On-line Knowledge Bases for Disease-Lab Test Relation Extraction
نویسندگان
چکیده
A computable knowledge base containing relations between diseases and lab tests would be a great resource for many biomedical informatics applications. This paper describes our initial step towards establishing a comprehensive knowledge base of disease and lab tests relations utilizing three public on-line resources. LabTestsOnline, MedlinePlus and Wikipedia are integrated to create a freely available, computable disease-lab test knowledgebase. Disease and lab test concepts are identified using MetaMap and relations between diseases and lab tests are determined based on source-specific rules. Experimental results demonstrate a high precision for relation extraction, with Wikipedia achieving the highest precision of 87%. Combining the three sources reached a recall of 51.40%, when compared with a subset of disease-lab test relations extracted from a reference book. Moreover, we found additional disease-lab test relations from on-line resources, indicating they are complementary to existing reference books for building a comprehensive disease and lab test relation knowledge base.
منابع مشابه
Ontology-based Normalization for Disease-Lab test Relation Extraction
This poster describes our preliminary work on ontology-based normalization for diseases and lab tests, as a fundamental step toward disease-lab test relation extraction. Multiple ontologies are leveraged for this aim. Specifically, diseases and lab tests are first extracted and mapped to the Concept Unique Identifier (CUI) of the Unified Medical Language System (UMLS) by MetaMap. Codes of Inter...
متن کاملPairwise Tensor Factorization for learning new facts in Knowledge Bases
Knowledge bases provide with the benefit of organizing knowledge in the relational form but suffer from incompleteness of new entities and relationships. Prior work on relation extraction has been focused on supervised learning techniques which are quite expensive. An alternative approach based on distant supervision has been of significant interest where one aligns database records with senten...
متن کاملKnowledge Base Unification via Sense Embeddings and Disambiguation
We present KB-UNIFY, a novel approach for integrating the output of different Open Information Extraction systems into a single unified and fully disambiguated knowledge repository. KB-UNIFY consists of three main steps: (1) disambiguation of relation argument pairs via a sensebased vector representation and a large unified sense inventory; (2) ranking of semantic relations according to their d...
متن کاملDistantly supervised Web relation extraction for knowledge base population
Extracting information from Web pages for populating large, cross-domain knowledge bases requires methods which are suitable across domains, do not require manual effort to adapt to new domains, are able to deal with noise, and integrate information extracted from different Web pages. Recent approaches have used existing knowledge bases to learn to extract information with promising results, on...
متن کاملCORE: Context-Aware Open Relation Extraction with Factorization Machines
We propose CORE, a novel matrix factorization model that leverages contextual information for open relation extraction. Our model is based on factorization machines and integrates facts from various sources, such as knowledge bases or open information extractors, as well as the context in which these facts have been observed. We argue that integrating contextual information—such as metadata abo...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
دوره 2015 شماره
صفحات -
تاریخ انتشار 2015