Automatic Recognition of Topic-Classified Relations between Prostate Cancer and Genes from Medline Abstracts
نویسندگان
چکیده
To recognize instances of medical information concerning prostate cancer and its relevant genes, we developed a machine learning-based relation recognizer using rich contextual features. We collected prostate cancer-related abstracts from Medline. We then constructed an annotated corpus of prostate cancer and gene relations, which consisted of six topic − classified categories, with more detailed information describing the type of prostate cancer and gene relation. The corpus was made with the help of biologists and a disease and gene dictionarybased name recognition technique. The process of dictionary-based name recognition generates disease-gene pairs that become candidates for biomedically related pairs. Since dictionary matching tends to over-generate candidates, we used a machine learning-based named entity recognition method (1) to provide a feature for each candidate, and (2) to filter out overgenerated candidates. Experimental results showed that using a maximum entropy-based relation recognition method and a maximum entropybased named entity recognition method together greatly improves precision at the cost of a small reduction in recall for the topic-classified relations.
منابع مشابه
A Database for Automatic Persian Speech Emotion Recognition: Collection, Processing and Evaluation
Abstract Recent developments in robotics automation have motivated researchers to improve the efficiency of interactive systems by making a natural man-machine interaction. Since speech is the most popular method of communication, recognizing human emotions from speech signal becomes a challenging research topic known as Speech Emotion Recognition (SER). In this study, we propose a Persian em...
متن کاملRecognizing Gene and Protein Function in MEDLINE Abstracts
Identification of genes and proteins that affect biological function in humans and other organisms is a critical step in the discovery of new medicinal therapies. Automatic recognition of MEDLINE abstracts that describe gene/protein function would be of tremendous benefit to researchers in industry, government, and academia. Our approach uses simple syntax and domain semantics to both identify ...
متن کاملAutomatic Prostate Cancer Segmentation Using Kinetic Analysis in Dynamic Contrast-Enhanced MRI
Background: Dynamic contrast enhanced magnetic resonance imaging (DCE-MRI) provides functional information on the microcirculation in tissues by analyzing the enhancement kinetics which can be used as biomarkers for prostate lesions detection and characterization.Objective: The purpose of this study is to investigate spatiotemporal patterns of tumors by extracting semi-quantitative as well as w...
متن کاملEvaluation of gene expression level of GPRC6A, E.cadherin and ZEB1 in prostate cancer in comparison to benign tissues
Background: Prostate cancer is currently the third malignant disease in Iran and fifth common cancer worldwide. The aim of this study was to determine the expression of GPRC6A, E.cadherin, and ZEB1 genes in prostate cancer in comparison with benign tumor. Since early detection of cancer plays an important role in treatment, this study aims to identify the role of GPRC6A, E.cadherin and ZEB1 gen...
متن کاملComparison of dosimetric and radiobiological effects of various IMRT techniques regarding to joint volume between target tissue and organs at risk in prostate cancer patients
Background: Intensity-modulated radiotherapy (IMRT) is one of the most usable methods in prostate radiotherapy that is used with different techniques. The aim of this study was to evaluate and compare the dosimetric and radiobiological effects of prostate IMRT techniques regarding to joint volume between the target tissue and organs at risk as a patients anatomical parameter. Methods: This res...
متن کامل