Large-Scale Semantic Indexing of Biomedical Publications
نویسندگان
چکیده
Automated annotation of scientific publications in real-world digital libraries requires dealing with challenges such as large number of concepts and training examples, multi-label training examples and hierarchical structure of concepts. BioASQ is a European project that contributes a large-scale biomedical publications corpus for working on these challenges. This paper documents the participation of our team to the large-scale biomedical semantic indexing task of BioASQ.
منابع مشابه
Large-scale Semantic Indexing with Biomedical Ontologies
We introduce PubTator, a web-based application that enables large-scale semantic indexing and automatic concept recognition in biomedical ontologies. Not only was PubTator formally evaluated and top-rated in BioCreative, it also has been widely adopted and used by the scientific community from around the world, supporting both research projects and real-world applications in biocuration, crowds...
متن کاملDeepMeSH: deep semantic representation for improving large-scale MeSH indexing
MOTIVATION Medical Subject Headings (MeSH) indexing, which is to assign a set of MeSH main headings to citations, is crucial for many important tasks in biomedical text mining and information retrieval. Large-scale MeSH indexing has two challenging aspects: the citation side and MeSH side. For the citation side, all existing methods, including Medical Text Indexer (MTI) by National Library of M...
متن کاملBioASQ: A Challenge on Large-Scale Biomedical Semantic Indexing and Question Answering
This article provides an overview of BIOASQ, a new competition on biomedical semantic indexing and question answering (QA). BIOASQ aims to push towards systems that will allow biomedical workers to express their information needs in natural language and that will return concise and user-understandable answers by combining information from multiple sources of different kinds, including biomedica...
متن کاملGenre-Based Search through Biomedical Images
We exploit the retrieval of visual information from biomedical scientific publication databases. Therefore, we consider the use of domain specific genres to automatically subdivide large image databases into smaller, consistent parts. Combination with Latent Semantic Indexing on the picture captions allows for efficient retrieval of images in specific categories. We demonstrate our approach on ...
متن کاملLarge-scale online semantic indexing of biomedical articles via an ensemble of multi-label classification models
BACKGROUND In this paper we present the approach that we employed to deal with large scale multi-label semantic indexing of biomedical papers. This work was mainly implemented within the context of the BioASQ challenge (2013-2017), a challenge concerned with biomedical semantic indexing and question answering. METHODS Our main contribution is a MUlti-Label Ensemble method (MULE) that incorpor...
متن کامل